Deep Learning

Explainability and Model Development

My research in deep learning started by addressing a seemingly simple yet profound question: can a neural network be schematized as a graph, and how does its functionality relate to the spectral properties of that graph? Remarkably, following this line, it becomes possible to rank neurons by relevance for Structural Pruning, and to reformulate the learning process through dynamical systems to build more adaptive recurrent architectures. More recently, my focus has shifted towards Graph Neural Networks (GNNs) in the context of molecular physics, a field where mathematical and physical intuition can mutually reinforce each other. This includes developing techniques to approximate the behavior of Message Passing Neural Networks (MPNNs) used in molecular potentials, as well as designing architectures tailored to the force matching problem, a central challenge in learning accurate and transferable interatomic force fields from highly accurate simulated data. I also have applied and more theoretical works in progress; follow me for new papers on the subject!
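As a toy illustration of that first question, the sketch below (all layer sizes and weights are placeholders, not taken from any of the papers) stacks the layer-to-layer weight matrices of a small feedforward network into a single adjacency matrix and inspects its spectrum:

# A minimal sketch: viewing a feedforward network as a weighted graph.
import numpy as np

rng = np.random.default_rng(0)
sizes = [8, 16, 4]                       # hypothetical layer widths
W = [rng.standard_normal((m, n)) / np.sqrt(n)
     for n, m in zip(sizes[:-1], sizes[1:])]

N = sum(sizes)                           # one node per neuron
A = np.zeros((N, N))                     # adjacency matrix of the network graph
offsets = np.cumsum([0] + sizes)
for k, Wk in enumerate(W):
    # edge from neuron j in layer k to neuron i in layer k+1 carries weight Wk[i, j]
    A[offsets[k + 1]:offsets[k + 2], offsets[k]:offsets[k + 1]] = Wk

# Spectrum of the symmetrized graph; the spectral-learning papers instead
# train an eigendecomposition of the transfer operators directly.
spectrum = np.linalg.eigvalsh(A + A.T)
print("extremal eigenvalues:", spectrum[0], spectrum[-1])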

Related Publications

Peering inside the black box by learning the relevance of many-body functions in neural network potentials
This paper extends tools recently proposed in the nascent field of explainable artificial intelligence, such as Layer-wise Relevance Propagation, to coarse-grained potentials based on graph neural networks.
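For context, here is a minimal sketch of the epsilon-rule of Layer-wise Relevance Propagation for a single dense layer (illustrative only; the paper's contribution concerns graph-neural-network potentials, not this toy case):

# Epsilon-rule of LRP for one dense layer: `a` are the layer inputs,
# `W, b` its parameters, `R_out` the relevance arriving from above.
import numpy as np

def lrp_epsilon(a, W, b, R_out, eps=1e-6):
    z = W @ a + b                            # forward pre-activations
    z = z + eps * np.where(z >= 0, 1.0, -1.0)  # stabilizer avoids division by zero
    s = R_out / z                            # relevance per unit of pre-activation
    return a * (W.T @ s)                     # redistribute relevance to the inputs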
Deterministic versus stochastic dynamical classifiers: opposing random adversarial attacks with noise
This article compares deterministic and stochastic dynamical classifiers in the setting of random adversarial attacks, providing insights into how noise can be leveraged by such classifiers to mitigate adversarial threats.
Kernel shape renormalization explains output-output correlations in finite Bayesian one-hidden-layer networks
Finite-width one hidden layer networks display nontrivial output-output correlations that vanish in the lazy-training infinite-width limit. This manuscript rationalizes this evidence using kernel shape renormalization in the proportional limit of Bayesian deep learning.
Learning in Wilson-Cowan model for metapopulation
This research introduces a learning algorithm based on the Wilson-Cowan model for metapopulation, a neural mass network model that treats different subcortical regions of the brain as connected nodes. The model incorporates stable attractors into its dynamics, enabling it to solve various classification tasks. The algorithm is tested on datasets such as MNIST, Fashion MNIST, CIFAR-10, and TF-FLOWERS, as well as in combination with a transformer architecture (BERT) on IMDB, achieving high classification accuracy.
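A heavily simplified rate-model sketch in the spirit of Wilson-Cowan dynamics on a network (the coupling matrix, gain, and readout here are assumptions for illustration, not the paper's trained model):

# Relax Wilson-Cowan-style rate dynamics on a network of regions with
# forward Euler; the relaxed state can feed a classification readout.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def simulate(A, x0, inp, steps=200, dt=0.05, tau=1.0):
    # dx/dt = (-x + sigmoid(A @ x + inp)) / tau
    x = x0.copy()
    for _ in range(steps):
        x += dt * (-x + sigmoid(A @ x + inp)) / tau
    return x

rng = np.random.default_rng(1)
A = rng.standard_normal((32, 32)) / np.sqrt(32)   # hypothetical connectivity
x_final = simulate(A, np.zeros(32), rng.standard_normal(32))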
Topology shapes dynamics of higher-order networks
This research explores how higher-order interactions in complex systems influence the dynamics of topological signals, revealing new insights into the interplay between topology and dynamics.
Global topological Dirac synchronization
This research introduces Global Topological Dirac Synchronization, a state where oscillators associated with simplices and cells of arbitrary dimension, coupled by the Topological Dirac operator, operate in unison. The study combines algebraic topology, non-linear dynamics, and machine learning to derive the conditions for the existence and stability of this synchronization state.
Turing patterns on discrete topologies
This research explores Turing patterns on discrete topologies, extending the classical theory of pattern formation to networks and higher-order structures. The study highlights the potential of this approach to transcend the conventional boundaries of PDE-based methods, offering insights into self-organization phenomena across various disciplines.
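For orientation, the standard network formulation of the Turing mechanism (background material, not this paper's specific results): a two-species reaction-diffusion system on a network with Laplacian $L$ reads

$$\dot{x}_i = f(x_i, y_i) + D_x \sum_j L_{ij} x_j, \qquad \dot{y}_i = g(x_i, y_i) + D_y \sum_j L_{ij} y_j,$$

and linearizing around the homogeneous fixed point, projected onto each Laplacian eigenvalue $\Lambda^{(\alpha)}$, yields the modal Jacobian

$$J_\alpha = J_0 + \Lambda^{(\alpha)} \begin{pmatrix} D_x & 0 \\ 0 & D_y \end{pmatrix},$$

with patterns emerging when $\max_\alpha \operatorname{Re}\,\lambda(J_\alpha) > 0$ for some nonzero $\Lambda^{(\alpha)}$.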
How a student becomes a teacher: learning and forgetting through Spectral methods
This study explores the teacher-student paradigm in machine learning, focusing on overparameterized student networks trained by fixed teacher networks. It introduces a new optimization scheme using spectral representation of linear information transfer between layers. This approach allows identifying a stable student substructure that mirrors the teacher’s complexity. The method shows that pruning unimportant nodes, based on optimized eigenvalues, does not degrade performance, indicating a second-order phase transition with universality traits in neural network training.
A Bridge between Dynamical Systems and Machine Learning: Engineered Ordinary Differential Equations as Classification Algorithm (EODECA)
EODECAs, merging machine learning with dynamical systems, enhance interpretability and transparency in neural networks. They employ continuous ordinary differential equations, offering both high classification accuracy and an understanding of data processes, addressing the opacity of traditional deep learning models. This approach signifies a step towards more comprehensible machine learning models.
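A minimal sketch of the general idea behind ODE-based classification (the vector field, targets, and readout below are illustrative assumptions, not the paper's exact construction):

# Integrate a learnable ODE whose attractors encode the classes, then
# assign the class whose target attractor is closest to the final state.
import numpy as np

def classify(x0, W, targets, steps=500, dt=0.01):
    # dx/dt = -x + tanh(W @ x), integrated with forward Euler; a trained W
    # is engineered so trajectories relax towards class-specific attractors.
    x = x0.copy()
    for _ in range(steps):
        x += dt * (-x + np.tanh(W @ x))
    return int(np.argmin(np.linalg.norm(targets - x, axis=1)))

rng = np.random.default_rng(3)
W = rng.standard_normal((16, 16)) / 4.0
targets = rng.standard_normal((3, 16))   # hypothetical class attractors
label = classify(rng.standard_normal(16), W, targets)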
Complex Recurrent Spectral Network
The Complex Recurrent Spectral Network (C-RSN) is a novel AI model that more accurately mimics biological neural processes using localized non-linearity, complex eigenvalues, and separated memory/input functionalities. It demonstrates dynamic, oscillatory behavior akin to biological cognition and effectively classifies data, as shown in tests with the MNIST dataset.
Global topological synchronization on simplicial and cell complexes
This research explores the global synchronization of topological signals on higher-order networks, revealing that topological constraints impact synchronization differently across various network structures.
Non-parametric analysis of the Hubble Diagram with Neural Networks
This study introduces a neural network-based method for nonparametric analysis of the Hubble diagram, extended to high redshifts. Validated using simulated data, the method aligns with a flat Λ cold dark matter (ΛCDM) model (ΩM ≈ 0.3) up to z ≈ 1-1.5, but deviates at higher redshifts. It also suggests increasing ΩM values with redshift, indicating potential dark energy evolution.
Recurrent Spectral Network (RSN): Shaping a discrete map to reach automated classification
The Recurrent Spectral Network (RSN) is a new automated classification method that uses dynamical systems to direct data to specific targets, demonstrating effectiveness with both a simple model and a standard image processing dataset.
Diffusion-driven instability of topological signals coupled by the Dirac operator
This research examines reaction-diffusion processes on networks, particularly focusing on topological signals across nodes, links, and cells. It uses the Dirac operator to study interactions and reveals conditions for Turing pattern emergence, validating the findings on network models and square lattices.
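As background (standard definitions from the topological-signals literature): on a network with node-link incidence matrix $B$, node signals $u$ and link signals $v$ are coupled by the Dirac operator

$$D = \begin{pmatrix} 0 & B \\ B^{\top} & 0 \end{pmatrix}, \qquad D \begin{pmatrix} u \\ v \end{pmatrix} = \begin{pmatrix} B v \\ B^{\top} u \end{pmatrix},$$

whose square $D^2 = \begin{pmatrix} B B^{\top} & 0 \\ 0 & B^{\top} B \end{pmatrix}$ recovers the Hodge Laplacians acting separately on nodes and links.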
Spectral pruning of fully connected layers
Training neural networks in spectral space focuses on optimizing eigenvalues and eigenvectors instead of individual weights, exploiting an effective implicit bias that enables node pruning without sacrificing performance.
Machine learning in spectral domain
This work introduces a new method for training deep neural networks by focusing on the spectral space, rather than the traditional node space. It involves adjusting the eigenvalues and eigenvectors of transfer operators, offering improved performance over standard methods with an equivalent number of parameters.
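A minimal sketch of such a spectral parametrization (illustrative; the papers use a specific block structure for the eigenvector matrix, here Phi is a generic invertible matrix):

# Parametrize a layer's transfer operator by eigenvalues and eigenvectors;
# eigenvalue magnitude then ranks node relevance for pruning.
import numpy as np

rng = np.random.default_rng(2)
n = 64
Phi = np.eye(n) + 0.1 * rng.standard_normal((n, n))   # eigenvectors (trainable)
lam = rng.standard_normal(n)                          # eigenvalues (trainable)

W = Phi @ np.diag(lam) @ np.linalg.inv(Phi)           # layer transfer operator

# Small |lambda| marks a weakly relevant node, a candidate for pruning.
order = np.argsort(np.abs(lam))
prunable = order[: n // 2]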
Mobility-based prediction of SARS-CoV-2 spreading
This paper analyzes the effectiveness of containment measures for SARS-CoV-2, using mobility data to gauge their impact. A deep learning model predicts virus spread scenarios in Italy, showing how these measures help flatten the infection curve and estimating the time required for their noticeable effects.
Training of sparse and dense deep neural networks: Fewer parameters, same performance
This study presents a variant of spectral learning for deep neural networks, where adjusting two sets of eigenvalues for each layer mapping significantly enhances network performance with fewer trainable parameters. This method, inspired by homeostatic plasticity, offers a computationally efficient alternative to conventional training, achieving comparable results with a simpler parameter setup. It also enables the creation of sparser networks with impressive classification abilities.