Search Results - "Malatesta, Enrico M."
-
1
The twin peaks of learning neural networks
Published in Machine learning: science and technology (01-06-2024)“…Recent works demonstrated the existence of a double-descent phenomenon for the generalization error of neural networks, where highly overparameterized models…”
Get full text
Journal Article -
2
Properties of the Geometry of Solutions and Capacity of Multilayer Neural Networks with Rectified Linear Unit Activations
Published in Physical review letters (25-10-2019)“…Rectified linear units (ReLUs) have become the main model for the neural units in current deep learning systems. This choice was originally suggested as a way…”
Get full text
Journal Article -
3
Unveiling the Structure of Wide Flat Minima in Neural Networks
Published in Physical review letters (31-12-2021)“…The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In…”
Get full text
Journal Article -
4
Star-Shaped Space of Solutions of the Spherical Negative Perceptron
Published in Physical review letters (01-12-2023)“…Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where…”
Get full text
Journal Article -
5
Two-loop corrections to large order behavior of φ4 theory
Published in Nuclear physics. B (01-09-2017)“…We consider the large order behavior of the perturbative expansion of the scalar φ4 field theory in terms of a perturbative expansion around an instanton…”
Get full text
Journal Article -
6
Two-loop corrections to large order behavior of φ 4 theory
Published in Nuclear physics. B (01-09-2017)Get full text
Journal Article -
7
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Published 17-09-2023“…In these pedagogic notes I review the statistical mechanics approach to neural networks, focusing on the paradigmatic example of the perceptron architecture…”
Get full text
Journal Article -
8
Properties of the geometry of solutions and capacity of multi-layer neural networks with Rectified Linear Units activations
Published 03-05-2024“…Phys. Rev. Lett. 123, 170602 (2019) Rectified Linear Units (ReLU) have become the main model for the neural units in current deep learning systems. This choice…”
Get full text
Journal Article -
9
Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks
Published 09-10-2024“…We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with…”
Get full text
Journal Article -
10
The twin peaks of learning neural networks
Published 23-01-2024“…Recent works demonstrated the existence of a double-descent phenomenon for the generalization error of neural networks, where highly overparameterized models…”
Get full text
Journal Article -
11
Random Combinatorial Optimization Problems: Mean Field and Finite-Dimensional Results
Published 01-02-2019“…This PhD thesis is organized as follows. In the first two chapters I will review some basic notions of statistical physics of disordered systems, such as…”
Get full text
Journal Article -
12
Typical and atypical solutions in non-convex neural networks with discrete and continuous weights
Published 26-04-2023“…We study the binary and continuous negative-margin perceptrons as simple non-convex neural network models learning random rules and associations. We analyze…”
Get full text
Journal Article -
13
Impact of dendritic non-linearities on the computational capabilities of neurons
Published 10-07-2024“…Multiple neurophysiological experiments have shown that dendritic non-linearities can have a strong influence on synaptic input integration. In this work we…”
Get full text
Journal Article -
14
Random Features Hopfield Networks generalize retrieval to previously unseen examples
Published 08-07-2024“…It has been recently shown that a learning transition happens when a Hopfield Network stores examples generated as superpositions of random features, where new…”
Get full text
Journal Article -
15
Instantons in $\phi^4$ Theories: Transseries, Virial Theorems and Numerical Aspects
Published 28-05-2024“…Phys.Rev.D 110 (2024) 036003 We discuss numerical aspects of instantons in two- and three-dimensional $\phi^4$ theories with an internal $O(N)$ symmetry group,…”
Get full text
Journal Article -
16
The star-shaped space of solutions of the spherical negative perceptron
Published 17-05-2023“…Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where…”
Get full text
Journal Article -
17
Fluctuations in the random-link matching problem
Published 30-08-2019“…Phys. Rev. E 100, 032102 (2019) Using the replica approach and the cavity method, we study the fluctuations of the optimal cost in the random-link matching…”
Get full text
Journal Article -
18
Wide flat minima and optimal generalization in classifying high-dimensional Gaussian mixtures
Published 17-11-2020“…We analyze the connection between minimizers with good generalizing properties and high local entropy regions of a threshold-linear classifier in Gaussian…”
Get full text
Journal Article -
19
Learning through atypical "phase transitions" in overparameterized neural networks
Published 11-06-2022“…Current deep neural networks are highly overparameterized (up to billions of connection weights) and nonlinear. Yet they can fit data almost perfectly through…”
Get full text
Journal Article -
20
Unveiling the structure of wide flat minima in neural networks
Published 14-02-2022“…The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In…”
Get full text
Journal Article