Search Results - "Advani, Madhu S."
-
1
High-dimensional dynamics of generalization error in neural networks
Published in Neural networks (01-12-2020)“…We perform an analysis of the average generalization dynamics of large neural networks trained using gradient descent. We study the practically-relevant…”
Get full text
Journal Article -
2
Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning
Published in Molecular physics (17-11-2018)“…Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent (SGD) algorithm is widely…”
Get full text
Journal Article -
3
High-dimensional dynamics of generalization error in neural networks
Published 10-10-2017“…We perform an average case analysis of the generalization dynamics of large neural networks trained using gradient descent. We study the practically-relevant…”
Get full text
Journal Article -
4
Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup
Published 27-10-2019“…J. Stat. Mech. 2020 124010 & NeurIPS 2019 Deep neural networks achieve stellar generalisation even when they have enough parameters to easily fit all their…”
Get full text
Journal Article -
5
Generalisation dynamics of online learning in over-parameterised neural networks
Published 25-01-2019“…Presented at the ICML 2019 Workshop on Theoretical Physics for Deep Learning Deep neural networks achieve stellar generalisation on a variety of problems,…”
Get full text
Journal Article -
6
Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning
Published 05-03-2018“…Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent algorithm is widely used and…”
Get full text
Journal Article