Search Results - "Advani, Madhu S."

  • Showing 1 - 6 results of 6
Refine Results
  1. 1

    High-dimensional dynamics of generalization error in neural networks by Advani, Madhu S., Saxe, Andrew M., Sompolinsky, Haim

    Published in Neural networks (01-12-2020)
    “…We perform an analysis of the average generalization dynamics of large neural networks trained using gradient descent. We study the practically-relevant…”
    Get full text
    Journal Article
  2. 2

    Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning by Zhang, Yao, Saxe, Andrew M., Advani, Madhu S., Lee, Alpha A.

    Published in Molecular physics (17-11-2018)
    “…Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent (SGD) algorithm is widely…”
    Get full text
    Journal Article
  3. 3

    High-dimensional dynamics of generalization error in neural networks by Advani, Madhu S, Saxe, Andrew M

    Published 10-10-2017
    “…We perform an average case analysis of the generalization dynamics of large neural networks trained using gradient descent. We study the practically-relevant…”
    Get full text
    Journal Article
  4. 4

    Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup by Goldt, Sebastian, Advani, Madhu S, Saxe, Andrew M, Krzakala, Florent, Zdeborová, Lenka

    Published 27-10-2019
    “…J. Stat. Mech. 2020 124010 & NeurIPS 2019 Deep neural networks achieve stellar generalisation even when they have enough parameters to easily fit all their…”
    Get full text
    Journal Article
  5. 5

    Generalisation dynamics of online learning in over-parameterised neural networks by Goldt, Sebastian, Advani, Madhu S, Saxe, Andrew M, Krzakala, Florent, Zdeborová, Lenka

    Published 25-01-2019
    “…Presented at the ICML 2019 Workshop on Theoretical Physics for Deep Learning Deep neural networks achieve stellar generalisation on a variety of problems,…”
    Get full text
    Journal Article
  6. 6

    Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning by Zhang, Yao, Saxe, Andrew M, Advani, Madhu S, Lee, Alpha A

    Published 05-03-2018
    “…Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent algorithm is widely used and…”
    Get full text
    Journal Article