Announcement_4

I will present our recent work Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty (paper, poster) at the SCIS workshop at ICML 2022.




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Derivatives through a batch norm layer
  • What is the empirical Fisher ?
  • How to compute the Fisher of a conditional when applying natural gradient to neural networks?
  • The algebra of second order methods in neural networks
  • Demystifying Natural Neural Networks