Announcement_0

I will give a talk titled "Optimization and generalization through the lens of the linearization of neural networks training dynamics" to Roger Grosse’s group at the Vector Institute in Toronto.



