Explores the learning dynamics of deep neural networks using linear networks for analysis, covering two-layer and multi-layer networks, self-supervised learning, and benefits of decoupled initialization.
Explores Stochastic Gradient Descent and Mean Field Analysis in two-layer neural networks, emphasizing their iterative processes and mathematical foundations.