
Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks
