Publication

On the Generalization of Stochastic Gradient Descent with Momentum