Matthieu Wyart, Mario Geiger, Stefano Spigler, Arthur Jacot
Two distinct limits for deep learning have been derived as the network width h -> infinity, depending on how the weights of the last layer scale with h. In the neural tangent kernel (NTK) limit, the dynamics becomes linear in the weights and is described b ...
IOP Publishing Ltd, 2020