Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance of neural networks on some image classification tasks. These results raise the question of whether neural networks only learn successfully if kernels also learn successfully, despite being the more expressive function class. Here, we show that two-layer neural networks with only a few neurons achieve near-optimal performance on high-dimensional Gaussian mixture classification while lazy training approaches such as random features and kernel methods do not. Our analysis is based on the derivation of a set of ordinary differential equations that exactly track the dynamics of the network and thus allow to extract the asymptotic performance of the network as a function of regularisation or signal-to-noise ratio. We also show how over-parametrising the neural network leads to faster convergence, but does not improve its final performance.

Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

Graph Chatbot

Chattez avec Graph Search

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Random matrix methods for high-dimensional machine learning models

Task-driven neural network models predict neural dynamics of proprioception: Experimental data, activations and predictions of neural network models

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Random matrix methods for high-dimensional machine learning models

Task-driven neural network models predict neural dynamics of proprioception: Experimental data, activations and predictions of neural network models