Lénaïc Chizat
We consider the idealized setting of gradient flow on the population risk for infinitely wide two-layer ReLU neural networks (without bias), and study the effect of symmetries on the learned parameters and predictors. We first describe a general class of s ...
AMER INST MATHEMATICAL SCIENCES-AIMS2023