Mathieu Salzmann, Shuxuan Guo
We introduce an approach to training a given compact network. To this end, we leverage over-parameterization, which typically improves both neural network optimization and generalization. Specifically, we propose to expand each linear layer of the compact ...
Curran Associates Inc.2020