The large capacity of neural networks enables them to learn complex functions. However, to avoid overfitting, they require large amounts of training data, which can be expensive and time-consuming to collect. A common practical way to mitigate overfitting is to apply network regularization techniques. We propose a novel regularization method that progressively penalizes the magnitude of activations during training. The combined activation signals produced by all neurons in a given layer form the representation of the input image in that feature space. We propose to regularize this representation in the last feature layer before the classification layers. We analyze our method's effect on generalization with label randomization tests and cumulative ablations. Experimental results show the advantages of our approach over commonly used regularizers on standard benchmark datasets.
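To make the idea concrete, the following is a minimal sketch of one plausible instantiation in Python (PyTorch): an L2 penalty on the activations of the last feature layer before the classifier, whose coefficient ramps up linearly over training so that the penalty is applied progressively. The penalty form, the schedule (`ramp`), and the coefficient `lam_max` are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn

class MLP(nn.Module):
    """Small classifier; `features` produces the last feature layer
    whose representation is regularized."""
    def __init__(self, in_dim=784, hidden=256, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.classifier = nn.Linear(hidden, n_classes)

    def forward(self, x):
        z = self.features(x)            # representation to be regularized
        return self.classifier(z), z

def activation_penalty(z):
    # Mean squared activation magnitude over the batch (one plausible
    # choice of penalty; the exact form is not specified in the abstract).
    return z.pow(2).sum(dim=1).mean()

def ramp(step, total_steps, lam_max=1e-4):
    # Hypothetical linear ramp: the coefficient grows from 0 to lam_max,
    # so the activation penalty is applied "progressively".
    return lam_max * min(step / total_steps, 1.0)

model = MLP()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()
total_steps = 1000

for step in range(total_steps):
    x = torch.randn(32, 784)            # stand-in batch
    y = torch.randint(0, 10, (32,))
    logits, z = model(x)
    loss = criterion(logits, y) + ramp(step, total_steps) * activation_penalty(z)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Note that, unlike weight decay, the penalty here acts on the layer's output representation rather than on the parameters, which matches the abstract's framing of regularizing the input's representation in feature space.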
Martin Jaggi, Vinitra Swamy, Jibril Albachir Frej, Julian Thomas Blackwell
Pascal Fua, Nikita Durasov, Doruk Oner, Minh Hieu Lê