Masksembles for Uncertainty Estimation

Deep neural networks have amply demonstrated their prowess but estimating the reliability of their predictions remains challenging. Deep Ensembles are widely considered as being one of the best methods for generating uncertainty estimates but are very expensive to train and evaluate. MC-Dropout is another popular alternative, which is less expensive, but also less reliable. Our central intuition is that there is a continuous spectrum of ensemble-like models of which MC-Dropout and Deep Ensembles are extreme examples. The first uses effectively infinite number of highly correlated models while the second relies on a finite number of independent models. To combine the benefits of both, we introduce Masksembles. Instead of randomly dropping parts of the network as in MC-dropout, Masksemble relies on a fixed number of binary masks, which are parameterized in a way that allows to change correlations between individual models. Namely, by controlling the overlap between the masks and their density one can choose the optimal configuration for the task at hand. This leads to a simple and easy to implement method with performance on par with Ensembles at a fraction of the cost. We experimentally validate Masksembles on two widely used datasets, CIFAR10 and ImageNet.

Masksembles for Uncertainty Estimation

Graph Chatbot

Chat with Graph Search

Fundamental Limits in Statistical Learning Problems: Block Models and Neural Networks

Deep Learning Generalization with Limited and Noisy Labels

Hamiltonian Deep Neural Networks Guaranteeing Non-Vanishing Gradients by Design

Fundamental Limits in Statistical Learning Problems: Block Models and Neural Networks

Deep Learning Generalization with Limited and Noisy Labels

Hamiltonian Deep Neural Networks Guaranteeing Non-Vanishing Gradients by Design