Lecture

NFNets: Removing BatchNorm for High-Performance Image Recognition

Description

This lecture discusses the challenges that Batch Normalization introduces in ResNets and presents NFNets, a family of networks trained without it. It reviews the benefits of BatchNorm and the drawbacks it introduces, then shows how NFNets address these issues by downscaling the residual branch and by using adaptive gradient clipping, explicit regularization, and Scaled Weight Standardization. The presentation covers the impact of these modifications on signal propagation, large-batch training, implicit regularization, and mean-shift elimination in ReLU networks. It concludes with the ImageNet results of NFNets, which show improved performance and efficiency compared to existing models.
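
As an illustration of one technique named above, here is a minimal sketch of adaptive gradient clipping in plain Python. It assumes the usual unit-wise formulation: each row of a weight matrix is one output unit, and the gradient row is rescaled whenever the ratio of its norm to the weight row's norm exceeds a threshold. The function names, the threshold `clip`, and the floor `eps` are illustrative choices, not values taken from the lecture.

```python
import math


def unit_norm(row):
    """Euclidean norm of one row (one output unit)."""
    return math.sqrt(sum(x * x for x in row))


def adaptive_grad_clip(weights, grads, clip=0.01, eps=1e-3):
    """Rescale each gradient row so that ||g_i|| <= clip * max(||w_i||, eps).

    weights, grads: lists of rows with identical shapes.
    clip, eps: hypothetical defaults chosen for this sketch.
    """
    clipped = []
    for w_row, g_row in zip(weights, grads):
        # Floor the weight norm so zero-initialised rows can still receive updates.
        w_norm = max(unit_norm(w_row), eps)
        g_norm = unit_norm(g_row)
        max_norm = clip * w_norm
        if g_norm > max_norm:
            # Shrink the gradient row onto the allowed norm, keeping its direction.
            scale = max_norm / g_norm
            clipped.append([g * scale for g in g_row])
        else:
            # Gradient is already small relative to the weights: leave it unchanged.
            clipped.append(list(g_row))
    return clipped


weights = [[3.0, 4.0], [0.6, 0.8]]  # row norms: 5.0 and 1.0
grads = [[0.03, 0.04], [0.3, 0.4]]  # row norms: 0.05 and 0.5
clipped = adaptive_grad_clip(weights, grads, clip=0.1)
```

In this example the first gradient row is left as it is because its norm is small relative to its weights, while the second is scaled down to a norm of about 0.1. Because the threshold is relative to each unit's weight norm, a single value of `clip` applies across layers of very different scale.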
