Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

Weight sharing has become a de facto standard in neural architecture search because it enables the search to be done on commodity hardware. However, recent works have empirically shown a ranking disorder between the performance of stand-alone architectures and that of the corresponding shared-weight networks. This violates the main assumption of weight-sharing NAS algorithms, thus limiting their effectiveness. We tackle this issue by proposing a regularization term that aims to maximize the correlation between the performance rankings of the shared-weight network and that of the standalone architectures using a small set of landmark architectures. We incorporate our regularization term into three different NAS algorithms and show that it consistently improves performance across algorithms, search-spaces, and tasks.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

Graph Chatbot

Chat with Graph Search

Efficient local linearity regularization to overcome catastrophic overfitting

Robust NAS under adversarial training: benchmark, theory, and beyond

From Kernel Methods to Neural Networks: A Unifying Variational Formulation

Robust NAS under adversarial training: benchmark, theory, and beyond

Efficient local linearity regularization to overcome catastrophic overfitting

From Kernel Methods to Neural Networks: A Unifying Variational Formulation