TTT++: When Does Self-Supervised Test-Time Training Fail or Thrive?

Alexandre Massoud Alahi, Bastien Germain C. Van Delft, Taylor Ferdinand Mordan, Yuejiang Liu, Parth Ashit Kothari
2021
Conference paper

Abstract

Test-time training (TTT) through self-supervised learning (SSL) is an emerging paradigm for tackling distribution shifts. Despite encouraging results, it remains unclear when this approach thrives or fails. In this work, we first provide an in-depth look at its limitations and show that TTT can degrade, rather than improve, test-time performance in the presence of severe distribution shifts. To address this issue, we introduce a test-time feature alignment strategy that uses offline feature summarization and online moment matching, which regularizes adaptation without revisiting the training data. We further scale this strategy to the online setting through batch-queue decoupling, which enables robust moment estimates even with a limited batch size. Given aligned feature distributions, we then shed light on the strong potential of TTT by theoretically analyzing its performance after adaptation. This analysis motivates our use of more informative self-supervision, in the form of contrastive learning, for visual recognition problems. We empirically demonstrate that our modified version of test-time training, termed TTT++, outperforms state-of-the-art methods by significant margins on several benchmarks. Our results indicate that storing and exploiting extra information, in addition to model parameters, can be a promising direction towards robust test-time adaptation.
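
The feature alignment idea in the abstract (offline feature summarization, online moment matching, and a queue decoupled from the batch) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not give the exact loss, so the squared-difference form over means and covariances is an assumption, and the names `summarize`, `alignment_loss`, and `FeatureQueue` are hypothetical. Features are plain lists of floats, and no gradients are involved.

```python
from collections import deque


def summarize(features):
    """Offline step: return (mean, covariance) of equal-length feature vectors."""
    n, d = len(features), len(features[0])
    mean = [sum(f[j] for f in features) / n for j in range(d)]
    cov = [[sum((f[i] - mean[i]) * (f[j] - mean[j]) for f in features) / n
            for j in range(d)] for i in range(d)]
    return mean, cov


def alignment_loss(source_stats, target_features):
    """Online step: squared distance between stored and test-time moments.

    Assumption: the squared-difference form is illustrative only.
    """
    src_mean, src_cov = source_stats
    tgt_mean, tgt_cov = summarize(target_features)
    d = len(src_mean)
    mean_term = sum((src_mean[j] - tgt_mean[j]) ** 2 for j in range(d))
    cov_term = sum((src_cov[i][j] - tgt_cov[i][j]) ** 2
                   for i in range(d) for j in range(d))
    return mean_term + cov_term


class FeatureQueue:
    """Holds features from recent test batches so that moments are estimated
    from more samples than a single small batch provides."""

    def __init__(self, max_size):
        # deque with maxlen silently drops the oldest entries when full
        self.buffer = deque(maxlen=max_size)

    def extend_and_get(self, batch):
        self.buffer.extend(batch)
        return list(self.buffer)


# Summary computed once on training features; the training data itself
# is not needed afterwards.
train_features = [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]]
source_stats = summarize(train_features)

# At test time, pool the current (shifted) batch with queued features.
queue = FeatureQueue(max_size=8)
shifted_batch = [[1.0, 1.0], [3.0, 1.0]]
pooled = queue.extend_and_get(shifted_batch)
loss = alignment_loss(source_stats, pooled)
```

In a real adaptation loop this loss would be minimized with respect to the feature extractor's parameters alongside the self-supervised objective. The sketch only shows how the stored summary and the queue interact.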

Official source

https://infoscience.epfl.ch/record/296742?ln=en

Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

Few-shot Learning for Efficient and Effective Machine Learning Model Adaptation

Robust machine learning for neuroscientific inference