CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain. While this has been well studied for image classification, the literature on SDG object detection remains almost non-existent. To address the challenges of simultaneously learning robust object localization and representation, we propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts. We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss. Our experiments evidence the benefits of our approach, outperforming by 10% the only existing SDG object detection method, Single-DGOD [52], on their own diverse weather-driving benchmark.

CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Graph Chatbot

Chat with Graph Search

Land Cover Mapping From Multiple Complementary Experts Under Heavy Class Imbalance

Infusing structured knowledge priors in neural models for sample-efficient symbolic reasoning

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Land Cover Mapping From Multiple Complementary Experts Under Heavy Class Imbalance

Infusing structured knowledge priors in neural models for sample-efficient symbolic reasoning

Advancing Self-Supervised Deep Learning for 3D Scene Understanding