Reinforced Attention for Few-Shot Learning and Beyond

Few-shot learning aims to correctly recognize query samples from unseen classes given a limited number of support samples, often by relying on global embeddings of images. In this paper, we propose to equip the backbone network with an attention agent trained by reinforcement learning. A policy gradient algorithm trains the agent to adaptively localize the representative regions on feature maps over time. We further design a reward function based on predictions on held-out data, which helps the attention mechanism generalize better to unseen classes. Extensive experiments show that, with the help of the reinforced attention, our embedding network progressively generates a more discriminative representation in few-shot learning. Experiments on image classification further demonstrate the effectiveness of the proposed design.
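The idea of training an attention agent with a policy gradient and a held-out reward can be illustrated with a deliberately small toy. The sketch below is not the paper's architecture: it assumes a "feature map" of four scalar locations, a softmax policy with one learnable logit per location, a nearest-prototype classifier restricted to the attended location, and a plain REINFORCE update with a moving-average baseline. All names (`make_image`, `train_attention_agent`, the `informative` location) are hypothetical and chosen only for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_image(label, rng, num_locations=4, informative=2):
    """Toy feature map: one scalar per location; only one location carries class signal."""
    feats = [rng.gauss(0.0, 1.0) for _ in range(num_locations)]
    feats[informative] = (1.0 if label == 0 else -1.0) + rng.gauss(0.0, 0.1)
    return feats


def train_attention_agent(steps=3000, lr=0.1, seed=0, num_locations=4):
    """REINFORCE on a softmax policy that picks which location to attend to."""
    rng = random.Random(seed)
    logits = [0.0] * num_locations  # policy parameters (assumed: one per location)
    baseline = 0.0                  # moving-average reward, used to reduce variance
    for _ in range(steps):
        # One episode: a support image per class and one held-out query.
        support = [make_image(c, rng, num_locations) for c in (0, 1)]
        label = rng.randrange(2)
        query = make_image(label, rng, num_locations)

        # Sample an attention location from the current policy.
        probs = softmax(logits)
        action = rng.choices(range(num_locations), weights=probs)[0]

        # Nearest-prototype prediction using only the attended location.
        dists = [abs(query[action] - support[c][action]) for c in (0, 1)]
        pred = 0 if dists[0] <= dists[1] else 1

        # Reward comes from the prediction on the held-out query.
        reward = 1.0 if pred == label else 0.0
        advantage = reward - baseline

        # Policy gradient step: d log pi(a) / d logit_i = 1[i == a] - pi(i).
        for i in range(num_locations):
            grad_log_pi = (1.0 if i == action else 0.0) - probs[i]
            logits[i] += lr * advantage * grad_log_pi
        baseline += 0.05 * (reward - baseline)
    return softmax(logits)


probs = train_attention_agent()
```

In this toy, attending to the informative location yields a correct held-out prediction almost every time, while the other locations are at chance, so the policy should concentrate its probability mass on that location as training proceeds. A real system would replace the per-location logits with a network conditioned on the feature map.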

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Striatal Dopamine Signals and Reward Learning

Learning rich optical embeddings for privacy-preserving lensless image classification
