Publication

Reinforced Attention for Few-Shot Learning and Beyond

Tong Zhang, Weihao Li
2021
Conference paper
Abstract

Few-shot learning aims to correctly recognize query samples from unseen classes given a limited number of support samples, often by relying on global embeddings of images. In this paper, we propose to equip the backbone network with an attention agent, which is trained by reinforcement learning. The policy gradient algorithm is employed to train the agent towards adaptively localizing the representative regions on feature maps over time. We further design a reward function based on the prediction of the held-out data, thus helping the attention mechanism to generalize better across the unseen classes. The extensive experiments show, with the help of the reinforced attention, that our embedding network has the capability to progressively generate a more discriminative representation in few-shot learning. Moreover, experiments on the task of image classification also show the effectiveness of the proposed design.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related concepts (31)
Reinforcement
In reinforcement theory, it is argued that human behavior is a result of "contingent consequences" to human actions The publication pushes forward the idea that "you get what you reinforce" This means that behavior when given the right types of reinforcers can change employee behavior for the better and negative behavior can be weeded out. The model of self-regulation has three main aspects of human behavior, which are self-awareness, self-reflection, and self-regulation. Reinforcements traditionally align with self-regulation.
Backbone network
A backbone or core network is a part of a computer network which interconnects networks, providing a path for the exchange of information between different LANs or subnetworks. A backbone can tie together diverse networks in the same building, in different buildings in a campus environment, or over wide areas. Normally, the backbone's capacity is greater than the networks connected to it.
Experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs when a particular factor is manipulated. Experiments vary greatly in goal and scale but always rely on repeatable procedure and logical analysis of the results. There also exist natural experimental studies.
Show more
Related publications (33)

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Jesus Sanchez Rodriguez

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does n ...
PUBLIC LIBRARY SCIENCE2023

Striatal Dopamine Signals and Reward Learning

Carl Petersen, Sylvain Crochet, Yanqi Liu, Parviz Ghaderi, Mauro Pulin, Anthony Pierre Robert Renard, Christos Sourmpis, Pol Bech Vilaseca, Meriam Malekzadeh, Robin François Virginien Dard

We are constantly bombarded by sensory information and constantly making decisions on how to act. In order to optimally adapt behavior, we must judge which sequences of sensory inputs and actions lead to successful outcomes in specific circumstances. Neuro ...
Oxford2023

Learning rich optical embeddings for privacy-preserving lensless image classification

Martin Vetterli, Eric Bezzam, Matthieu Martin Jean-André Simeoni

By replacing the lens with a thin optical element, lensless imaging enables new applications and solutions beyond those supported by traditional camera design and post-processing, e.g. compact and lightweight form factors and visual privacy. The latter ari ...
2022
Show more

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.