Adding prediction risk to the theory of reward learning
Related publications (52)
We study model-free learning methods for the output-feedback Linear Quadratic (LQ) control problem in the finite-horizon setting, subject to subspace constraints on the control policy. Subspace constraints naturally arise in the field of distributed control and present ...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navigation tasks suggests an internal learned map of the environment. More recently, map-based (or model-based) reinforcement learning has become a highly activ ...
This chapter presents an overview of learning approaches for the acquisition of controllers and movement skills in humanoid robots. The term learning control refers to the process of acquiring a control strategy to achieve a task. While the definition is i ...
Reinforcement learning is a type of supervised learning in which reward is sparse and delayed. For example, in chess, a series of moves is made until a sparse reward (win, loss) is issued, which makes it impossible to evaluate the value of a single move. Stil ...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While sequential decision-making has been extensively investigated in theory (e.g., by reinforcement learning models), there is no systematic experimental paradigm ...
Neuromorphic systems provide brain-inspired methods of computing. In a neuromorphic architecture, inputs are processed by a network of neurons receiving operands through synaptic interconnections, tuned in the process of learning. Neurons act simultaneousl ...
Our brain continuously self-organizes to construct and maintain an internal representation of the world based on the information arriving through sensory stimuli. Remarkably, cortical areas related to different sensory modalities appear to share the same f ...
Brain-inspired Hyperdimensional (HD) computing is a promising solution for energy-efficient classification. However, existing HD computing algorithms lack controllability over the training iterations, which often results in slow training or dive ...
Whether we prepare a coffee or navigate to a shop: in many tasks we make multiple decisions before reaching a goal. Learning such state-action sequences from sparse reward raises the problem of credit-assignment: which actions out of a long sequence should ...