Publication

Model-based reinforcement learning and navigation in animals and machines

Related publications (133)

Towards bio-hybrid systems made of social animals and robots

Francesco Mondada

For making artificial systems collaborate with group-living animals, the scientific challenge is to build artificial systems that can perceive, communicate to, interact with and adapt to animals. When such capabilities are available then it should be possi ...

Springer, 2013

Sparse reward processes

Christos Dimitrakakis

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Cons ...

arXiv, 2012

Lack of cyclin D2 impairing adult brain neurogenesis alters hippocampal-dependent behavioral tasks without reducing learning ability

Maria del Carmen Sandi Perez

The exact function of the adult brain neurogenesis remains elusive, although it has been suggested to play a role in learning and memory processes. In our studies, we employed cyclin D2 gene knockout (cD2 KO) mice showing impaired neurogenesis as well as d ...

2012

Perceptual Learning, Roving, and Synaptic Drift

Michael Herzog, Aaron Michael Clarke

Perceptual learning improves with most basic stimuli. Interestingly, performance does not improve when stimuli of two types are randomly presented during training (roving). For example, there is no perceptual learning when left or right bisection stimuli w ...

2012

Information Processing and Structure of Dynamical Networks

Ali Ajdari Rad

Networks are everywhere and we are confronted with many networks in our daily life. Networks such as Internet, World Wide Web, social, biological and economical networks have been subject to extensive studies in the last decade. The volume of publications ...

EPFL, 2011

Robust Bayesian reinforcement learning through tight lower bounds

Christos Dimitrakakis

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinforcement learning problems. While utility bounds are known to exist for this ...

2011

Preference elicitation and inverse reinforcement learning

Christos Dimitrakakis

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous work on Bayesian inverse reinforcement learning and allows us to obtain a pos ...

2011

Reaching Correlated Equilibria Through Multi-agent Learning

Boi Faltings, Ludek Cigler

Many games have undesirable Nash equilibria. For example, consider a resource allocation game in which two players compete for an exclusive access to a single resource. It has three Nash equilibria. The two pure-strategy NE are efficient, but not fair. ...

2011

Perceptual Learning, Roving and the Unsupervised Bias

Michael Herzog, Wulfram Gerstner, Aaron Michael Clarke

Perceptual learning is reward-based. A recent mathematical analysis showed that any reward-based learning system can learn two tasks only when the mean reward is identical for both tasks [Frémaux, Sprekeler and Gerstner, 2010, The Journal of Neuroscience, ...

2011

A Study of Spatial Reasoning Skills in Carpenters’ Training

Pierre Dillenbourg, Patrick Jermann, Sébastien Cuendet

One difficulty with the Swiss dual system is the gap between the practical work in the company and the theoretical teaching at school. In this article, we examine the case of carpenters. We observe that the school-workplace gap exists and materializes thro ...

2011