Publication

What to Choose Next? A Paradigm for Testing Human Sequential Decision Making

Related publications (33)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Human and Machine Learning in Non-Markovian Decision Making

Michael Herzog, Aaron Michael Clarke, Elisa Tartaglia, Silvia Marchesotti, Walter Senn

Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), where a series of rewarded decisions must be made, is a particularly important type of learning. Computational and behavioral studies of RL have focused mainly on Ma ...

Public Library of Science2015

Models of Reward-Modulated Spike-Timing-Dependent Plasticity

Nicolas Frémaux

How do animals learn to repeat behaviors that lead to the obtention of food or other “rewarding” objects? As a biologically plausible paradigm for learning in spiking neural networks, spike-timing dependent plasticity (STDP) has been shown to perform well ...

EPFL2013

Sparse reward processes

Christos Dimitrakakis

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Cons ...

arxiv2012

Robust Bayesian reinforcement learning through tight lower bounds

Christos Dimitrakakis

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinforcement learning problems. While utility bounds are known to exist for this ...

2011

Selfish Response to Epidemic Propagation

Jean-Yves Le Boudec, Georgios Theodorakopoulos

An epidemic spreading in a network calls for a decision on the part of the network members: They should decide whether to protect themselves or not. Their decision depends on the trade off between their perceived risk of being infected and the cost of bein ...

Ieee Service Center, 445 Hoes Lane, Po Box 1331, Piscataway, Nj 08855-1331 Usa2011

Robot Reinforcement Learning using EEG-based reward signals

Inaki Asier Iturrate Gil

Reinforcement learning algorithms have been successfully applied in robotics to learn how to solve tasks based on reward signals obtained during task execution. These reward signals are usually modeled by the programmer or provided by supervision. However, ...

2010

A simulation and behavioral study of decision and risk in blackjack

The purpose of this master project was to explore decision making process applied to a blackjack game and make the links with facets of impulsivity. The first part of this study goes through the mathematical of this game and presented the optimal policy, c ...

2009

Stimulus sampling as an exploration mechanism for fast reinforcement learning

Eleni Vasilaki, Walter Senn

Reinforcement learning in neural networks requires a mechanism for exploring new network states in response to a single, nonspecific reward signal. Existing models have introduced synaptic or neuronal noise to drive this exploration. However, those types o ...

Springer Verlag2009

Adding prediction risk to the theory of reward learning

This article analyzes the simple Rescorla-Wagner learning rule from the vantage point of least squares learning theory. In particular, it suggests how measures of risk, such as prediction risk, can be used to adjust the learning constant in reinforcement l ...

2007

Adding prediction risk to the theory of reward learning

2007