What to Choose Next? A Paradigm for Testing Human Sequential Decision Making

Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While sequential decision-making has been extensively investigated in theory (e.g., by reinforcement learning models) there is no systematic experimental paradigm to test it. Here, we developed such a paradigm and investigated key components of reinforcement learning models: the eligibility trace (i.e., the memory trace of previous decision steps), the external reward, and the ability to exploit the statistics of the environment's structure (model-free vs. model-based mechanisms). We show that the eligibility trace decays not with sheer time, but rather with the number of discrete decision steps made by the participants. We further show that, unexpectedly, neither monetary rewards nor the environment's spatial regularity significantly modulate behavioral performance. Finally, we found that model-free learning algorithms describe human performance better than model-based algorithms.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

What to Choose Next? A Paradigm for Testing Human Sequential Decision Making

Graph Chatbot

Chat with Graph Search

End-to-End Learning for Stochastic Optimization: A Bayesian Perspective

Learning continuous-time working memory tasks with on-policy neural reinforcement learning

Decision Learning and Adaptation Over Multi-Task Networks

Decision Learning and Adaptation Over Multi-Task Networks

End-to-End Learning for Stochastic Optimization: A Bayesian Perspective

Learning continuous-time working memory tasks with on-policy neural reinforcement learning