Publication

Surprise-based model estimation in reinforcement learning: algorithms and brain signatures

Related publications (94)

About
Privacy
Disclaimer

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Surprise-based model estimation in reinforcement learning: algorithms and brain signatures

Graph Chatbot

Chat with Graph Search

Stress, noradrenaline, and realistic prediction of mouse behaviour using reinforcement learning

Novelty of Behaviour as a Basis for the Neuro-evolution of Operant Reward Learning

Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning

Rollout sampling approximate policy iteration

Adding prediction risk to the theory of reward learning

Adding prediction risk to the theory of reward learning

Ensembles for sequence learning

Effects of stress and genotype on meta-parameter dynamics in reinforcement learning

The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans

Ensembles for Sequence Learning

Adding prediction risk to the theory of reward learning

Ensembles for Sequence Learning

Novelty of Behaviour as a Basis for the Neuro-evolution of Operant Reward Learning

Stress, noradrenaline, and realistic prediction of mouse behaviour using reinforcement learning

Rollout sampling approximate policy iteration

Effects of stress and genotype on meta-parameter dynamics in reinforcement learning

The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans

Adding prediction risk to the theory of reward learning

Ensembles for sequence learning

Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning