Publication

Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making

Wulfram Gerstner, Johanni Michael Brea, Alireza Modirshanechi, Kerstin Preuschoff, Marco Philipp Lehmann, Vasiliki Liakoni
2022
Article

Résumé

Learning how to reach a reward over long series of actions is a remarkable capability of humans, and potentially guided by multiple parallel learning modules. Current brain imaging of learning modules is limited by (i) simple experimental paradigms, (ii) entanglement of brain signals of different learning modules, and (iii) a limited number of computational models considered as candidates for explaining behavior. Here, we address these three limitations and (i) introduce a complex sequential decision making task with surprising events that allows us to (ii) dissociate correlates of reward prediction errors from those of surprise in functional magnetic resonance imaging (fMRI); and (iii) we test behavior against a large repertoire of model-free, model-based, and hybrid reinforcement learning algorithms, including a novel surprise-modulated actor-critic algorithm. Surprise, derived from an approximate Bayesian approach for learning the world-model, is extracted in our algorithm from a state prediction error. Surprise is then used to modulate the learning rate of a model-free actor, which itself learns via the reward prediction error from model-free value estimation by the critic. We find that action choices are well explained by pure model-free policy gradient, but reaction times and neural data are not. We identify signatures of both model-free and surprise-based learning signals in blood oxygen level dependent (BOLD) responses, supporting the existence of multiple parallel learning modules in the brain. Our results extend previous fMRI findings to a multi-step setting and emphasize the role of policy gradient and surprise signalling in human learning.

Source officielle

https://infoscience.epfl.ch/record/291631?ln=fr

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Proximité ontologique

Information engineering

Apprentissage automatique: Réseau de neurones artificiels

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Wulfram Gerstner, Johanni Michael Brea, Alireza Modirshanechi, Kerstin Preuschoff, Marco Philipp Lehmann, Vasiliki Liakoni
2022
Article

Résumé

Source officielle

https://infoscience.epfl.ch/record/291631?ln=fr

À propos de ce résultat

Proximité ontologique

Information engineering

Apprentissage automatique: Réseau de neurones artificiels

Concepts associés (35)

Publications associées (59)

MOOCs associés (32)

Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making

Graph Chatbot

Chattez avec Graph Search

Computational models of intrinsic motivation for curiosity and creativity

Breaking the Curse of Dimensionality in Deep Neural Networks by Learning Invariant Representations

Fundamental Limits in Statistical Learning Problems: Block Models and Neural Networks

Fundamental Limits in Statistical Learning Problems: Block Models and Neural Networks

Computational models of intrinsic motivation for curiosity and creativity

Breaking the Curse of Dimensionality in Deep Neural Networks by Learning Invariant Representations