Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
Organisms use rewards to navigate and adapt to (uncertain) environments. Error-based learning about rewards issupported by the dopaminergic system, which is thought to signal reward prediction errors to make adjustmentsto past predictions. More recently, the phasic dopamine response was suggested to have two components: thefirstrapid component is thought to signal the detection of a potentially rewarding stimulus; the second, slightly latercomponent characterizes the stimulus by its reward prediction error. Error-based learning signals have also beenfound for risk. However, whether the neural generators of these signals employ a two-component coding schemelike the dopaminergic system is unknown. Here, using human high density EEG, we ask whether risk learning, ormore generally speaking surprise-based learning under uncertainty, is similarly comprised of two temporallydissociable components. Using a simple card game, we show that the risk prediction error is reflected in theamplitude of the P3b component. This P3b modulation is preceded by an earlier component, that is modulated bythe stimulus salience. Source analyses are compatible with the idea that both the early salience signal and the laterrisk prediction error signal are generated in insular, frontal, and temporal cortex. The identified sources are partsof the risk processing network that receives input from noradrenergic cells in the locus coeruleus. Finally, the P3bamplitude modulation is mirrored by an analogous modulation of pupil size, which is consistent with the idea thatboth the P3b and pupil size indirectly reflect locus coeruleus activity.
Maria del Carmen Sandi Perez, Elias Georges Gebara, Ana del Rocio Conde Moro