Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing

Cited by: 1

Authors:

Santo-Angles, A. [1,2,3,4]

Fuentes-Claramonte, P. [1,3]

Argila-Plaza, I. [1]

Guardiola-Ripoll, M. [1,3]

Almodovar-Paya, C. [1,3]

Munuera, J. [5]

McKenna, P. J. [1,3]

Pomarol-Clotet, E. [1,3]

Radua, J. [1,3,6,7,8]

Affiliations:

[1] FIDMAG Germanes Hosp Res Fdn, Carrer Antoni Pujades 38, Barcelona 08830, Spain

[2] Univ Barcelona, Barcelona, Spain

[3] Mental Hlth Res Networking Ctr CIBERSAM, Barcelona, Spain

[4] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates

[5] Hosp St Joan de Deu, Fundacio Recerca, Diagnost Imaging Dept, Barcelona, Spain

[6] Inst Invest Biomed August Pi i Sunyer IDIBAPS, Barcelona, Spain

[7] Karolinska Inst, Ctr Psychiat Res & Educ, Dept Clin Neurosci, Stockholm, Sweden

[8] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Psychosis Studies, London, England

Source:

BRAIN STRUCTURE & FUNCTION | 2021 / Vol. 226 / Issue 05

Keywords:

Reward prediction error; Fictive prediction error; Counterfactual; fMRI; Model fitting;

DOI:

10.1007/s00429-021-02270-3

Chinese Library Classification number:

R602 [Surgical pathology, anatomy]; R32 [Human morphology];

Discipline classification code:

100101;

Abstract:

Reward prediction error, the difference between the expected and obtained reward, is known to act as a reinforcement learning neural signal. In the current study, we propose a model fitting approach that combines behavioral and neural data to fit computational models of reinforcement learning. Briefly, we penalized subject-specific fitted parameters that moved too far from the group median, except when that deviation led to an improvement in the model's fit to neural responses. By means of a probabilistic monetary learning task and fMRI, we compared our approach with standard model fitting methods. Q-learning outperformed actor-critic at both the behavioral and neural levels, although the inclusion of neuroimaging data into model fitting improved the fit of actor-critic models. We observed both action-value and state-value prediction error signals in the striatum, while standard model fitting approaches failed to capture state-value signals. Finally, left ventral striatum correlated with reward prediction error while right ventral striatum correlated with fictive prediction error, suggesting a functional hemispheric asymmetry regarding prediction-error driven learning.
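
Illustrative note (not part of the published abstract): the quantities named above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' implementation. The function names, the learning rates, the definition of the fictive prediction error as the foregone outcome minus the unchosen option's value, and the quadratic form of the penalty are all hypothetical choices made for illustration. The sketch also omits the exception described in the abstract, under which the penalty is relaxed when a deviation improves the fit to neural responses.

```python
def q_learning_step(q_chosen, q_unchosen, reward, foregone_reward,
                    alpha=0.3, alpha_fictive=0.3):
    """One trial of a two-option Q-learning update (illustrative only).

    The reward prediction error follows the abstract's wording: obtained
    reward minus expected reward. The fictive prediction error is an
    ASSUMED definition: foregone outcome minus the unchosen option's value.
    """
    rpe = reward - q_chosen              # factual prediction error
    fpe = foregone_reward - q_unchosen   # counterfactual error (assumption)
    new_q_chosen = q_chosen + alpha * rpe
    new_q_unchosen = q_unchosen + alpha_fictive * fpe
    return new_q_chosen, new_q_unchosen, rpe, fpe


def penalized_loss(neg_log_lik, params, group_median, weight=1.0):
    """Behavioral misfit plus a penalty for drifting from the group median.

    The quadratic penalty and the `weight` parameter are hypothetical; the
    abstract does not specify the penalty's functional form.
    """
    penalty = sum((p - m) ** 2 for p, m in zip(params, group_median))
    return neg_log_lik + weight * penalty


if __name__ == "__main__":
    # A rewarded choice while the unchosen option would have paid nothing.
    print(q_learning_step(0.5, 0.2, reward=1.0, foregone_reward=0.0))
    # A subject whose learning rate sits slightly above the group median.
    print(penalized_loss(10.0, params=[0.4, 2.0], group_median=[0.3, 2.0]))
```

In this sketch a positive reward prediction error raises the value of the chosen option, while a negative fictive prediction error lowers the value of the unchosen one. The penalty term grows as a subject's parameters move away from the group median.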


Pages: 1553-1569

Page count: 17

