Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing

被引:1
|
作者
Santo-Angles, A. [1 ,2 ,3 ,4 ]
Fuentes-Claramonte, P. [1 ,3 ]
Argila-Plaza, I [1 ]
Guardiola-Ripoll, M. [1 ,3 ]
Almodovar-Paya, C. [1 ,3 ]
Munuera, J. [5 ]
McKenna, P. J. [1 ,3 ]
Pomarol-Clotet, E. [1 ,3 ]
Radua, J. [1 ,3 ,6 ,7 ,8 ]
机构
[1] FIDMAG Germanes Hosp Res Fdn, Carrer Antoni Pujades 38, Barcelona 08830, Spain
[2] Univ Barcelona, Barcelona, Spain
[3] Mental Hlth Res Networking Ctr CIBERSAM, Barcelona, Spain
[4] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
[5] Hosp St Joan de Deu, Fundacio Recerca, Diagnost Imaging Dept, Barcelona, Spain
[6] Inst Invest Biomed August Pi i Sunyer IDIBAPS, Barcelona, Spain
[7] Karolinska Inst, Ctr Psychiat Res & Educ, Dept Clin Neurosci, Stockholm, Sweden
[8] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Psychosis Studies, London, England
来源
BRAIN STRUCTURE & FUNCTION | 2021年 / 226卷 / 05期
关键词
Reward prediction error; Fictive prediction error; Counterfactual; fMRI; Model fitting;
D O I
10.1007/s00429-021-02270-3
中图分类号
R602 [外科病理学、解剖学]; R32 [人体形态学];
学科分类号
100101 ;
摘要
Reward prediction error, the difference between the expected and obtained reward, is known to act as a reinforcement learning neural signal. In the current study, we propose a model fitting approach that combines behavioral and neural data to fit computational models of reinforcement learning. Briefly, we penalized subject-specific fitted parameters that moved away too far from the group median, except when that deviation led to an improvement in the model's fit to neural responses. By means of a probabilistic monetary learning task and fMRI, we compared our approach with standard model fitting methods. Q-learning outperformed actor-critic at both behavioral and neural level, although the inclusion of neuroimaging data into model fitting improved the fit of actor-critic models. We observed both action-value and state-value prediction error signals in the striatum, while standard model fitting approaches failed to capture state-value signals. Finally, left ventral striatum correlated with reward prediction error while right ventral striatum with fictive prediction error, suggesting a functional hemispheric asymmetry regarding prediction-error driven learning.
引用
收藏
页码:1553 / 1569
页数:17
相关论文
共 50 条
  • [1] Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing
    A. Santo-Angles
    P. Fuentes-Claramonte
    I. Argila-Plaza
    M. Guardiola-Ripoll
    C. Almodóvar-Payá
    J. Munuera
    P. J. McKenna
    E. Pomarol-Clotet
    J. Radua
    Brain Structure and Function, 2021, 226 : 1553 - 1569
  • [2] Beta Oscillations in Monkey Striatum Encode Reward Prediction Error Signals
    Basanisi, Ruggero
    Marche, Kevin
    Combrisson, Etienne
    Apicella, Paul
    Brovelli, Andrea
    JOURNAL OF NEUROSCIENCE, 2023, 43 (18): : 3339 - 3352
  • [3] Cognitive Strategies Regulate Fictive, but not Reward Prediction Error Signals in a Sequential Investment Task
    Gu, Xiaosi
    Kirk, Ulrich
    Lohrenz, Terry M.
    Montague, P. Read
    HUMAN BRAIN MAPPING, 2014, 35 (08) : 3738 - 3749
  • [4] Subsecond dopamine fluctuations in human striatum encode superposed error signals about actual and counterfactual reward
    Kishida, Kenneth T.
    Saez, Ignacio
    Lohrenz, Terry
    Witcher, Mark R.
    Laxton, Adrian W.
    Tatter, Stephen B.
    White, Jason P.
    Ellis, Thomas L.
    Phillips, Paul E. M.
    Montague, P. Read
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (01) : 200 - 205
  • [5] Dissociable Reward and Timing Signals in Human Midbrain and Ventral Striatum
    Klein-Fluegge, Miriam C.
    Hunt, Laurence T.
    Bach, Dominik R.
    Dolan, Raymond J.
    Behrens, Timothy E. J.
    NEURON, 2011, 72 (04) : 654 - 664
  • [6] Activity in human ventral striatum locked to errors of reward prediction
    Pagnoni, G
    Zink, CF
    Montague, PR
    Berns, GS
    NATURE NEUROSCIENCE, 2002, 5 (02) : 97 - 98
  • [7] Activity in human ventral striatum locked to errors of reward prediction
    Giuseppe Pagnoni
    Caroline F. Zink
    P. Read Montague
    Gregory S. Berns
    Nature Neuroscience, 2002, 5 : 97 - 98
  • [8] A quantitative reward prediction error signal in the ventral pallidum
    Ottenheimer, David J.
    Bari, Bilal A.
    Sutlief, Elissa
    Fraser, Kurt M.
    Kim, Tabitha H.
    Richard, Jocelyn M.
    Cohen, Jeremiah Y.
    Janak, Patricia H.
    NATURE NEUROSCIENCE, 2020, 23 (10) : 1267 - +
  • [9] A quantitative reward prediction error signal in the ventral pallidum
    David J. Ottenheimer
    Bilal A. Bari
    Elissa Sutlief
    Kurt M. Fraser
    Tabitha H. Kim
    Jocelyn M. Richard
    Jeremiah Y. Cohen
    Patricia H. Janak
    Nature Neuroscience, 2020, 23 : 1267 - 1276
  • [10] Neuronal and oscillatory activity during reward processing in the human ventral striatum
    Lega, Bradley C.
    Kahana, Michael J.
    Jaggi, Jurg
    Baltuch, Gordon H.
    Zaghloul, Kareem
    NEUROREPORT, 2011, 22 (16) : 795 - 800