Phasic dopamine reinforces distinct striatal stimulus encoding in the olfactory tubercle driving dopaminergic reward prediction

被引:24
|
作者
Oettl, Lars-Lennart [1 ,5 ]
Scheller, Max [1 ]
Filosa, Carla [1 ,2 ]
Wieland, Sebastian [1 ]
Haag, Franziska [1 ]
Loeb, Cathrin [1 ]
Durstewitz, Daniel [3 ]
Shusterman, Roman [4 ]
Russo, Eleonora [3 ]
Kelsch, Wolfgang [1 ,2 ]
机构
[1] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Psychiat & Psychotherapy, D-68159 Mannheim, Germany
[2] Johannes Gutenberg Univ Mainz, Univ Med Ctr, Dept Psychiat & Psychotherapy, D-55131 Mainz, Germany
[3] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Theoret Neurosci, D-68159 Mannheim, Germany
[4] Univ Oregon, Inst Neurosci, Eugene, OR 97403 USA
[5] Sainsbury Wellcome Ctr Neural Circuits & Behav, London W1T 4JG, England
关键词
VENTRAL STRIATUM; NEURONS; REPRESENTATIONS; UNCERTAINTY; ACTIVATION; SIGNALS; MODELS; ROLES; PSYCHOSIS; BEHAVIOR;
D O I
10.1038/s41467-020-17257-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The learning of stimulus-outcome associations allows for predictions about the environment. Ventral striatum and dopaminergic midbrain neurons form a larger network for generating reward prediction signals from sensory cues. Yet, the network plasticity mechanisms to generate predictive signals in these distributed circuits have not been entirely clarified. Also, direct evidence of the underlying interregional assembly formation and information transfer is still missing. Here we show that phasic dopamine is sufficient to reinforce the distinctness of stimulus representations in the ventral striatum even in the absence of reward. Upon such reinforcement, striatal stimulus encoding gives rise to interregional assemblies that drive dopaminergic neurons during stimulus-outcome learning. These assemblies dynamically encode the predicted reward value of conditioned stimuli. Together, our data reveal that ventral striatal and midbrain reward networks form a reinforcing loop to generate reward prediction coding. It is not entirely understood how network plasticity produces the coding of predicted value during stimulus-outcome learning. Here, the authors reveal a reinforcing loop in distributed limbic circuits, transforming sensory stimuli into reward prediction coding broadcasted by dopamine neurons to the brain.
引用
收藏
页数:14
相关论文
共 3 条
  • [1] Phasic dopamine reinforces distinct striatal stimulus encoding in the olfactory tubercle driving dopaminergic reward prediction
    Lars-Lennart Oettl
    Max Scheller
    Carla Filosa
    Sebastian Wieland
    Franziska Haag
    Cathrin Loeb
    Daniel Durstewitz
    Roman Shusterman
    Eleonora Russo
    Wolfgang Kelsch
    Nature Communications, 11
  • [2] Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: A simulated robotic study
    Mirolli, Marco
    Santucci, Vieri G.
    Baldassarre, Gianluca
    NEURAL NETWORKS, 2013, 39 : 40 - 51
  • [3] Hemispheric Asymmetries in Striatal Reward Responses Relate to Approach-Avoidance Learning and Encoding of Positive-Negative Prediction Errors in Dopaminergic Midbrain Regions
    Aberg, Kristoffer Carl
    Doell, Kimberly C.
    Schwartz, Sophie
    JOURNAL OF NEUROSCIENCE, 2015, 35 (43): : 14491 - 14500