Phasic dopamine reinforces distinct striatal stimulus encoding in the olfactory tubercle driving dopaminergic reward prediction

Cited by: 24

Authors:

Oettl, Lars-Lennart [1,5]

Scheller, Max [1]

Filosa, Carla [1,2]

Wieland, Sebastian [1]

Haag, Franziska [1]

Loeb, Cathrin [1]

Durstewitz, Daniel [3]

Shusterman, Roman [4]

Russo, Eleonora [3]

Kelsch, Wolfgang [1,2]

Affiliations:

[1] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Psychiat & Psychotherapy, D-68159 Mannheim, Germany

[2] Johannes Gutenberg Univ Mainz, Univ Med Ctr, Dept Psychiat & Psychotherapy, D-55131 Mainz, Germany

[3] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Theoret Neurosci, D-68159 Mannheim, Germany

[4] Univ Oregon, Inst Neurosci, Eugene, OR 97403 USA

[5] Sainsbury Wellcome Ctr Neural Circuits & Behav, London W1T 4JG, England

Source:

NATURE COMMUNICATIONS | 2020 / Volume 11 / Issue 01

Keywords:

VENTRAL STRIATUM; NEURONS; REPRESENTATIONS; UNCERTAINTY; ACTIVATION; SIGNALS; MODELS; ROLES; PSYCHOSIS; BEHAVIOR;

DOI:

10.1038/s41467-020-17257-7

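Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];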

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

The learning of stimulus-outcome associations allows for predictions about the environment. Ventral striatum and dopaminergic midbrain neurons form a larger network for generating reward prediction signals from sensory cues. Yet, the network plasticity mechanisms to generate predictive signals in these distributed circuits have not been entirely clarified. Also, direct evidence of the underlying interregional assembly formation and information transfer is still missing. Here we show that phasic dopamine is sufficient to reinforce the distinctness of stimulus representations in the ventral striatum even in the absence of reward. Upon such reinforcement, striatal stimulus encoding gives rise to interregional assemblies that drive dopaminergic neurons during stimulus-outcome learning. These assemblies dynamically encode the predicted reward value of conditioned stimuli. Together, our data reveal that ventral striatal and midbrain reward networks form a reinforcing loop to generate reward prediction coding. It is not entirely understood how network plasticity produces the coding of predicted value during stimulus-outcome learning. Here, the authors reveal a reinforcing loop in distributed limbic circuits, transforming sensory stimuli into reward prediction coding broadcasted by dopamine neurons to the brain.

Pages: 14

3 items in total

[1] Phasic dopamine reinforces distinct striatal stimulus encoding in the olfactory tubercle driving dopaminergic reward prediction
Oettl, Lars-Lennart
Scheller, Max
Filosa, Carla
Wieland, Sebastian
Haag, Franziska
Loeb, Cathrin
Durstewitz, Daniel
Shusterman, Roman
Russo, Eleonora
Kelsch, Wolfgang
NATURE COMMUNICATIONS, 2020, 11 (1)
[2] Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: A simulated robotic study
Mirolli, Marco
Santucci, Vieri G.
Baldassarre, Gianluca
NEURAL NETWORKS, 2013, 39 : 40 - 51
[3] Hemispheric Asymmetries in Striatal Reward Responses Relate to Approach-Avoidance Learning and Encoding of Positive-Negative Prediction Errors in Dopaminergic Midbrain Regions
Aberg, Kristoffer Carl
Doell, Kimberly C.
Schwartz, Sophie
JOURNAL OF NEUROSCIENCE, 2015, 35 (43) : 14491 - 14500