Dopamine signals for reward value and risk: basic and recent data

被引:450
|
作者
Schultz, Wolfram [1 ]
机构
[1] Univ Cambridge, Dept Physiol Dev & Neurosci, Cambridge CB2 3DY, England
来源
基金
瑞士国家科学基金会; 英国惠康基金;
关键词
DELAYED-RESPONSE TASK; MIDBRAIN DOPAMINE; SUBSTANTIA-NIGRA; NEURONS ENCODE; NUCLEUS-ACCUMBENS; AVERSIVE STIMULI; MONKEY MIDBRAIN; CONDITIONED INHIBITION; BEHAVIORAL REACTIONS; SYNAPTIC PLASTICITY;
D O I
10.1186/1744-9081-6-24
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Background: Previous lesion, electrical self-stimulation and drug addiction studies suggest that the midbrain dopamine systems are parts of the reward system of the brain. This review provides an updated overview about the basic signals of dopamine neurons to environmental stimuli. Methods: The described experiments used standard behavioral and neurophysiological methods to record the activity of single dopamine neurons in awake monkeys during specific behavioral tasks. Results: Dopamine neurons show phasic activations to external stimuli. The signal reflects reward, physical salience, risk and punishment, in descending order of fractions of responding neurons. Expected reward value is a key decision variable for economic choices. The reward response codes reward value, probability and their summed product, expected value. The neurons code reward value as it differs from prediction, thus fulfilling the basic requirement for a bidirectional prediction error teaching signal postulated by learning theory. This response is scaled in units of standard deviation. By contrast, relatively few dopamine neurons show the phasic activation following punishers and conditioned aversive stimuli, suggesting a lack of relationship of the reward response to general attention and arousal. Large proportions of dopamine neurons are also activated by intense, physically salient stimuli. This response is enhanced when the stimuli are novel; it appears to be distinct from the reward value signal. Dopamine neurons show also unspecific activations to non-rewarding stimuli that are possibly due to generalization by similar stimuli and pseudoconditioning by primary rewards. These activations are shorter than reward responses and are often followed by depression of activity. A separate, slower dopamine signal informs about risk, another important decision variable. The prediction error response occurs only with reward; it is scaled by the risk of predicted reward. Conclusions: Neurophysiological studies reveal phasic dopamine signals that transmit information related predominantly but not exclusively to reward. Although not being entirely homogeneous, the dopamine signal is more restricted and stereotyped than neuronal activity in most other brain structures involved in goal directed behavior.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] The effect of effort on reward prediction error signals in midbrain dopamine neurons
    Tanaka, Shingo
    Taylor, Jessica E.
    Sakagami, Masamichi
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2021, 41 : 152 - 159
  • [32] Two Dimensions of Value: Dopamine Neurons Represent Reward But Not Aversiveness
    Fiorillo, Christopher D.
    SCIENCE, 2013, 341 (6145) : 546 - 549
  • [33] TRANSIENT ACTIVATION OF MIDBRAIN DOPAMINE NEURONS BY REWARD RISK
    Fiorillo, C. D.
    NEUROSCIENCE, 2011, 197 : 162 - 171
  • [34] Dopamine signals as temporal difference errors: recent advances
    Starkweather, Clara Kwon
    Uchida, Naoshige
    CURRENT OPINION IN NEUROBIOLOGY, 2021, 67 : 95 - 105
  • [35] Blunted Expected Reward Value Signals in Binge Alcohol Drinkers
    Tolomeo, Serenella
    Baldacchino, Alex
    Steele, J. Douglas
    JOURNAL OF NEUROSCIENCE, 2023, 43 (31): : 5685 - 5692
  • [36] NEURONAL REWARD AND DECISION SIGNALS: FROM THEORIES TO DATA
    Schultz, Wolfram
    PHYSIOLOGICAL REVIEWS, 2015, 95 (03) : 853 - 951
  • [37] Dopamine signals threat-coping behaviour in threat-reward conflicts
    Rogers, Jake
    NATURE REVIEWS NEUROSCIENCE, 2025, : 246 - 246
  • [38] Surprise! Dopamine signals mix action, value and error
    Anne G E Collins
    Michael J Frank
    Nature Neuroscience, 2016, 19 : 3 - 5
  • [39] The timing of action determines reward prediction signals in identified midbrain dopamine neurons
    Coddington, Luke T.
    Dudman, Joshua T.
    NATURE NEUROSCIENCE, 2018, 21 (11) : 1563 - +
  • [40] Surprise! Dopamine signals mix action, value and error
    Collins, Anne G. E.
    Frank, Michael J.
    NATURE NEUROSCIENCE, 2016, 19 (01) : 3 - 5