Reinforcement learning and dopamine in the striatum: A modeling perspective

被引:1
|
作者
Wanjerkhede, Shesharao M. [1 ]
Bapi, Raju S. [2 ]
Mytri, Vithal D. [3 ]
机构
[1] Guru Nanak Dev Engn Coll, Dept Comp Sci & Engn, Bidar, Karnataka, India
[2] Cent Univ Hyderabad, Dept Comp & Informat Sci, Hyderabad, Andhra Pradesh, India
[3] Guru Nanak Dev Engn Coll, Bidar, Karnataka, India
关键词
Actor-critic; Basal ganglia; Dopamine; LTP; LTD; PROTEIN-KINASE-II; BASAL GANGLIA; FRONTAL-CORTEX; DARPP-32; PHOSPHORYLATION; COINCIDENT ACTIVATION; COMPUTATIONAL MODELS; SYNAPTIC PLASTICITY; PREDICTION ERROR; NMDA RECEPTORS; WORKING-MEMORY;
D O I
10.1016/j.neucom.2013.02.061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent research evidences show that the dopamine (DA) system in the brain is involved in various functions like reward-related learning, exploration, preparation, and execution in goal directed behavior. It is suggested that dopaminergic neurons provide a prediction error akin to the error computed in the temporal difference learning (TDL) models of reinforcement learning (RL). Houk et al. (1995) [26] proposed a biochemical model in the spine head of neurons at the striatum in the basal ganglia which generates and uses neural signals to predict reinforcement. The model explains how the DA neurons are able to predict reinforcement and how the output from these neurons might then be used to reinforce the behaviors that lead to primary reinforcement. They proposed a scheme drawing that parallels between actor-critic architecture and dopamine activity in the basal ganglia. Houk et al. (1995) [26] also proposed a biochemical model of interactions between protein molecules which supports learning earlier predictions of reinforcement in the spine head of medium spiny neurons at the striatum. However, Houk's proposed cellular model fails to account for the time delay between the dopaminergic and glutamatergic activity required for reward-related learning and also fails to explain the 'eligibility trace' condition needed in delayed tasks of associative conditioning in which a memory trace of the antecedent signal is needed at the time of a succeeding reward. In this article, we review various models of RL with an emphasis on the cellular models of RL. In particular, we emphasize biochemical models of RL, and point out the future directions. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:27 / 40
页数:14
相关论文
共 50 条
  • [21] Generalized reinforcement learning model of dopamine activity
    Kaveri, Sivaramakrishnan
    Nakahara, Hiroyuki
    NEUROSCIENCE RESEARCH, 2008, 61 : S195 - S195
  • [22] Simulating the Effect of Reinforcement Learning on Neuronal Synchrony and Periodicity in the Striatum
    Helie, Sebastien
    Fleischer, Pierson J.
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2016, 10
  • [23] Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning
    Costa, Vincent D.
    Dal Monte, Olga
    Lucas, Daniel R.
    Murray, Elisabeth A.
    Averbeck, Bruno B.
    NEURON, 2016, 92 (02) : 505 - 517
  • [24] Dopamine release in human associative striatum during reversal learning
    Grill, Filip
    Guitart-Masip, Marc
    Johansson, Jarkko
    Stiernman, Lars
    Axelsson, Jan
    Nyberg, Lars
    Rieckmann, Anna
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [25] Dopamine release in human associative striatum during reversal learning
    Filip Grill
    Marc Guitart-Masip
    Jarkko Johansson
    Lars Stiernman
    Jan Axelsson
    Lars Nyberg
    Anna Rieckmann
    Nature Communications, 15 (1)
  • [26] Dissociable effects of dopamine on learning and performance within sensorimotor striatum
    Leventhal, Daniel K.
    Stoetzner, Colin R.
    Abraham, Rohit
    Pettibone, Jeff
    DeMarco, Kayla
    Berke, Joshua D.
    BASAL GANGLIA, 2014, 4 (02) : 43 - 54
  • [27] Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis
    Glimcher, Paul W.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 : 15647 - 15654
  • [28] Schizophrenia: A Computational Reinforcement Learning Perspective
    Frank, Michael J.
    SCHIZOPHRENIA BULLETIN, 2008, 34 (06) : 1008 - 1011
  • [29] An Optimistic Perspective on Offline Reinforcement Learning
    Agarwal, Rishabh
    Schuurmans, Dale
    Norouzi, Mohammad
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [30] Decoding trust: a reinforcement learning perspective
    Zheng, Guozhong
    Zhang, Jiqiang
    Zhang, Jing
    Cai, Weiran
    Chen, Li
    NEW JOURNAL OF PHYSICS, 2024, 26 (05):