Reinforcement learning and dopamine in the striatum: A modeling perspective

Cited by: 1
Authors
Wanjerkhede, Shesharao M. [1]
Bapi, Raju S. [2]
Mytri, Vithal D. [3]
Affiliations
[1] Guru Nanak Dev Engn Coll, Dept Comp Sci & Engn, Bidar, Karnataka, India
[2] Cent Univ Hyderabad, Dept Comp & Informat Sci, Hyderabad, Andhra Pradesh, India
[3] Guru Nanak Dev Engn Coll, Bidar, Karnataka, India
Keywords
Actor-critic; Basal ganglia; Dopamine; LTP; LTD; PROTEIN-KINASE-II; BASAL GANGLIA; FRONTAL-CORTEX; DARPP-32; PHOSPHORYLATION; COINCIDENT ACTIVATION; COMPUTATIONAL MODELS; SYNAPTIC PLASTICITY; PREDICTION ERROR; NMDA RECEPTORS; WORKING-MEMORY;
DOI
10.1016/j.neucom.2013.02.061
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent evidence shows that the dopamine (DA) system in the brain is involved in various functions such as reward-related learning, exploration, preparation, and execution in goal-directed behavior. It has been suggested that dopaminergic neurons provide a prediction error akin to the error computed in temporal difference learning (TDL) models of reinforcement learning (RL). Houk et al. (1995) [26] proposed a biochemical model, located in the spine heads of striatal neurons in the basal ganglia, that generates and uses neural signals to predict reinforcement. The model explains how DA neurons are able to predict reinforcement and how the output from these neurons might then be used to reinforce the behaviors that lead to primary reinforcement. They proposed a scheme that draws parallels between the actor-critic architecture and dopamine activity in the basal ganglia. Houk et al. (1995) [26] also proposed a biochemical model of interactions between protein molecules that supports learning earlier predictions of reinforcement in the spine heads of medium spiny neurons in the striatum. However, Houk's proposed cellular model fails to account for the time delay between the dopaminergic and glutamatergic activity required for reward-related learning, and it also fails to explain the 'eligibility trace' condition needed in delayed tasks of associative conditioning, in which a memory trace of the antecedent signal is needed at the time of a succeeding reward. In this article, we review various models of RL with an emphasis on cellular models. In particular, we emphasize biochemical models of RL and point out future directions. (C) 2014 Elsevier B.V. All rights reserved.
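The prediction error and eligibility trace mentioned in the abstract can be illustrated with a generic textbook TD(lambda) critic. This is only a minimal sketch and not the model of Houk et al. or of the authors: the chain task (a cue followed by a delayed reward), the function name `td_lambda_chain`, and all parameter values are assumptions chosen for illustration.

```python
def td_lambda_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9, lam=0.8):
    """Tabular TD(lambda) critic on a hypothetical cue -> ... -> reward chain.

    State 0 stands for the cue; a reward of 1.0 arrives only on leaving the
    last state. Returns the learned values plus the per-step prediction
    errors from the first and the last episode.
    """
    values = [0.0] * n_states
    first_deltas, last_deltas = [], []
    for episode in range(episodes):
        trace = [0.0] * n_states  # eligibility trace, reset every episode
        deltas = []
        for s in range(n_states):
            terminal = s == n_states - 1
            reward = 1.0 if terminal else 0.0
            next_value = 0.0 if terminal else values[s + 1]
            # TD prediction error (the quantity likened to the DA signal)
            delta = reward + gamma * next_value - values[s]
            # Decay all traces, then mark the current state as eligible
            trace = [gamma * lam * e for e in trace]
            trace[s] += 1.0
            # Earlier states are updated in proportion to their decayed trace
            values = [v + alpha * delta * e for v, e in zip(values, trace)]
            deltas.append(delta)
        if episode == 0:
            first_deltas = deltas
        last_deltas = deltas
    return values, first_deltas, last_deltas


if __name__ == "__main__":
    values, first, last = td_lambda_chain()
    print("learned values:", [round(v, 3) for v in values])
    print("first-episode errors:", [round(d, 3) for d in first])
    print("last-episode errors:", [round(d, 3) for d in last])
```

In this sketch the trace is what lets the reward at the end of the chain credit the earlier cue state, which is the role the abstract assigns to a memory trace of the antecedent signal. Once the values are learned, the error at reward time shrinks because the reward has become predicted.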
Pages: 27-40
Number of pages: 14
Related papers
50 in total
  • [1] Modeling the sub-cellular signaling pathways involved in reinforcement learning at the striatum
    Wanjerkhede, Shesharao M.
    Bapi, Raju S.
    MODELS OF BRAIN AND MIND: PHYSICAL, COMPUTATIONAL AND PSYCHOLOGICAL APPROACHES, 2008, 168 : 193 - 206
  • [2] Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations
    Kim, Min Jung
    Gibson, Daniel J.
    Hu, Dan
    Yoshida, Tomoko
    Hueske, Emily
    Matsushima, Ayano
    Mahar, Ara
    Schofield, Cynthia J.
    Sompolpong, Patlapa
    Tran, Kathy T.
    Tian, Lin
    Graybiel, Ann M.
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [3] Nigrostriatal dopamine system may contribute to behavioral learning through providing reinforcement signals to the striatum
    Kimura, M
    Matsumoto, N
    EUROPEAN NEUROLOGY, 1997, 38 : 11 - 17
  • [4] Modeling influences of dopamine on synchronization behavior of striatum
    Cakir, Yueksel
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2017, 28 (01) : 28 - 52
  • [5] Modeling the Kinetic Diversity of Dopamine in the Dorsal Striatum
    Walters, Seth H.
    Robbins, Elaine M.
    Michael, Adrian C.
    ACS CHEMICAL NEUROSCIENCE, 2015, 6 (08): : 1468 - 1475
  • [6] A silent eligibility trace enables dopamine-dependent synaptic plasticity for reinforcement learning in the mouse striatum
    Shindou, Tomomi
    Shindou, Mayumi
    Watanabe, Sakurako
    Wickens, Jeffery
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2019, 49 (05) : 726 - 736
  • [7] Dopamine, Reinforcement Learning, and Addiction
    Dayan, P.
    PHARMACOPSYCHIATRY, 2009, 42 : S56 - S65
  • [8] Dopamine agonists and reinforcement learning in Parkinsonism
    Unknown
    NEUROSCIENTIST, 2005, 11 (04): : 269 - 269
  • [9] The Specific Role of Dopamine in the Striatum during Operant Learning
    Ivlieva N.Y.
    Ivliev D.A.
    Neuroscience and Behavioral Physiology, 2016, 46 (1) : 73 - 76
  • [10] Reinforcement learning in a spiking neural model of striatum plasticity
    Gonzalez-Redondo, Alvaro
    Garridoa, Jesus
    Arrabal, Francisco Naveros
    Kotaleski, Jeanette Hellgren
    Grillner, Sten
    Ros, Eduardo
    NEUROCOMPUTING, 2023, 548