Reinforcement Learning in Multidimensional Environments Relies on Attention Mechanisms

被引:233
|
作者
Niv, Yael [1 ,2 ]
Daniel, Reka [1 ,2 ]
Geana, Andra [1 ,2 ]
Gershman, Samuel J. [3 ]
Leong, Yuan Chang [4 ]
Radulescu, Angela [1 ,2 ]
Wilson, Robert C. [5 ,6 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[2] Princeton Univ, Inst Neurosci, Princeton, NJ 08540 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[4] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[5] Univ Arizona, Dept Psychol, Tucson, AZ 85721 USA
[6] Univ Arizona, Cognit Sci Program, Tucson, AZ 85721 USA
来源
JOURNAL OF NEUROSCIENCE | 2015年 / 35卷 / 21期
关键词
attention; fMRI; frontoparietal network; model comparison; reinforcement learning; representation learning; PREFRONTAL CORTEX; PREDICTION ERRORS; SELECTIVE ATTENTION; COGNITIVE FUNCTIONS; PARKINSONS-DISEASE; NEURAL MECHANISMS; FRONTAL-CORTEX; MODELS; TASK; CATEGORIZATION;
D O I
10.1523/JNEUROSCI.2978-14.2015
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.
引用
收藏
页码:8145 / 8157
页数:13
相关论文
共 50 条
  • [1] Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments
    Leong, Yuan Chang
    Radulescu, Angela
    Daniel, Reka
    DeWoskin, Vivian
    Niv, Yael
    NEURON, 2017, 93 (02) : 451 - 463
  • [2] Contributions of Attention to Learning in Multidimensional Reward Environments
    Wang, Michael Chong
    Soltani, Alireza
    JOURNAL OF NEUROSCIENCE, 2025, 45 (07):
  • [3] Attention-Based Mechanisms for Cognitive Reinforcement Learning
    Gao, Yue
    Li, Di
    Chen, Xiangjian
    Zhu, Junwu
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [4] An overview: Attention mechanisms in multi-agent reinforcement learning
    Hu, Kai
    Xu, Keer
    Xia, Qingfeng
    Li, Mingyang
    Song, Zhiqiang
    Song, Lipeng
    Sun, Ning
    NEUROCOMPUTING, 2024, 598
  • [5] The social transmission of empathy relies on observational reinforcement learning
    Zhou, Yuqing
    Han, Shihui
    Kang, Pyungwon
    Tobler, Philippe N.
    Hein, Grit
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2024, 121 (09)
  • [6] The social transmission of empathy relies on observational reinforcement learning
    Zhou, Yuqing
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2024, 59 : 702 - 702
  • [7] Attention-based Deep Reinforcement Learning for Multi-view Environments
    Barati, Elaheh
    Chen, Xuewen
    Zhong, Zichun
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1805 - 1807
  • [8] Portfolio trading system of digital currencies: A deep reinforcement learning with multidimensional attention gating mechanism
    Weng, Liguo
    Sun, Xudong
    Xia, Min
    Liu, Jia
    Xu, Yiqing
    NEUROCOMPUTING, 2020, 402 : 171 - 182
  • [9] Multidimensional triangulation and interpolation for reinforcement learning
    Davies, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 1005 - 1011
  • [10] Traffic Signal Control Optimization Based on Deep Reinforcement Learning with Attention Mechanisms
    Ni, Wenlong
    Wang, Peng
    Li, Zehong
    Li, Chuanzhuang
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT III, 2024, 14449 : 147 - 158