Reinforcement Learning in Multidimensional Environments Relies on Attention Mechanisms

被引:233
|
作者
Niv, Yael [1 ,2 ]
Daniel, Reka [1 ,2 ]
Geana, Andra [1 ,2 ]
Gershman, Samuel J. [3 ]
Leong, Yuan Chang [4 ]
Radulescu, Angela [1 ,2 ]
Wilson, Robert C. [5 ,6 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[2] Princeton Univ, Inst Neurosci, Princeton, NJ 08540 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[4] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[5] Univ Arizona, Dept Psychol, Tucson, AZ 85721 USA
[6] Univ Arizona, Cognit Sci Program, Tucson, AZ 85721 USA
来源
JOURNAL OF NEUROSCIENCE | 2015年 / 35卷 / 21期
关键词
attention; fMRI; frontoparietal network; model comparison; reinforcement learning; representation learning; PREFRONTAL CORTEX; PREDICTION ERRORS; SELECTIVE ATTENTION; COGNITIVE FUNCTIONS; PARKINSONS-DISEASE; NEURAL MECHANISMS; FRONTAL-CORTEX; MODELS; TASK; CATEGORIZATION;
D O I
10.1523/JNEUROSCI.2978-14.2015
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.
引用
收藏
页码:8145 / 8157
页数:13
相关论文
共 50 条
  • [21] Delayed reinforcement learning of multidimensional control actions
    Cichosz, P.
    Systems Analysis Modelling Simulation, 1996, 24 (1-3): : 233 - 248
  • [22] Reinforcement Learning of Dimensional Attention for Categorization
    Phillips, Joshua L.
    Noelle, David C.
    PROCEEDINGS OF THE TWENTY-SIXTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 2004, : 1101 - 1106
  • [23] Deep Residual Attention Reinforcement Learning
    Zhu, Hanhua
    Kaneko, Tomoyuki
    2019 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2019,
  • [24] Reactive Reinforcement Learning in Asynchronous Environments
    Travnik, Jaden B.
    Mathewson, Kory W.
    Sutton, Richard S.
    Pilarski, Patrick M.
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [25] Reinforcement Learning in Configurable Continuous Environments
    Metelli, Alberto Maria
    Ghelfi, Emanuele
    Restelli, Marcello
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [26] Reinforcement Learning in Latent Heterogeneous Environments
    Chen, Elynn Y.
    Song, Rui
    Jordan, Michael I.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 3113 - 3126
  • [27] A Survey on Simulation Environments for Reinforcement Learning
    Kim, Taewoo
    Jang, Minsu
    Kim, Jaehong
    2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021, : 63 - 67
  • [28] Unsupervised Reinforcement Learning in Multiple Environments
    Mutti, Mirco
    Mancassola, Mattia
    Restelli, Marcello
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7850 - 7858
  • [29] Review of Multimodal Environments for Reinforcement Learning
    Z. A. Volovikova
    M. P. Kuznetsova
    A. A. Skrynnik
    A. I. Panov
    Doklady Mathematics, 2024, 110 (Suppl 1) : S110 - S116
  • [30] A Multidimensional Space Approach to Innovative Learning Environments
    Sardinha, Lara
    Pisco Almeida, Ana Margarida
    Pedro, Neuza
    PROJECT AND DESIGN LITERACY AS CORNERSTONES OF SMART EDUCATION, 2020, 158 : 109 - 117