Prediction and Control in Continual Reinforcement Learning

被引:0
|
作者
Anand, Nishanth [1 ,2 ]
Precup, Doina [1 ,3 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Mila, Milan, Italy
[3] Deepmind, London, England
基金
加拿大自然科学与工程研究理事会;
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal difference (TD) learning is often used to update the estimate of the value function which is used by RL agents to extract useful policies. In this paper, we focus on value function estimation in continual reinforcement learning. We propose to decompose the value function into two components which update at different timescales: a permanent value function, which holds general knowledge that persists over time, and a transient value function, which allows quick adaptation to new situations. We establish theoretical results showing that our approach is well suited for continual learning and draw connections to the complementary learning systems (CLS) theory from neuroscience. Empirically, this approach improves performance significantly on both prediction and control problems.
引用
收藏
页数:39
相关论文
共 50 条
  • [21] Towards Continual Reinforcement Learning through Evolutionary Meta-Learning
    Grbic, Djordje
    Risi, Sebastian
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 119 - 120
  • [22] SELF-ACTIVATING NEURAL ENSEMBLES FOR CONTINUAL REINFORCEMENT LEARNING
    Powers, Sam
    Xing, Eliot
    Gupta, Abhinav
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [23] Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control
    Prashanth, L. A.
    Jie, Cheng
    Fu, Michael
    Marcus, Steve
    Szepesvari, Csaba
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [24] Solving Continual Combinatorial Selection via Deep Reinforcement Learning
    Song, Hyungseok
    Jang, Hyeryung
    Tran, Hai H.
    Yoon, Se-eun
    Son, Kyunghwan
    Yun, Donggyu
    Chung, Hyoju
    Yi, Yung
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3467 - 3474
  • [25] Neural Manifold Modulated Continual Reinforcement Learning for Musculoskeletal Robots
    Chen, Jiahao
    Chen, Ziyu
    Yao, Chaojing
    Qiao, Hong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 86 - 99
  • [26] Continual Vision-based Reinforcement Learning with Group Symmetries
    Liu, Shiqi
    Xu, Mengdi
    Huang, Peide
    Zhang, Xilun
    Liu, Yongkang
    Oguchi, Kentaro
    Zhao, Ding
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [27] Expandable Orbit Decay Prediction Using Continual Learning
    He, Junhua
    Wang, Hua
    Wang, Haitao
    Fang, Xuankun
    Huo, Chengyi
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2024, 2024
  • [28] Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation
    Davaslioglu, Kemal
    Kompella, Sastry
    Erpek, Tugba
    Sagduyu, Yalin E.
    arXiv,
  • [29] Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
    Pathmanathan, Pankayaraj
    Diaz-Rodriguez, Natalia
    Del Ser, Javier
    COGNITIVE COMPUTATION, 2024, 16 (01) : 425 - 453
  • [30] Continual Reinforcement Learning for Intelligent Agricultural Management under Climate Changes
    Wang, Zhaoan
    Jha, Kishlay
    Xiao, Shaoping
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (01): : 1319 - 1336