Momentum in Reinforcement Learning

被引:0
|
作者
Vieillard, Nino [1 ,2 ]
Scherrer, Bruno [2 ]
Pietquin, Olivier [1 ]
Geist, Matthieu [1 ]
机构
[1] Google Res, Brain Team, Mountain View, CA 94043 USA
[2] Univ Lorraine, CNRS, INRIA, IECL, F-54000 Nancy, France
关键词
ENVIRONMENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We adapt the optimization's concept of momentum to reinforcement learning. Seeing the state-action value functions as an analog to the gradients in optimization, we interpret momentum as an average of consecutive q-functions. We derive Momentum Value Iteration (MoVI), a variation of Value iteration that incorporates this momentum idea. Our analysis shows that this allows MoVI to average errors over successive iterations. We show that the proposed approach can be readily extended to deep learning. Specifically,we propose a simple improvement on DQN based on MoVI, and experiment it on Atari games.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Momentum-Based Contextual Federated Reinforcement Learning
    Yue, Sheng
    Hua, Xingyuan
    Deng, Yongheng
    Chen, Lili
    Ren, Ju
    Zhang, Yaoxue
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024,
  • [2] Performance Comparision of Different Momentum Techniques on Deep Reinforcement Learning
    Sarigul, Mehmet
    Avci, Mutlu
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 302 - 306
  • [3] Performance comparison of different momentum techniques on deep reinforcement learning*
    Sarigul, M.
    Avci, M.
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2018, 2 (02) : 205 - 216
  • [4] Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency
    Yue, Sheng
    Hua, Xingyuan
    Chen, Lili
    Ren, Ju
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 1131 - 1140
  • [5] Using Collision Momentum in Deep Reinforcement Learning based Adversarial Pedestrian Modeling
    Chen, Dianwei
    Yurtsever, Ekim
    Redmill, Keith A.
    Ozguner, Umit
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [6] Contrafreeloading, reinforcement rate, and behavioral momentum
    Podlesnik, Christopher A.
    Jimenez-Gomez, Corina
    BEHAVIOURAL PROCESSES, 2016, 128 : 24 - 28
  • [7] A distributed adaptive policy gradient method based on momentum for multi-agent reinforcement learning
    Shi, Junru
    Wang, Xin
    Zhang, Mingchuan
    Liu, Muhua
    Zhu, Junlong
    Wu, Qingtao
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 7297 - 7310
  • [8] Decentralized multi-task reinforcement learning policy gradient method with momentum over networks
    Shi Junru
    Wang Qiong
    Liu Muhua
    Ji Zhihang
    Zheng Ruijuan
    Wu Qingtao
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10365 - 10379
  • [9] Decentralized multi-task reinforcement learning policy gradient method with momentum over networks
    Shi Junru
    Wang Qiong
    Liu Muhua
    Ji Zhihang
    Zheng Ruijuan
    Wu Qingtao
    Applied Intelligence, 2023, 53 : 10365 - 10379
  • [10] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648