Distributional Reward Decomposition for Reinforcement Learning

被引:0
|
作者
Lin, Zichuan [1 ,2 ]
Zhao, Li [2 ]
Yang, Derek [3 ]
Qin, Tao [2 ]
Yang, Guangwen [1 ]
Liu, Tie-Yan [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res, Redmond, WA USA
[3] Univ Calif San Diego, La Jolla, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many reinforcement learning (RL) tasks have specific properties that can be lever-aged to modify existing RL algorithms to adapt to those tasks and further improve performance, and a general class of such properties is the multiple reward channel. In those environments the full reward can be decomposed into sub-rewards obtained from different channels. Existing work on reward decomposition either requires prior knowledge of the environment to decompose the full reward, or decomposes reward without prior knowledge but with degraded performance. In this paper, we propose Distributional Reward Decomposition for Reinforcement Learning (DRDRL), a novel reward decomposition algorithm which captures the multiple reward channel structure under distributional setting. Empirically, our method captures the multi-channel structure and discovers meaningful reward decomposition, without any requirements on prior knowledge. Consequently, our agent achieves better performance than existing methods on environments with multiple reward channels.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Distributional Reinforcement Learning for Multi-Dimensional Reward Functions
    Zhang, Pushi
    Chen, Xiaoyu
    Zhao, Li
    Xiong, Wei
    Qin, Tao
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
    Hu, Jifeng
    Sun, Yanchao
    Chen, Hechang
    Huang, Sili
    Piao, Haiyin
    Chang, Yi
    Sun, Lichao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Automatic Decomposition of Reward Machines for Decentralized Multiagent Reinforcement Learning
    Smith, Sophia
    Neary, Cyrus
    Topcu, Ufuk
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5423 - 5430
  • [4] Noise Distribution Decomposition Based Multi-Agent Distributional Reinforcement Learning
    Geng, Wei
    Xiao, Baidi
    Li, Rongpeng
    Wei, Ning
    Wang, Dong
    Zhao, Zhifeng
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 2301 - 2314
  • [5] Distributional Reinforcement Learning with Ensembles
    Lindenberg, Bjorn
    Nordqvist, Jonas
    Lindahl, Karl-Olof
    ALGORITHMS, 2020, 13 (05)
  • [6] A Distributional Perspective on Reinforcement Learning
    Bellemare, Marc G.
    Dabney, Will
    Munos, Remi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [7] Distributional Reinforcement Learning in the Brain
    Lowet, Adam S.
    Zheng, Qiao
    Matias, Sara
    Drugowitsch, Jan
    Uchida, Naoshige
    TRENDS IN NEUROSCIENCES, 2020, 43 (12) : 980 - 997
  • [8] Exploration by Distributional Reinforcement Learning
    Tang, Yunhao
    Agrawal, Shipra
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2710 - 2716
  • [9] Implicit Distributional Reinforcement Learning
    Yue, Yuguang
    Wang, Zhendong
    Zhou, Mingyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [10] Distributional Reinforcement Learning for Efficient Exploration
    Mavrin, Borislav
    Yao, Hengshuai
    Kong, Linglong
    Wu, Kaiwen
    Yu, Yaoliang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97