Distributional Reward Decomposition for Reinforcement Learning

被引：0

作者：

Lin, Zichuan ^{[1
,2
]}

Zhao, Li ^{[2
]}

Yang, Derek ^{[3
]}

Qin, Tao ^{[2
]}

Yang, Guangwen ^{[1
]}

Liu, Tie-Yan ^{[2
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Microsoft Res, Redmond, WA USA

[3] Univ Calif San Diego, La Jolla, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many reinforcement learning (RL) tasks have specific properties that can be lever-aged to modify existing RL algorithms to adapt to those tasks and further improve performance, and a general class of such properties is the multiple reward channel. In those environments the full reward can be decomposed into sub-rewards obtained from different channels. Existing work on reward decomposition either requires prior knowledge of the environment to decompose the full reward, or decomposes reward without prior knowledge but with degraded performance. In this paper, we propose Distributional Reward Decomposition for Reinforcement Learning (DRDRL), a novel reward decomposition algorithm which captures the multiple reward channel structure under distributional setting. Empirically, our method captures the multi-channel structure and discovers meaningful reward decomposition, without any requirements on prior knowledge. Consequently, our agent achieves better performance than existing methods on environments with multiple reward channels.

引用

页数：10

共 50 条

[1] Distributional Reinforcement Learning for Multi-Dimensional Reward Functions
Zhang, Pushi
Chen, Xiaoyu
Zhao, Li
Xiong, Wei
Qin, Tao
Liu, Tie-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[2] Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Hu, Jifeng
Sun, Yanchao
Chen, Hechang
Huang, Sili
Piao, Haiyin
Chang, Yi
Sun, Lichao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[3] Automatic Decomposition of Reward Machines for Decentralized Multiagent Reinforcement Learning
Smith, Sophia
Neary, Cyrus
Topcu, Ufuk
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5423 - 5430
[4] Noise Distribution Decomposition Based Multi-Agent Distributional Reinforcement Learning
Geng, Wei
Xiao, Baidi
Li, Rongpeng
Wei, Ning
Wang, Dong
Zhao, Zhifeng
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 2301 - 2314
[5] Distributional Reinforcement Learning with Ensembles
Lindenberg, Bjorn
Nordqvist, Jonas
Lindahl, Karl-Olof
ALGORITHMS, 2020, 13 (05)
[6] A Distributional Perspective on Reinforcement Learning
Bellemare, Marc G.
Dabney, Will
Munos, Remi
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[7] Distributional Reinforcement Learning in the Brain
Lowet, Adam S.
Zheng, Qiao
Matias, Sara
Drugowitsch, Jan
Uchida, Naoshige
TRENDS IN NEUROSCIENCES, 2020, 43 (12) : 980 - 997
[8] Exploration by Distributional Reinforcement Learning
Tang, Yunhao
Agrawal, Shipra
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2710 - 2716
[9] Implicit Distributional Reinforcement Learning
Yue, Yuguang
Wang, Zhendong
Zhou, Mingyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[10] Distributional Reinforcement Learning for Efficient Exploration
Mavrin, Borislav
Yao, Hengshuai
Kong, Linglong
Wu, Kaiwen
Yu, Yaoliang
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97

← 1 2 3 4 5 →