Distributional Reward Decomposition for Reinforcement Learning

Cited by: 0

Authors:

Lin, Zichuan ^{[1,2]}

Zhao, Li ^{[2]}

Yang, Derek ^{[3]}

Qin, Tao ^{[2]}

Yang, Guangwen ^{[1]}

Liu, Tie-Yan ^{[2]}

Affiliations:

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Microsoft Res, Redmond, WA USA

[3] Univ Calif San Diego, La Jolla, CA USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019 / Vol. 32

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Many reinforcement learning (RL) tasks have specific properties that can be leveraged to adapt existing RL algorithms to those tasks and further improve performance; one general class of such properties is the multiple reward channel. In such environments, the full reward can be decomposed into sub-rewards obtained from different channels. Existing work on reward decomposition either requires prior knowledge of the environment to decompose the full reward, or decomposes the reward without prior knowledge but with degraded performance. In this paper, we propose Distributional Reward Decomposition for Reinforcement Learning (DRDRL), a novel reward decomposition algorithm that captures the multiple reward channel structure under the distributional setting. Empirically, our method captures the multi-channel structure and discovers meaningful reward decompositions without requiring any prior knowledge. Consequently, our agent achieves better performance than existing methods on environments with multiple reward channels.
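
To make the idea of sub-rewards from separate channels concrete in a distributional setting, the following is a minimal sketch and not the DRDRL algorithm itself, whose details the abstract does not give. It assumes, purely for illustration, that each channel's return is an independent categorical distribution over the integer support 0, 1, 2, ..., so that the distribution of the full return is the convolution of the per-channel distributions. The function names and the two example channels are hypothetical.

```python
def convolve(p, q):
    """Distribution of X + Y for independent X ~ p and Y ~ q on supports 0..len-1."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out


def combine_sub_returns(sub_dists):
    """Combine per-channel return distributions into a full-return distribution.

    Assumption (illustrative only): channels are independent, so the sum of
    their returns is distributed as the convolution of their distributions.
    """
    total = [1.0]  # point mass at 0, the identity element for convolution
    for probs in sub_dists:
        total = convolve(total, probs)
    return total


def mean(probs):
    """Expected value of a categorical distribution on support 0..len-1."""
    return sum(k * p for k, p in enumerate(probs))


# Two hypothetical reward channels, each over returns {0, 1, 2}.
channel_a = [0.5, 0.5, 0.0]
channel_b = [0.2, 0.3, 0.5]

full = combine_sub_returns([channel_a, channel_b])

# The expected full return equals the sum of the expected sub-returns.
print(mean(full), mean(channel_a) + mean(channel_b))
```

Under these assumptions the decomposition is additive in expectation: the mean of the combined distribution matches the sum of the per-channel means, which is the property that lets an agent act on the full return while still tracking each channel separately.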


Pages: 10

50 items in total

[21] Information Directed Reward Learning for Reinforcement Learning
Lindner, David
Turchetta, Matteo
Tschiatschek, Sebastian
Ciosek, Kamil
Krause, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[22] Reinforcement learning reward functions for unsupervised learning
Fyfe, Colin
Lai, Pei Ling
ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
[23] Conjugated Discrete Distributions for Distributional Reinforcement Learning
Lindenberg, Bjorn
Nordqvist, Jonas
Lindahl, Karl-Olof
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7516 - 7524
[24] An opponent striatal circuit for distributional reinforcement learning
Lowet, Adam S.
Zheng, Qiao
Meng, Melissa
Matias, Sara
Drugowitsch, Jan
Uchida, Naoshige
NATURE, 2025, : 717 - 726
[25] Distributional reinforcement learning with linear function approximation
Bellemare, Marc G.
Le Roux, Nicolas
Castro, Pablo Samuel
Moitra, Subhodeep
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
[26] Implicit Quantile Networks for Distributional Reinforcement Learning
Dabney, Will
Ostrovski, Georg
Silver, David
Munos, Remi
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[27] A Comparative Analysis of Expected and Distributional Reinforcement Learning
Lyle, Clare
Bellemare, Marc G.
Castro, Pablo Samuel
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4504 - 4511
[28] Distributional Reinforcement Learning via Moment Matching
Thanh Nguyen-Tang
Gupta, Sunil
Venkatesh, Svetha
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9144 - 9152
[29] Distributional Deep Reinforcement Learning with a Mixture of Gaussians
Choi, Yunho
Lee, Kyungjae
Oh, Songhwai
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9791 - 9797
[30] Belief Reward Shaping in Reinforcement Learning
Marom, Ofir
Rosman, Benjamin
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3762 - 3769
