Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation

Cited by: 5

Authors:

Liu, Qi [1]

Li, Yanjie [1]

Chen, Shiyu [1]

Lin, Ke [1]

Shi, Xiongtao [1]

Lou, Yunjiang [1]

Affiliations:

[1] Harbin Inst Technol, Dept Control Sci & Engn, Shenzhen 518055, Peoples R China

Source:

INFORMATION SCIENCES | 2023 / Vol. 644

Funding:

National Natural Science Foundation of China;

Keywords:

Distributional reinforcement learning; Uncertainty; Risk sensitive policy; Exploration;

DOI:

10.1016/j.ins.2023.119217

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Distributional reinforcement learning (RL) differs from conventional RL, which only estimates the expectation of the return. Distributional RL considers the return as a random variable and estimates its distribution. The return distribution can provide more information than its expectation in conventional RL. Thus, distributional RL has been widely studied. However, very few previous works take full advantage of the learned distribution to improve distributional RL. This paper improves distributional RL by introducing epistemic and aleatoric uncertainty estimation. First, an epistemic and aleatoric uncertainty estimation method is introduced using deep ensembles and the learned value distribution. Next, we improve the exploration efficiency of the fully parametrized quantile function (FQF) for distributional RL and obtain an FQF-U (uncertainty) algorithm. Then, to overcome the problem that distributional RL cannot operate over continuous control tasks, we propose an epistemic-uncertainty-based distributional soft actor-critic algorithm with an adaptive risk-averse and risk-seeking policy. Finally, experimental results show that our algorithms outperform the baselines in Atari games and Multi-joint dynamics with contact (MuJoCo) environments.
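
The abstract does not state the estimator itself, so the following is only a minimal sketch of one common way to split uncertainty when an ensemble of learned return distributions is available. It assumes each ensemble member outputs quantile estimates of the return for one fixed state-action pair, and that the quantiles are equally weighted (FQF learns its own quantile fractions, so this is a simplification). The function name `decompose_uncertainty` and the variance-based split are illustrative assumptions, not the paper's method: disagreement between members stands in for epistemic uncertainty, and the average spread within a member stands in for aleatoric uncertainty.

```python
from statistics import fmean, pvariance


def decompose_uncertainty(member_quantiles):
    """Split return uncertainty into epistemic and aleatoric parts.

    member_quantiles: list of lists; each inner list holds one ensemble
    member's quantile estimates of the return (hypothetical layout,
    equally weighted quantiles assumed).
    """
    # Mean return predicted by each ensemble member.
    member_means = [fmean(q) for q in member_quantiles]
    # Spread of the return distribution inside each member.
    member_vars = [pvariance(q) for q in member_quantiles]
    # Epistemic proxy: how much the members disagree about the mean.
    epistemic = pvariance(member_means)
    # Aleatoric proxy: average within-member variance of the return.
    aleatoric = fmean(member_vars)
    return epistemic, aleatoric


# Two hypothetical ensemble members, three quantiles each.
quantiles = [[0.0, 1.0, 2.0],
             [1.0, 2.0, 3.0]]
epistemic, aleatoric = decompose_uncertainty(quantiles)
print(epistemic, aleatoric)
```

Under these assumptions, an exploration bonus or a risk-averse or risk-seeking adjustment could be driven by the epistemic term alone, since that is the part that shrinks as more data is collected.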

Pages: 16

50 records in total

[31] One Step Closer to Unbiased Aleatoric Uncertainty Estimation
Zhang, Wang
Ma, Ziwen Martin
Das, Subhro
Weng, Tsui-Wei (Lily)
Megretski, Alexandre
Daniel, Luca
Nguyen, Lam M.
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16857 - 16864
[32] Entropy-Guided Distributional Reinforcement Learning with Controlling Uncertainty in Robotic Tasks
Cho, Hyunjin
Kim, Hyunseok
APPLIED SCIENCES-BASEL, 2025, 15 (05):
[33] A Bayesian Deep Learning RUL Framework Integrating Epistemic and Aleatoric Uncertainties
Li, Gaoyang
Yang, Li
Lee, Chi-Guhn
Wang, Xiaohua
Rong, Mingzhe
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (09) : 8829 - 8841
[34] Fuel performance uncertainty quantification and sensitivity analysis in the presence of epistemic and aleatoric sources of uncertainties
Faure, Quentin
Delipei, Gregory
Petruzzi, Alessandro
Avramova, Maria
Ivanov, Kostadin
FRONTIERS IN ENERGY RESEARCH, 2023, 11
[35] Distributional Reinforcement Learning with Ensembles
Lindenberg, Bjorn
Nordqvist, Jonas
Lindahl, Karl-Olof
ALGORITHMS, 2020, 13 (05)
[36] A Distributional Perspective on Reinforcement Learning
Bellemare, Marc G.
Dabney, Will
Munos, Remi
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[37] Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
Acharya, Aastha
Russell, Rebecca
Ahmed, Nisar R.
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 12751 - 12757
[38] Distributional Reinforcement Learning in the Brain
Lowet, Adam S.
Zheng, Qiao
Matias, Sara
Drugowitsch, Jan
Uchida, Naoshige
TRENDS IN NEUROSCIENCES, 2020, 43 (12) : 980 - 997
[39] Exploration by Distributional Reinforcement Learning
Tang, Yunhao
Agrawal, Shipra
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2710 - 2716
[40] Implicit Distributional Reinforcement Learning
Yue, Yuguang
Wang, Zhendong
Zhou, Mingyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
