Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation

Cited by: 5
Authors
Liu, Qi [1]
Li, Yanjie [1]
Chen, Shiyu [1]
Lin, Ke [1]
Shi, Xiongtao [1]
Lou, Yunjiang [1]
Affiliations
[1] Harbin Inst Technol, Dept Control Sci & Engn, Shenzhen 518055, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Distributional reinforcement learning; Uncertainty; Risk sensitive policy; Exploration
DOI
10.1016/j.ins.2023.119217
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Unlike conventional reinforcement learning (RL), which estimates only the expectation of the return, distributional RL treats the return as a random variable and estimates its distribution. The return distribution carries more information than its expectation alone, so distributional RL has been widely studied. However, very few previous works take full advantage of the learned distribution to improve distributional RL. This paper improves distributional RL by introducing epistemic and aleatoric uncertainty estimation. First, an epistemic and aleatoric uncertainty estimation method is introduced using deep ensembles and the learned value distribution. Next, we improve the exploration efficiency of the fully parameterized quantile function (FQF) for distributional RL and obtain an FQF-U (uncertainty) algorithm. Then, to overcome the problem that distributional RL cannot operate over continuous control tasks, we propose an epistemic-uncertainty-based distributional soft actor-critic algorithm with an adaptive risk-averse and risk-seeking policy. Finally, experimental results show that our algorithms outperform the baselines in Atari games and Multi-Joint dynamics with Contact (MuJoCo) environments.
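The abstract does not give the paper's estimator, so the following is only a minimal sketch of one common way to split uncertainty when an ensemble of return distributions is available. It assumes each ensemble member outputs a list of equally weighted quantile values for a single state-action pair (FQF itself learns non-uniform quantile fractions, which this sketch ignores). The function name `decompose_uncertainty` and the example numbers are hypothetical.

```python
from statistics import fmean, pvariance


def decompose_uncertainty(ensemble_quantiles):
    """Split return uncertainty for one (state, action) pair.

    ensemble_quantiles[k][i] is the i-th quantile value predicted by
    ensemble member k. Assumption: quantiles are equally weighted.
    """
    # Mean and variance of each member's predicted return distribution.
    member_means = [fmean(q) for q in ensemble_quantiles]
    member_vars = [pvariance(q) for q in ensemble_quantiles]

    # Epistemic: disagreement between members about the expected return.
    epistemic = pvariance(member_means)
    # Aleatoric: average spread inside each member's own distribution.
    aleatoric = fmean(member_vars)
    return epistemic, aleatoric


# Hypothetical data: two members, three quantiles each.
members_agree = [[0.0, 1.0, 2.0], [0.0, 1.0, 2.0]]
members_disagree = [[0.0, 1.0, 2.0], [2.0, 3.0, 4.0]]

print(decompose_uncertainty(members_agree))
print(decompose_uncertainty(members_disagree))
```

In this sketch, members that agree yield zero epistemic uncertainty, while shifting one member's distribution raises the epistemic term and leaves the aleatoric term unchanged.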
Pages: 16
Related papers
50 in total
  • [31] One Step Closer to Unbiased Aleatoric Uncertainty Estimation
    Zhang, Wang
    Ma, Ziwen Martin
    Das, Subhro
    Weng, Tsui-Wei (Lily)
    Megretski, Alexandre
    Daniel, Luca
    Nguyen, Lam M.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16857 - 16864
  • [32] Entropy-Guided Distributional Reinforcement Learning with Controlling Uncertainty in Robotic Tasks
    Cho, Hyunjin
    Kim, Hyunseok
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [33] A Bayesian Deep Learning RUL Framework Integrating Epistemic and Aleatoric Uncertainties
    Li, Gaoyang
    Yang, Li
    Lee, Chi-Guhn
    Wang, Xiaohua
    Rong, Mingzhe
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (09) : 8829 - 8841
  • [34] Fuel performance uncertainty quantification and sensitivity analysis in the presence of epistemic and aleatoric sources of uncertainties
    Faure, Quentin
    Delipei, Gregory
    Petruzzi, Alessandro
    Avramova, Maria
    Ivanov, Kostadin
    FRONTIERS IN ENERGY RESEARCH, 2023, 11
  • [35] Distributional Reinforcement Learning with Ensembles
    Lindenberg, Bjorn
    Nordqvist, Jonas
    Lindahl, Karl-Olof
    ALGORITHMS, 2020, 13 (05)
  • [36] A Distributional Perspective on Reinforcement Learning
    Bellemare, Marc G.
    Dabney, Will
    Munos, Remi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [37] Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
    Acharya, Aastha
    Russell, Rebecca
    Ahmed, Nisar R.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 12751 - 12757
  • [38] Distributional Reinforcement Learning in the Brain
    Lowet, Adam S.
    Zheng, Qiao
    Matias, Sara
    Drugowitsch, Jan
    Uchida, Naoshige
    TRENDS IN NEUROSCIENCES, 2020, 43 (12) : 980 - 997
  • [39] Exploration by Distributional Reinforcement Learning
    Tang, Yunhao
    Agrawal, Shipra
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2710 - 2716
  • [40] Implicit Distributional Reinforcement Learning
    Yue, Yuguang
    Wang, Zhendong
    Zhou, Mingyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33