Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation

被引:5
|
作者
Liu, Qi [1 ]
Li, Yanjie [1 ]
Chen, Shiyu [1 ]
Lin, Ke [1 ]
Shi, Xiongtao [1 ]
Lou, Yunjiang [1 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributional reinforcement learning; Uncertainty; Risk sensitive policy; Exploration;
D O I
10.1016/j.ins.2023.119217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributional reinforcement learning (RL) differs from conventional RL, which only estimates the expectation of the return. Distributional RL considers the return as a random variable and estimates its distribution. The return distribution can provide more information than its expectation in conventional RL. Thus, distributional RL has been widely studied. However, very few previous works take full advantage of the learned distribution to improve distributional RL. This paper improves distributional RL by introducing epistemic and aleatoric uncertainty estimation. First, an epistemic and aleatoric uncertainty estimation method is introduced using deep ensembles and the learned value distribution. Next, we improve the exploration efficiency of fully parametrized quantile function (FQF) for distributional RL and obtain a FQF-U (uncertainty) algorithm. Then, to overcome the problem that distributional RL cannot operate over continuous control tasks, we propose an epistemic-uncertainty-based distributional soft actor-critic algorithm with an adaptive risk-averse and risk-seeking policy. Finally, experimental results show that our algorithms outperform the baselines in Atari games and Multi-joint dynamics with contact (MuJoCo) environments.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Exploration via Distributional Reinforcement Learning with Epistemic and Aleatoric Uncertainty Estimation
    Liu, Qi
    Li, Yanjie
    Liu, Yuecheng
    Chen, Meiling
    Lv, Shaohua
    Xu, Yunhong
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 2256 - 2261
  • [2] EPISTEMIC AND ALEATORIC UNCERTAINTY IN MODELING
    Segalman, Daniel J.
    Brake, Matthew R.
    Bergman, Lawrence A.
    Vakakis, Alexander F.
    Willner, Kai
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2013, VOL 8, 2014,
  • [3] Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods
    Eyke Hüllermeier
    Willem Waegeman
    Machine Learning, 2021, 110 : 457 - 506
  • [4] Reliable classification: Learning classifiers that distinguish aleatoric and epistemic uncertainty
    Senge, Robin
    Boesner, Stefan
    Dembczynski, Krzysztof
    Haasenritter, Joerg
    Hirsch, Oliver
    Donner-Banzhoff, Norbert
    Huellermeier, Eyke
    INFORMATION SCIENCES, 2014, 255 : 16 - 29
  • [5] Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods
    Huellermeier, Eyke
    Waegeman, Willem
    MACHINE LEARNING, 2021, 110 (03) : 457 - 506
  • [6] Safe Reinforcement Learning in Autonomous Driving With Epistemic Uncertainty Estimation
    Zhang, Zheng
    Liu, Qi
    Li, Yanjie
    Lin, Ke
    Li, Linyu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13653 - 13666
  • [7] Aleatoric and Epistemic Uncertainty with Random Forests
    Shaker, Mohammad Hossein
    Huellermeier, Eyke
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVIII, IDA 2020, 2020, 12080 : 444 - 456
  • [8] A Deeper Look into Aleatoric and Epistemic Uncertainty Disentanglement
    Valdenegro-Toro, Matias
    Mori, Daniel Saromo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1508 - 1516
  • [9] Towards Aleatoric and Epistemic Uncertainty in Medical Image Classification
    Loehr, Timo
    Ingrisch, Michael
    Huellermeier, Eyke
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 145 - 155
  • [10] Label-wise Aleatoric and Epistemic Uncertainty Quantification
    Sale, Yusuf
    Hofman, Paul
    Loehr, Timo
    Wimmer, Lisa
    Nagler, Thomas
    Huellermeier, Eyke
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2024, 244 : 3159 - 3179