Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation

被引:5
|
作者
Liu, Qi [1 ]
Li, Yanjie [1 ]
Chen, Shiyu [1 ]
Lin, Ke [1 ]
Shi, Xiongtao [1 ]
Lou, Yunjiang [1 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributional reinforcement learning; Uncertainty; Risk sensitive policy; Exploration;
D O I
10.1016/j.ins.2023.119217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributional reinforcement learning (RL) differs from conventional RL, which only estimates the expectation of the return. Distributional RL considers the return as a random variable and estimates its distribution. The return distribution can provide more information than its expectation in conventional RL. Thus, distributional RL has been widely studied. However, very few previous works take full advantage of the learned distribution to improve distributional RL. This paper improves distributional RL by introducing epistemic and aleatoric uncertainty estimation. First, an epistemic and aleatoric uncertainty estimation method is introduced using deep ensembles and the learned value distribution. Next, we improve the exploration efficiency of fully parametrized quantile function (FQF) for distributional RL and obtain a FQF-U (uncertainty) algorithm. Then, to overcome the problem that distributional RL cannot operate over continuous control tasks, we propose an epistemic-uncertainty-based distributional soft actor-critic algorithm with an adaptive risk-averse and risk-seeking policy. Finally, experimental results show that our algorithms outperform the baselines in Atari games and Multi-joint dynamics with contact (MuJoCo) environments.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] The role of epistemic uncertainty of contact models in the design and optimization of mechanical systems with aleatoric uncertainty
    Brake, M. R.
    NONLINEAR DYNAMICS, 2014, 77 (03) : 899 - 922
  • [22] Aleatoric and Epistemic Uncertainty Quantification in Bayesian Dirichlet Cost Rules of Thumb
    Fleischer, Sam
    Hooke, Melissa
    2023 IEEE AEROSPACE CONFERENCE, 2023,
  • [23] Towards reasoning over knowledge graphs under aleatoric and epistemic uncertainty
    Kunitomo-Jacquin, Lucie
    Fukuda, Ken
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 294 - 295
  • [24] A general framework for quantifying aleatoric and epistemic uncertainty in graph neural networks
    Munikoti, Sai
    Agarwal, Deepesh
    Das, Laya
    Natarajan, Balasubramaniam
    NEUROCOMPUTING, 2023, 521 : 1 - 10
  • [25] Quantification of margins and uncertainties of complex systems in the presence of aleatoric and epistemic uncertainty
    Urbina, Angel
    Mahadevan, Sankaran
    Paez, Thomas L.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2011, 96 (09) : 1114 - 1125
  • [26] Aleatoric and epistemic uncertainty in the overstrength of CLT-to-CLT screwed connections
    Aloisio, Angelo
    De Santis, Yuri
    Pasca, Dag Pasquale
    Fragiacomo, Massimo
    Tomasi, Roberto
    ENGINEERING STRUCTURES, 2024, 304
  • [27] Decision-making models on perceptual uncertainty with distributional reinforcement learning
    Xu, Shuyuan
    Liu, Qiao
    Hu, Yuhui
    Xu, Mengtian
    Hao, Jiachen
    GREEN ENERGY AND INTELLIGENT TRANSPORTATION, 2023, 2 (02):
  • [28] Reliable Multi-class Classification based on Pairwise Epistemic and Aleatoric Uncertainty
    Nguyen, Vu-Linh
    Destercke, Sebastien
    Masson, Marie-Helene
    Huellermeier, Eyke
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5089 - 5095
  • [29] Development of a fuzzy-stochastic nonlinear model to incorporate aleatoric and epistemic uncertainty
    Li, Hua
    Zhang, Kejiang
    JOURNAL OF CONTAMINANT HYDROLOGY, 2010, 111 (1-4) : 1 - 12
  • [30] Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation
    Bae, Gwangbin
    Budvytis, Ignas
    Cipolla, Roberto
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13117 - 13126