Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning

Cited by: 0
Authors
Su, Yunjie [1]
Kong, Yilun [1]
Wang, Xueqian [1]
Affiliations
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
Keywords
Data augmentation; Uncertainty estimation; Out of distribution; Offline reinforcement learning
DOI
10.1109/IJCNN54540.2023.10191211
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A key challenge in offline reinforcement learning is that the agent cannot explore the environment further, so it generalizes poorly to out-of-distribution data. Data augmentation is commonly used to address the limited coverage of the full state-action space in static offline datasets. However, existing data augmentation methods for proprioceptive observations face a dilemma: tight constraints limit data coverage, while aggressive methods may degrade performance. At the heart of this phenomenon are the diverged action distribution and the high uncertainty of the value function. In this paper, we propose to extend static offline datasets during training by adding gradient-based perturbations to the state and using the estimated uncertainty of the value function to constrain the range of the gradient. The estimated uncertainty guides the automatic adjustment of the augmentation range, ensuring the adaptability and reliability of the state perturbation. The proposed algorithm, Uncertainty-Aware Data Augmentation (UADA), is plugged into various standard offline RL algorithms and evaluated on several offline reinforcement learning tasks. The empirical results confirm that UADA substantially improves performance and achieves better model stability compared with the original algorithms.
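The idea described in the abstract, perturbing states along a value-function gradient with a step size that shrinks as value uncertainty grows, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not specify how uncertainty is estimated or how it bounds the perturbation, so everything below is an assumption made for illustration: the Q-ensemble with its standard deviation as the uncertainty estimate, the sign-of-gradient direction, the rule `eps = eps_max / (1 + beta * uncertainty)`, and the hypothetical names `make_ensemble`, `q_and_state_grad`, `augment_states`, `eps_max`, and `beta`.

```python
import numpy as np


def make_ensemble(n_members, state_dim, action_dim, hidden=16, seed=0):
    """Build a toy ensemble of one-hidden-layer tanh Q-functions (assumed stand-in)."""
    rng = np.random.default_rng(seed)
    in_dim = state_dim + action_dim
    return [
        (
            rng.normal(size=(hidden, in_dim)) / np.sqrt(in_dim),  # W
            rng.normal(size=hidden) * 0.1,                        # b
            rng.normal(size=hidden) / np.sqrt(hidden),            # v
        )
        for _ in range(n_members)
    ]


def q_and_state_grad(member, states, actions):
    """Return Q(s, a) and its analytic gradient with respect to the state."""
    W, b, v = member
    x = np.concatenate([states, actions], axis=1)  # (B, in_dim)
    h = np.tanh(x @ W.T + b)                       # (B, hidden)
    q = h @ v                                      # (B,)
    grad_x = ((1.0 - h ** 2) * v) @ W              # (B, in_dim)
    return q, grad_x[:, : states.shape[1]]


def augment_states(ensemble, states, actions, eps_max=0.05, beta=10.0):
    """Perturb states along the sign of the mean Q-gradient.

    Assumption: the step size shrinks where the ensemble disagrees,
    eps = eps_max / (1 + beta * std of Q across members).
    """
    qs, grads = zip(*(q_and_state_grad(m, states, actions) for m in ensemble))
    uncertainty = np.stack(qs).std(axis=0)       # (B,)
    mean_grad = np.mean(grads, axis=0)           # (B, state_dim)
    eps = eps_max / (1.0 + beta * uncertainty)   # (B,), always in (0, eps_max]
    return states + eps[:, None] * np.sign(mean_grad), eps


if __name__ == "__main__":
    ensemble = make_ensemble(n_members=5, state_dim=3, action_dim=2)
    rng = np.random.default_rng(1)
    states = rng.normal(size=(4, 3))
    actions = rng.normal(size=(4, 2))
    augmented, eps = augment_states(ensemble, states, actions)
    print("per-sample step sizes:", eps)
    print("max perturbation:", np.abs(augmented - states).max())
```

In this sketch each perturbed state stays within an L-infinity ball of radius `eps` around the original, so samples with high ensemble disagreement are moved less. A training loop would add the augmented transitions to each mini-batch; how the paper does this is not stated in the abstract.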
Pages: 8