Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning

被引:0
|
作者
Su, Yunjie [1 ]
Kong, Yilun [1 ]
Wang, Xueqian [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
Data augmentation; Uncertainty estimation; Out of distribution; Offline reinforcement learning;
D O I
10.1109/IJCNN54540.2023.10191211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the key challenges in Offline Reinforcement Learning is that it cannot conduct further environment exploration and performs poorly in terms of out-of-distribution generalizations. Data augmentation is commonly used to solve the issue of limited coverage of the full state-action space in static offline dataset. However, the existing data augmentation methods for proprioceptive observation suffer from the dilemma where the data coverage is often limited by tight constraints, while aggressive methods may exacerbate the performance. At the heart of this phenomenon are the diverged action distribution and the high uncertainty of the value function. In this paper, we propose to extend the static offline datasets during training by adding gradient-based perturbation to the state and utilizing the estimated uncertainty of the value function to constrain the range of the gradient. The estimated uncertainty of the value function works as a guidance to adjust the range of augmentation automatically, ensuring the adaptability and reliability of the state perturbation. The proposed algorithm Uncertainty-Aware Data Augmentation(UADA), is plugged into various standard offline RL algorithms and evaluated on several offline reinforcement learning tasks. The empirical results confirm that UADA substantially improves the performance and achieves better model stability compared with the original algorithms.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning
    Wei, Xiaochen
    Huang, Wenzhen
    Zhai, Ziming
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 178 - 193
  • [32] Decision Making for Human-in-the-loop Robotic Agents via Uncertainty-Aware Reinforcement Learning
    Singi, Siddharth
    He, Zhanpeng
    Pan, Alvin
    Patel, Sandip
    Sigurdsson, Gunnar A.
    Piramuthu, Robinson
    Song, Shuran
    Ciocarlie, Matei
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 7939 - 7945
  • [33] Uncertainty-aware deep learning for monitoring and fault diagnosis from synthetic data
    Das, Laya
    Gjorgiev, Blazhe
    Sansavini, Giovanni
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 251
  • [34] GUMBLE: Uncertainty-Aware Conditional Mobile Data Generation Using Bayesian Learning
    Skocaj, Marco
    Amorosa, Lorenzo Mario
    Lombardi, Michele
    Verdone, Roberto
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 13158 - 13171
  • [35] Uncertainty-aware data pipeline of calibrated MEMS sensors used for machine learning
    Dorst T.
    Gruber M.
    Seeger B.
    Vedurmudi A.P.
    Schneider T.
    Eichstädt S.
    Schütze A.
    Measurement: Sensors, 2022, 22
  • [36] Uncertainty-Aware Bootstrap Learning for Joint Extraction on Distantly-Supervised Data
    Li, Yufei
    Yu, Xiao
    Liu, Yanchi
    Chen, Haifeng
    Liu, Cong
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1349 - 1358
  • [37] Vision-Based Uncertainty-Aware Lane Keeping Strategy Using Deep Reinforcement Learning
    Kim, Myounghoe
    Seo, Joohwan
    Lee, Mingoo
    Choi, Jongeun
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2021, 143 (08):
  • [38] Discrete Uncertainty Quantification For Offline Reinforcement Learning
    Perez, Jose Luis
    Corrochano, Javier
    Garcia, Javier
    Majadas, Ruben
    Ibanez-Llano, Cristina
    Perez, Sergio
    Fernandez, Fernando
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (04) : 273 - 287
  • [39] EfficientDeRain plus : Learning Uncertainty-Aware Filtering via RainMix Augmentation for High-Efficiency Deraining
    Guo, Qing
    Qi, Hua
    Sun, Jingyang
    Juefei-Xu, Felix
    Ma, Lei
    Lin, Di
    Feng, Wei
    Wang, Song
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2111 - 2135
  • [40] NPCL: Neural Processes for Uncertainty-Aware Continual Learning
    Jha, Saurav
    Gong, Dong
    Zhao, He
    Yao, Lina
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,