Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning

被引:0
|
作者
Su, Yunjie [1 ]
Kong, Yilun [1 ]
Wang, Xueqian [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
Data augmentation; Uncertainty estimation; Out of distribution; Offline reinforcement learning;
D O I
10.1109/IJCNN54540.2023.10191211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the key challenges in Offline Reinforcement Learning is that it cannot conduct further environment exploration and performs poorly in terms of out-of-distribution generalizations. Data augmentation is commonly used to solve the issue of limited coverage of the full state-action space in static offline dataset. However, the existing data augmentation methods for proprioceptive observation suffer from the dilemma where the data coverage is often limited by tight constraints, while aggressive methods may exacerbate the performance. At the heart of this phenomenon are the diverged action distribution and the high uncertainty of the value function. In this paper, we propose to extend the static offline datasets during training by adding gradient-based perturbation to the state and utilizing the estimated uncertainty of the value function to constrain the range of the gradient. The estimated uncertainty of the value function works as a guidance to adjust the range of augmentation automatically, ensuring the adaptability and reliability of the state perturbation. The proposed algorithm Uncertainty-Aware Data Augmentation(UADA), is plugged into various standard offline RL algorithms and evaluated on several offline reinforcement learning tasks. The empirical results confirm that UADA substantially improves the performance and achieves better model stability compared with the original algorithms.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Uncertainty-Aware Model-Based Offline Reinforcement Learning for Automated Driving
    Diehl, Christopher
    Sievernich, Timo Sebastian
    Kruger, Martin
    Hoffmann, Frank
    Bertram, Torsten
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 1167 - 1174
  • [2] Uncertainty-Aware Data Augmentation for Food Recognition
    Aguilar, Eduardo
    Nagarajan, Bhalaji
    Khantun, Rupali
    Bolanos, Marc
    Radeva, Petia
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4017 - 4024
  • [3] Uncertainty-Aware Reinforcement Learning for Portfolio Optimization
    Enkhsaikhan, Bayaraa
    Jo, Ohyun
    IEEE ACCESS, 2024, 12 : 166553 - 166563
  • [4] Uncertainty-aware autonomous sensing with deep reinforcement learning
    Murad, Abdulmajid
    Kraemer, Frank Alexander
    Bach, Kerstin
    Taylor, Gavin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 156 : 242 - 253
  • [5] Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
    Nguyen, Thanh
    Luu, Tung M.
    Ton, Tri
    Kim, Sungwoong
    Yoo, Chang D.
    IEEE ACCESS, 2024, 12 : 100972 - 100982
  • [6] Boundary Data Augmentation for Offline Reinforcement Learning
    SHEN Jiahao
    JIANG Ke
    TAN Xiaoyang
    ZTE Communications, 2023, 21 (03) : 29 - 36
  • [7] Uncertainty-Aware Pedestrian Crossing Prediction via Reinforcement Learning
    Dai, Siyang
    Liu, Jun
    Cheung, Ngai-Man
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9540 - 9549
  • [8] Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents
    Da Silva, Felipe Lena
    Hernandez-Leal, Pablo
    Kartal, Bilal
    Taylor, Matthew E.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5792 - 5799
  • [9] Uncertainty-Aware Data Aggregation for Deep Imitation Learning
    Cui, Yuchen
    Isele, David
    Niekum, Scott
    Fujimura, Kikuo
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 761 - 767
  • [10] Uncertainty-aware hierarchical reinforcement learning for long-horizon tasks
    Wenning Hu
    Hongbin Wang
    Ming He
    Nianbin Wang
    Applied Intelligence, 2023, 53 : 28555 - 28569