Data-Efficient Offline Reinforcement Learning with Approximate Symmetries

Cited by: 0
Authors
Angelotti, Giorgio [1, 2]
Drougard, Nicolas [1, 2]
Chanel, Caroline P. C. [1, 2]
Affiliations
[1] Univ Toulouse, ANITI, Toulouse, France
[2] Univ Toulouse, ISAE Supaero, Toulouse, France
Keywords
Offline reinforcement learning; Approximate symmetries; Data augmentation
DOI
10.1007/978-3-031-55326-4_8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The performance of Offline Reinforcement Learning (ORL) models in Markov Decision Processes (MDPs) is heavily contingent upon the quality and diversity of the training data. This research furthers the exploration of expert-guided symmetry detection and data augmentation techniques by considering approximate symmetries in discrete MDPs, providing a fresh perspective on data efficiency in the domain of ORL. We scrutinize the adaptability and resilience of these established methodologies in varied stochastic environments, featuring alterations in transition probabilities with respect to the already tested stochastic environments. Key findings from these investigations elucidate the potential of approximate symmetries for the data augmentation process and confirm the robustness of the existing methods under altered stochastic conditions. Our analysis reinforces the applicability of the established symmetry detection techniques in diverse scenarios while opening new horizons for enhancing the efficiency of ORL models.
Pages: 164-186
Page count: 23
Related papers
50 in total
  • [21] A Data-Efficient Method of Deep Reinforcement Learning for Chinese Chess
    Xu, Changming
    Ding, Hengfeng
    Zhang, Xuejian
    Wang, Cong
    Yang, Hongji
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 687 - 693
  • [22] Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
    Maulana, Muhammad Rizki
    Lee, Wee Sun
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 122 - 138
  • [23] Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
    Jiang, Zhaohui
    Weng, Paul
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 556 - 572
  • [24] Shielded Planning Guided Data-Efficient and Safe Reinforcement Learning
    Wang, Hao
    Qin, Jiahu
    Kan, Zhen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3808 - 3819
  • [25] Data Based Optimal Control with Neural Networks and Data-Efficient Reinforcement Learning
    Runkler, Thomas A.
    Udluft, Steffen
    Duell, Siegmund
    AT-AUTOMATISIERUNGSTECHNIK, 2012, 60 (10) : 641 - 647
  • [26] A Data-Efficient Reinforcement Learning Method Based on Local Koopman Operators
    Song, Lixing
    Wang, Junheng
    Xu, Junhong
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 515 - 520
  • [27] Data-efficient model-based reinforcement learning with trajectory discrimination
    Tuo Qu
    Fuqing Duan
    Junge Zhang
    Bo Zhao
    Wenzhen Huang
    Complex & Intelligent Systems, 2024, 10 : 1927 - 1936
  • [28] DATA-EFFICIENT MODEL-BASED REINFORCEMENT LEARNING FOR ROBOT CONTROL
    Sun, Ming
    Gao, Yue
    Liu, Wei
    Li, Shaoyuan
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (04) : 211 - 218
  • [29] Data-Efficient Hierarchical Reinforcement Learning for Robotic Assembly Control Applications
    Hou, Zhimin
    Fei, Jiajun
    Deng, Yuelin
    Xu, Jing
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (11) : 11565 - 11575
  • [30] Data-efficient model-based reinforcement learning with trajectory discrimination
    Qu, Tuo
    Duan, Fuqing
    Zhang, Junge
    Zhao, Bo
    Huang, Wenzhen
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1927 - 1936