Sample-efficient Adversarial Imitation Learning

被引:0
|
作者
Jung, Dahuin [1 ]
Lee, Hyungyu [1 ]
Yoon, Sungroh [2 ]
机构
[1] Electrical and Computer Engineering, Seoul National University, Seoul,08826, Korea, Republic of
[2] Electrical and Computer Engineering, Interdisciplinary Program in Artificial Intelligence, Seoul National University, Seoul,08826, Korea, Republic of
基金
新加坡国家研究基金会;
关键词
Decision making - Demonstrations - Learning systems - Supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous expert demonstration samples to successfully imitate an expert’s behavior. To improve sample efficiency, we utilize self-supervised representation learning, which can generate vast training signals from the given data. In this study, we propose a self-supervised representation-based adversarial imitation learning method to learn state and action representations that are robust to diverse distortions and temporally predictive, on non-image control tasks. In particular, in comparison with existing self-supervised learning methods for tabular data, we propose a different corruption method for state and action representations that is robust to diverse distortions. We theoretically and empirically observe that making an informative feature manifold with less sample complexity significantly improves the performance of imitation learning. The proposed method shows a 39% relative improvement over existing adversarial imitation learning methods on MuJoCo in a setting limited to 100 expert state-action pairs. Moreover, we conduct comprehensive ablations and additional experiments using demonstrations with varying optimality to provide insights into a range of factors. ©2024 Dahuin Jung, Hyungyu Lee, and Sungroh Yoon.
引用
收藏
页码:1 / 32
相关论文
共 50 条
  • [41] TEXPLORE: real-time sample-efficient reinforcement learning for robots
    Hester, Todd
    Stone, Peter
    MACHINE LEARNING, 2013, 90 (03) : 385 - 429
  • [42] Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
    Xie, Tengyang
    Jiang, Nan
    Wang, Huan
    Xiong, Caiming
    Bai, Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] TEXPLORE: real-time sample-efficient reinforcement learning for robots
    Todd Hester
    Peter Stone
    Machine Learning, 2013, 90 : 385 - 429
  • [44] Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning
    Guo, Jeff
    Schwaller, Philippe
    JACS AU, 2024, 4 (06): : 2160 - 2172
  • [45] Robust Humanoid Locomotion Using Trajectory Optimization and Sample-Efficient Learning
    Yeganegi, Mohammad Hasan
    Khadiv, Majid
    Moosavian, S. Ali A.
    Zhu, Jia-Jie
    Del Prete, Andrea
    Righetti, Ludovic
    2019 IEEE-RAS 19TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2019, : 170 - 177
  • [46] Sample-efficient model-based reinforcement learning for quantum control
    Khalid, Irtaza
    Weidner, Carrie A.
    Jonckheere, Edmond A.
    Schirmer, Sophie G.
    Langbein, Frank C.
    PHYSICAL REVIEW RESEARCH, 2023, 5 (04):
  • [47] Active Code Learning: Benchmarking Sample-Efficient Training of Code Models
    Hu, Qiang
    Guo, Yuejun
    Xie, Xiaofei
    Cordy, Maxime
    Ma, Lei
    Papadakis, Mike
    Le Traon, Yves
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (05) : 1080 - 1095
  • [48] Sample-Efficient Blockage Prediction and Handover Using Causal Reinforcement Learning
    Kanagamani, Tamizharasan
    Sadasivan, Jishnu
    Banerjee, Serene
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [49] Sample-efficient Learning for Edge Resource Allocation and Pricing with BNN Approximators
    Tutuncuoglu, Feridun
    Dan, Gyorgy
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [50] Surrogate models for device design using sample-efficient Deep Learning?
    Patel, Rutu
    Mohapatra, Nihar R.
    Hegde, Ravi S.
    SOLID-STATE ELECTRONICS, 2023, 199