Sample-efficient Adversarial Imitation Learning

被引:0
|
作者
Jung, Dahuin [1 ]
Lee, Hyungyu [1 ]
Yoon, Sungroh [2 ]
机构
[1] Electrical and Computer Engineering, Seoul National University, Seoul,08826, Korea, Republic of
[2] Electrical and Computer Engineering, Interdisciplinary Program in Artificial Intelligence, Seoul National University, Seoul,08826, Korea, Republic of
基金
新加坡国家研究基金会;
关键词
Decision making - Demonstrations - Learning systems - Supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous expert demonstration samples to successfully imitate an expert’s behavior. To improve sample efficiency, we utilize self-supervised representation learning, which can generate vast training signals from the given data. In this study, we propose a self-supervised representation-based adversarial imitation learning method to learn state and action representations that are robust to diverse distortions and temporally predictive, on non-image control tasks. In particular, in comparison with existing self-supervised learning methods for tabular data, we propose a different corruption method for state and action representations that is robust to diverse distortions. We theoretically and empirically observe that making an informative feature manifold with less sample complexity significantly improves the performance of imitation learning. The proposed method shows a 39% relative improvement over existing adversarial imitation learning methods on MuJoCo in a setting limited to 100 expert state-action pairs. Moreover, we conduct comprehensive ablations and additional experiments using demonstrations with varying optimality to provide insights into a range of factors. ©2024 Dahuin Jung, Hyungyu Lee, and Sungroh Yoon.
引用
收藏
页码:1 / 32
相关论文
共 50 条
  • [31] Sample-efficient deep learning for accelerating photonic inverse design
    Hegde, Ravi
    OSA CONTINUUM, 2021, 4 (03): : 1019 - 1033
  • [32] Safe and Sample-Efficient Reinforcement Learning for Clustered Dynamic Environments
    Chen, Hongyi
    Liu, Changliu
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 1928 - 1933
  • [33] Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
    Qiao, Dan
    Yin, Ming
    Min, Ming
    Wang, Yu-Xiang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [34] Sample-Efficient Reinforcement Learning of Partially Observable Markov Games
    Liu, Qinghua
    Szepesvari, Csaba
    Jin, Chi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [35] Sample-Efficient Deep Reinforcement Learning with Directed Associative Graph
    Dujia Yang
    Xiaowei Qin
    Xiaodong Xu
    Chensheng Li
    Guo Wei
    中国通信, 2021, 18 (06) : 100 - 113
  • [36] Sample-Efficient Proper PAC Learning with Approximate Differential Privacy
    Ghazi, Badih
    Golowich, Noah
    Kumar, Ravi
    Manurangsi, Pasin
    STOC '21: PROCEEDINGS OF THE 53RD ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2021, : 183 - 196
  • [37] Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
    Ma, Guozheng
    Zhang, Linrui
    Wang, Haoyu
    Li, Lu
    Wang, Zilin
    Wang, Zhen
    Shen, Li
    Wang, Xueqian
    Tao, Dacheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [38] Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions
    Shang, Zhiwei
    Li, Renxing
    Zheng, Chunhua
    Li, Huiyun
    Cui, Yunduan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 475 - 485
  • [39] A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks
    Abernethy, Jacob
    Agarwal, Alekh
    Marinov, Teodor V.
    Warmuth, Manfred K.
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [40] Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions
    Shang, Zhiwei
    Li, Renxing
    Zheng, Chunhua
    Li, Huiyun
    Cui, Yunduan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 475 - 485