Adversarial Imitation Learning with Trajectorial Augmentation and Correction

Cited by: 6
Authors
Antotsiou, Dafni [1]
Ciliberto, Carlo [1]
Kim, Tae-Kyun [1]
Affiliations
[1] Imperial Coll London, EEE Dept, London, England
Keywords
DOI
10.1109/ICRA48506.2021.9561915
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
Pages: 4724-4730
Number of pages: 7
Related papers
50 items in total
  • [1] Generative Adversarial Imitation Learning
    Ho, Jonathan
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [2] What Matters for Adversarial Imitation Learning?
    Orsini, Manu
    Raichuk, Anton
    Hussenot, Leonard
    Vincent, Damien
    Dadashi, Robert
    Girgin, Sertan
    Geist, Matthieu
    Bachem, Olivier
    Pietquin, Olivier
    Andrychowicz, Marcin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [3] Quantum generative adversarial imitation learning
    Xiao, Tailong
    Huang, Jingzheng
    Li, Hongjing
    Fan, Jianping
    Zeng, Guihua
    NEW JOURNAL OF PHYSICS, 2023, 25 (03):
  • [4] DiffAIL: Diffusion Adversarial Imitation Learning
    Wang, Bingzheng
    Wu, Guoqiang
    Pang, Teng
    Zhang, Yan
    Yin, Yilong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15447 - 15455
  • [5] Deterministic generative adversarial imitation learning
    Zuo, Guoyu
    Chen, Kexin
    Lu, Jiahao
    Huang, Xiangsheng
    NEUROCOMPUTING, 2020, 388 : 60 - 69
  • [6] A Bayesian Approach to Generative Adversarial Imitation Learning
    Jeon, Wonseok
    Seo, Seokin
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [7] Sample-efficient Adversarial Imitation Learning
    Jung, Dahuin
    Lee, Hyungyu
    Yoon, Sungroh
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [8] Combating False Negatives in Adversarial Imitation Learning
    Zolna, Konrad
    Saharia, Chitwan
    Boussioux, Leonard
    Hui, David Yu-Tung
    Chevalier-Boisvert, Maxime
    Bahdanau, Dzmitry
    Bengio, Yoshua
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Adversarial Imitation Learning via Random Search
    Shin, MyungJae
    Kim, Joongheon
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [10] Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
    Wang, Yunke
    Du, Bo
    Xu, Chang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10262 - 10270