Adversarial Imitation Learning with Trajectorial Augmentation and Correction

Cited by: 6
Authors
Antotsiou, Dafni [1]
Ciliberto, Carlo [1]
Kim, Tae-Kyun [1]
Affiliations
[1] Imperial Coll London, EEE Dept, London, England
DOI
10.1109/ICRA48506.2021.9561915
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
Pages: 4724-4730
Number of pages: 7
Related papers
50 in total
  • [21] Emergence of Chaotic Time Series by Adversarial Imitation Learning
    Yamazaki, Seiya
    Iizuka, Hiroyuki
    Yamamoto, Masahito
    2018 CONFERENCE ON ARTIFICIAL LIFE (ALIFE 2018), 2018, : 659 - 664
  • [22] Generative Adversarial Imitation Learning from Failed Experiences
    Zhu, Jiacheng
    Lin, Jiahao
    Wang, Meng
    Chen, Yingfeng
    Fan, Changjie
    Jiang, Chong
    Zhang, Zongzhang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13997 - 13998
  • [23] Visual Adversarial Imitation Learning using Variational Models
    Rafailov, Rafael
    Yu, Tianhe
    Rajeswaran, Aravind
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [24] Ranking-Based Generative Adversarial Imitation Learning
    Shi, Zhipeng
    Zhang, Xuehe
    Fang, Yu
    Li, Changle
    Liu, Gangfeng
    Zhao, Jie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10): : 8967 - 8974
  • [25] Complexity of bird song caused by adversarial imitation learning
    Yamazaki, Seiya
    Iizuka, Hiroyuki
    Yamamoto, Masahito
    ARTIFICIAL LIFE AND ROBOTICS, 2020, 25 (01) : 124 - 132
  • [26] Adversarial Imitation Learning with Controllable Rewards for Text Generation
    Nishikino, Keizaburo
    Kobayashi, Kenichi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 131 - 146
  • [27] Multi-Agent Generative Adversarial Imitation Learning
    Song, Jiaming
    Ren, Hongyu
    Sadigh, Dorsa
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [28] Multimodal Storytelling via Generative Adversarial Imitation Learning
    Chen, Zhiqian
    Zhang, Xuchao
    Boedihardjo, Arnold P.
    Dai, Jing
    Lu, Chang-Tien
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3967 - 3973
  • [29] ARC - Actor Residual Critic for Adversarial Imitation Learning
    Deka, Ankur
    Liu, Changliu
    Sycara, Katia
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1446 - 1456
  • [30] Risk-Sensitive Generative Adversarial Imitation Learning
    Lacotte, Jonathan
    Ghavamzadeh, Mohammad
    Chow, Yinlam
    Pavone, Marco
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89