Adversarial Imitation Learning with Trajectorial Augmentation and Correction

被引:6
|
作者
Antotsiou, Dafni [1 ]
Ciliberto, Carlo [1 ]
Kim, Tae-Kyun [1 ]
机构
[1] Imperial Coll London, EEE Dept, London, England
关键词
D O I
10.1109/ICRA48506.2021.9561915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
引用
收藏
页码:4724 / 4730
页数:7
相关论文
共 50 条
  • [41] Adversarial imitation learning with mixed demonstrations from multiple demonstrators
    Zuo, Guoyu
    Zhao, Qishen
    Huang, Shuai
    Li, Jiangeng
    Gong, Daoxiong
    NEUROCOMPUTING, 2021, 457 (457) : 365 - 376
  • [42] GACS: Generative Adversarial Imitation Learning Based on Control Sharing
    Huaiwei SI
    Guozhen TAN
    Dongyu LI
    Yanfei PENG
    JournalofSystemsScienceandInformation, 2023, 11 (01) : 78 - 93
  • [43] Generative Adversarial Network for Imitation Learning from Single Demonstration
    Tho Nguyen Duc
    Chanh Minh Tran
    Phan Xuan Tan
    Kamioka, Eiji
    BAGHDAD SCIENCE JOURNAL, 2021, 18 (04) : 1350 - 1355
  • [44] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Tongtao Zhang
    Heng Ji
    Avirup Sil
    Data Intelligence, 2019, (02) : 99 - 120
  • [45] Improve generated adversarial imitation learning with reward variance regularization
    Zhang, Yi-Feng
    Luo, Fan-Ming
    Yu, Yang
    MACHINE LEARNING, 2022, 111 (03) : 977 - 995
  • [46] Adversarial imitation learning with deep attention network for swarm systems
    Wu, Yapei
    Wang, Tao
    Liu, Tong
    Zheng, Zhicheng
    Xu, Demin
    Peng, Xingguang
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [47] Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning
    Xu, Mai
    Yang, Li
    Tao, Xiaoming
    Duan, Yiping
    Wang, Zulin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2087 - 2102
  • [48] Imitation Learning for Playing Shogi Based on Generative Adversarial Networks
    Wan, Shanchuan
    Kaneko, Tomoyuki
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 92 - 95
  • [49] Generalization and Computation for Policy Classes of Generative Adversarial Imitation Learning
    Zhou, Yirui
    Zhang, Yangchun
    Liu, Xiaowei
    Wang, Wanying
    Che, Zhengping
    Xu, Zhiyuan
    Tang, Jian
    Peng, Yaxin
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT I, 2022, 13398 : 385 - 399
  • [50] Addressing implicit bias in adversarial imitation learning with mutual information
    Zhang, Lihua
    Liu, Quan
    Zhu, Fei
    Huang, Zhigang
    NEURAL NETWORKS, 2023, 167 : 847 - 864