Adversarial Imitation Learning from Incomplete Demonstrations

被引:0
|
作者
Sun, Mingfei [1 ]
Xiaojuan [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning targets deriving a mapping from states to actions, a.k.a. policy, from expert demonstrations. Existing methods for imitation learning typically require any actions in the demonstrations to be fully available, which is hard to ensure in real applications. Though algorithms for learning with unobservable actions have been proposed, they focus solely on state information and overlook the fact that the action sequence could still be partially available and provide useful information for policy deriving. In this paper, we propose a novel algorithm called Action-Guided Adversarial Imitation Learning (AGAIL) that learns a policy from demonstrations with incomplete action sequences, i.e., incomplete demonstrations. The core idea of AGAIL is to separate demonstrations into state and action trajectories, and train a policy with state trajectories while using actions as auxiliary information to guide the training whenever applicable. Built upon the Generative Adversarial Imitation Learning, AGAIL has three components: a generator, a discriminator, and a guide. The generator learns a policy with rewards provided by the discriminator, which tries to distinguish state distributions between demonstrations and samples generated by the policy. The guide provides additional rewards to the generator when demonstrated actions for specific states are available. We compare AGAIL to other methods on benchmark tasks and show that AGAIL consistently delivers comparable performance to the state-of-the-art methods even when the action sequence in demonstrations is only partially available.
引用
收藏
页码:3513 / 3519
页数:7
相关论文
共 50 条
  • [31] Generative Adversarial Network for Imitation Learning from Single Demonstration
    Tho Nguyen Duc
    Chanh Minh Tran
    Phan Xuan Tan
    Kamioka, Eiji
    BAGHDAD SCIENCE JOURNAL, 2021, 18 (04) : 1350 - 1355
  • [32] Deterministic generative adversarial imitation learning
    Zuo, Guoyu
    Chen, Kexin
    Lu, Jiahao
    Huang, Xiangsheng
    NEUROCOMPUTING, 2020, 388 : 60 - 69
  • [33] Adversarial Imitation Learning from Video using a State Observer
    Karnan, Haresh
    Torabi, Faraz
    Warnell, Garrett
    Stone, Peter
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2452 - 2458
  • [34] Robustness to Adversarial Perturbations in Learning from Incomplete Data
    Najafi, Amir
    Maeda, Shin-ichi
    Koyama, Masanori
    Miyato, Takeru
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [35] Beyond rational imitation: Learning arbitrary means actions from communicative demonstrations
    Kiraly, Ildiko
    Csibra, Gergely
    Gergely, Gyorgy
    JOURNAL OF EXPERIMENTAL CHILD PSYCHOLOGY, 2013, 116 (02) : 471 - 486
  • [36] Imitation learning from imperfect demonstrations for AUV path tracking and obstacle avoidance
    Chen, Tianhao
    Zhang, Zheng
    Fang, Zheng
    Jiang, Dong
    Li, Guangliang
    OCEAN ENGINEERING, 2024, 298
  • [37] Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations
    Cai, Yuanying
    Zhang, Chuheng
    Shen, Wei
    He, Xiaonan
    Zhang, Xuyun
    Huang, Longbo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 128 - 137
  • [38] Generative Adversarial Imitation Learning from Human Behavior with Reward Shaping
    Li, Jiangeng
    Huang, Shuai
    Xu, Xin
    Zuo, Guoyu
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6254 - 6259
  • [39] DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation
    Torabi, Faraz
    Warnell, Garrett
    Stone, Peter
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2391 - 2397
  • [40] A Bayesian Approach to Generative Adversarial Imitation Learning
    Jeon, Wonseok
    Seo, Seokin
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31