Ess-InfoGAIL: Semi-supervised Imitation Learning from Imbalanced Demonstrations

被引:0
|
作者
Fu, Huiqiao [1 ]
Tang, Kaiqiang [1 ]
Lu, Yuanyang [1 ]
Qi, Yiming [1 ]
Deng, Guizhou [1 ]
Sung, Flood [2 ]
Chen, Chunlin [1 ]
机构
[1] Nanjing Univ, Nanjing, Peoples R China
[2] Moonshot AI, Beijing, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning aims to reproduce expert behaviors without relying on an explicit reward signal. However, real-world demonstrations often present challenges, such as multi-modal, data imbalance, and expensive labeling processes. In this work, we propose a novel semi-supervised imitation learning architecture that learns disentangled behavior representations from imbalanced demonstrations using limited labeled data. Specifically, our method consists of three key components. First, we adapt the concept of semi-supervised generative adversarial networks to the imitation learning context. Second, we employ a learnable latent distribution to align the generated and expert data distributions. Finally, we utilize a regularized information maximization approach in conjunction with an approximate label prior to further improve the semi-supervised learning performance. Experimental results demonstrate the efficiency of our method in learning multi-modal behaviors from imbalanced demonstrations compared to baseline methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
    Li, Yunzhu
    Song, Jiaming
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving
    Lee, Gunmin
    Oh, Wooseok
    Oh, Jeongwoo
    Shin, Seungyoun
    Kim, Dohyeong
    Jeong, Jaeyeon
    Choi, Sungjoon
    Oh, Songhwai
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 20 - 25
  • [3] A survey of class-imbalanced semi-supervised learning
    Gui, Qian
    Zhou, Hong
    Guo, Na
    Niu, Baoning
    MACHINE LEARNING, 2024, 113 (08) : 5057 - 5086
  • [4] Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding
    Guo, Lan-Zhe
    Li, Yu-Feng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [5] Imbalanced fault diagnosis based on semi-supervised ensemble learning
    Chuanxia Jian
    Yinhui Ao
    Journal of Intelligent Manufacturing, 2023, 34 : 3143 - 3158
  • [6] A semi-supervised resampling method for class-imbalanced learning
    Jiang, Zhen
    Zhao, Lingyun
    Lu, Yu
    Zhan, Yongzhao
    Mao, Qirong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [7] Imbalanced fault diagnosis based on semi-supervised ensemble learning
    Jian, Chuanxia
    Ao, Yinhui
    JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (07) : 3143 - 3158
  • [8] A SEMI-SUPERVISED LEARNING ALGORITHM BASED ON SVM FOR IMBALANCED DATA
    Du, Limin
    Xu, Yang
    He, Xingxing
    UNCERTAINTY MODELLING IN KNOWLEDGE ENGINEERING AND DECISION MAKING, 2016, 10 : 194 - 200
  • [9] Semi-supervised Learning for Instrument Detection with a Class Imbalanced Dataset
    Yoon, Jihun
    Lee, Jiwon
    Park, SungHyun
    Hyung, Woo Jin
    Choi, Min-Kook
    INTERPRETABLE AND ANNOTATION-EFFICIENT LEARNING FOR MEDICAL IMAGE COMPUTING, IMIMIC 2020, MIL3ID 2020, LABELS 2020, 2020, 12446 : 266 - 276
  • [10] Graph Based Semi-supervised Learning Method for Imbalanced Dataset
    Zhang, Chenguang
    Zhang, Yan
    Zhang, Xiahuan
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 4040 - 4044