State Aware Imitation Learning

被引:0
|
作者
Schroecker, Yannick [1 ]
Isbell, Charles [1 ]
机构
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
关键词
AVERAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning is the study of learning how to act given a set of demonstrations provided by a human expert. It is intuitively apparent that learning to take optimal actions is a simpler undertaking in situations that are similar to the ones shown by the teacher. However, imitation learning approaches do not tend to use this insight directly. In this paper, we introduce State Aware Imitation Learning (SAIL), an imitation learning algorithm that allows an agent to learn how to remain in states where it can confidently take the correct action and how to recover if it is lead astray. Key to this algorithm is a gradient learned using a temporal difference update rule which leads the agent to prefer states similar to the demonstrated states. We show that estimating a linear approximation of this gradient yields similar theoretical guarantees to online temporal difference learning approaches and empirically show that SAIL can effectively be used for imitation learning in continuous domains with non-linear function approximators used for both the policy representation and the gradient estimate.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving
    Cultrera, Luca
    Becattini, Federico
    Seidenari, Lorenzo
    Pala, Pietro
    Del Bimbo, Alberto
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2946 - 2955
  • [2] Extraneousness-Aware Imitation Learning
    Zheng, Ray Chen
    Hu, Kaizhe
    Yuan, Zhecheng
    Chen, Boyuan
    Xu, Huazhe
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2973 - 2979
  • [3] Adversarial Option-Aware Hierarchical Imitation Learning
    Jing, Mingxuan
    Huang, Wenbing
    Sunk, Fuchun
    Ma, Xiaojian
    Kong, Tao
    Gan, Chuang
    Li, Lei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] Uncertainty-Aware Data Aggregation for Deep Imitation Learning
    Cui, Yuchen
    Isele, David
    Niekum, Scott
    Fujimura, Kikuo
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 761 - 767
  • [5] Initial State Interventions for Deconfounded Imitation Learning
    Pfrommer, Samuel
    Bai, Yatong
    Lee, Hyunin
    Sojoudi, Somayeh
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2312 - 2319
  • [6] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [7] Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality
    Zhang, Songyuan
    Cao, Zhangjie
    Sadigh, Dorsa
    Sui, Yanan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
    Park, Jongjin
    Seo, Younggyo
    Liu, Chang
    Zhao, Li
    Qin, Tao
    Shin, Jinwoo
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems
    Wang, Yuening
    Zhang, Yingxue
    Valkanas, Antonios
    Tang, Ruiming
    Ma, Chen
    Hao, Jianye
    Coates, Mark
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4711 - 4719
  • [10] Uncertainty-Aware Imitation Learning using Kernelized Movement Primitives
    Silverio, Joao
    Huang, Yanlong
    Abu-Dakka, Fares J.
    Rozo, Leonel
    Caldwell, Darwin G.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 90 - 97