To Follow or not to Follow: Selective Imitation Learning from Observations

被引:0
|
作者
Lee, Youngwoon [1 ]
Hu, Edward S. [1 ]
Yang, Zhengyu [1 ]
Lim, Joseph J. [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
来源
关键词
imitation learning; hierarchical reinforcement learning; deep learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Learning from demonstrations is a useful way to transfer a skill from one agent to another. While most imitation learning methods aim to mimic an expert skill by following the demonstration step-by-step, imitating every step in the demonstration often becomes infeasible when the learner and its environment are different from the demonstration. In this paper, we propose a method that can imitate a demonstration composed solely of observations, which may not be reproducible with the current agent. Our method, dubbed selective imitation learning from observations (SILO), selects reachable states in the demonstration and learns how to reach the selected states. Our experiments on both simulated and real robot environments show that our method reliably performs a new task by following a demonstration. Videos and code are available at https://clvrai.com/silo
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Observations on hysteroplasty: Technique, results and follow-up
    Borruto, F
    Fistarol, M
    GYNAKOLOGISCH-GEBURTSHILFLICHE RUNDSCHAU, 1997, 37 (01): : 48 - 51
  • [42] MASTER Prompt and Follow-Up GRB Observations
    Tyurina, Nataly
    Lipunov, Vladimir
    Kornilov, Victor
    Gorbovskoy, Evgeny
    Shatskij, Nikolaj
    Kuvshinov, Dmitry
    Balanutsa, Pavel
    Belinski, Alexander
    Krushinsky, Vadim
    Zalozhnyh, Ivan
    Tlatov, Andrey
    Parkhomenko, Alexander
    Ivanov, Kirill
    Yazev, Sergey
    Kortunov, Peter
    Sankovich, Anatoly
    Kuznetsov, Artem
    Yurkov, Vladimir
    ADVANCES IN ASTRONOMY, 2010, 2010
  • [43] Follow-up observations of the binary system γ Cep
    Mugrauer, Markus
    Schlagenhauf, Saskia
    Buder, Sven
    Ginski, Christian
    Fernandez, Matilde
    ASTRONOMISCHE NACHRICHTEN, 2022, 343 (05)
  • [44] Observations in hypertensive adolescents: A ten year follow up
    Handa, SP
    Downey, P
    JOURNAL OF HYPERTENSION, 2002, 20 : S229 - S229
  • [45] A SURVEY AND FOLLOW-UP OBSERVATIONS OF STARBURST GALAXIES
    MAEHARA, H
    TAKASE, B
    HEIDMANN, J
    IAU SYMPOSIA, 1987, (115): : 655 - 657
  • [46] Imitation Learning from Vague Feedback
    Cai, Xin-Qiang
    Zhang, Yu-Jie
    Chiang, Chao-Kai
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] Imitation Learning from Imperfect Demonstration
    Wu, Yueh-Hua
    Charoenphakdee, Nontawat
    Bao, Han
    Tangkaratt, Voot
    Sugiyama, Masashi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [48] NTT follow-up observations of star cluster candidates from the FSR catalogue
    Froebrich, D.
    Meusinger, H.
    Scholz, A.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2008, 390 (04) : 1598 - 1618
  • [49] Follow the Cut, Follow the Rhythm, Follow the Material
    Edgeworth, Matt
    NORWEGIAN ARCHAEOLOGICAL REVIEW, 2012, 45 (01) : 76 - 92
  • [50] Fundamental Observations on a "Follow-up of accident victims suffering from traumatic Neurosis"
    Schwarz, Hanns
    NERVENARZT, 1929, 2 (01): : 54 - 54