To Follow or not to Follow: Selective Imitation Learning from Observations

Cited by: 0
Authors
Lee, Youngwoon [1]
Hu, Edward S. [1]
Yang, Zhengyu [1]
Lim, Joseph J. [1]
Affiliations
[1] University of Southern California, Department of Computer Science, Los Angeles, CA 90007, USA
Source
Keywords
imitation learning; hierarchical reinforcement learning; deep learning
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Learning from demonstrations is a useful way to transfer a skill from one agent to another. While most imitation learning methods aim to mimic an expert skill by following the demonstration step by step, imitating every step often becomes infeasible when the learner and its environment differ from those in the demonstration. In this paper, we propose a method that can imitate a demonstration composed solely of observations, even when the demonstration may not be fully reproducible by the current agent. Our method, dubbed selective imitation learning from observations (SILO), selects reachable states in the demonstration and learns how to reach the selected states. Our experiments in both simulated and real robot environments show that our method reliably performs a new task by following a demonstration. Videos and code are available at https://clvrai.com/silo.
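
The idea of selecting reachable demonstration states and then reaching them can be illustrated with a toy sketch. This is not the authors' implementation: SILO learns both the selection and the reaching behavior, whereas the sketch below assumes a hand-written reachability check, a 1-D integer state, and a fixed look-ahead window. All names here (`select_next_index`, `step_toward`, `follow_selectively`, `window`, `is_reachable`) are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Optional, Tuple


def select_next_index(
    demo: List[int],
    current_index: int,
    window: int,
    is_reachable: Callable[[int], bool],
) -> Optional[int]:
    """Return the index of the first reachable demonstration state within
    the look-ahead window after current_index, or None if there is none."""
    last = min(len(demo), current_index + 1 + window)
    for i in range(current_index + 1, last):
        if is_reachable(demo[i]):
            return i
    return None


def step_toward(state: int, goal: int) -> int:
    """Hand-written stand-in for a learned low-level policy: move one unit."""
    if state < goal:
        return state + 1
    if state > goal:
        return state - 1
    return state


def follow_selectively(
    demo: List[int],
    start: int,
    window: int,
    is_reachable: Callable[[int], bool],
    max_steps: int = 100,
) -> Tuple[List[int], int]:
    """Follow only the demonstration states the learner can reach.

    Returns the indices of the demonstration states that were followed
    and the learner's final state.
    """
    state, index, steps = start, -1, 0
    followed: List[int] = []
    while True:
        nxt = select_next_index(demo, index, window, is_reachable)
        if nxt is None:
            break  # nothing reachable inside the window: stop following
        goal = demo[nxt]
        while state != goal and steps < max_steps:
            state = step_toward(state, goal)
            steps += 1
        if state != goal:
            break  # step budget exhausted before reaching the sub-goal
        followed.append(nxt)
        index = nxt
    return followed, state


if __name__ == "__main__":
    # Assumed toy setup: the learner can only occupy positions 0..5,
    # so demonstration states 9 and 8 are skipped.
    demo_states = [1, 2, 9, 3, 8, 4]
    print(follow_selectively(demo_states, 0, 2, lambda s: 0 <= s <= 5))
```

In this toy setting the learner skips the demonstration states it cannot occupy and still ends at the demonstration's final state. If the window is too small to look past an unreachable state, the loop stops early, which shows why the choice of which state to follow matters.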
Pages: 13