To Follow or not to Follow: Selective Imitation Learning from Observations

被引:0
|
作者
Lee, Youngwoon [1 ]
Hu, Edward S. [1 ]
Yang, Zhengyu [1 ]
Lim, Joseph J. [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
来源
关键词
imitation learning; hierarchical reinforcement learning; deep learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Learning from demonstrations is a useful way to transfer a skill from one agent to another. While most imitation learning methods aim to mimic an expert skill by following the demonstration step-by-step, imitating every step in the demonstration often becomes infeasible when the learner and its environment are different from the demonstration. In this paper, we propose a method that can imitate a demonstration composed solely of observations, which may not be reproducible with the current agent. Our method, dubbed selective imitation learning from observations (SILO), selects reachable states in the demonstration and learns how to reach the selected states. Our experiments on both simulated and real robot environments show that our method reliably performs a new task by following a demonstration. Videos and code are available at https://clvrai.com/silo
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Follow the Clairvoyant: an Imitation Learning Approach to Optimal Control
    Martin, Andrea
    Furieri, Luca
    Dorfler, Florian
    Lygeros, John
    Ferrari-Trecate, Giancarlo
    IFAC PAPERSONLINE, 2023, 56 (02): : 2589 - 2594
  • [2] Sequential robot imitation learning from observations
    Tanwani, Ajay Kumar
    Yan, Andy
    Lee, Jonathan
    Calinon, Sylvain
    Goldberg, Ken
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2021, 40 (10-11): : 1306 - 1325
  • [3] Follow the genuine leader: The "green imitation"
    Calderon, Reyes
    Ortiz De Urbina, Maria
    Exposito, Luis
    BUSINESS ETHICS THE ENVIRONMENT & RESPONSIBILITY, 2023, 32 (02): : 570 - 581
  • [4] Off-Policy Imitation Learning from Observations
    Zhu, Zhuangdi
    Lin, Kaixiang
    Dai, Bo
    Zhou, Jiayu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [5] VERBAL IMITATION IN RETARDATES - FOLLOW-UP
    FOREHAND, R
    CALHOUN, K
    PERCEPTUAL AND MOTOR SKILLS, 1973, 36 (01) : 74 - 74
  • [6] Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
    Yang, Chao
    Ma, Xiaojian
    Huang, Wenbing
    Sun, Fuchun
    Liu, Huaping
    Huang, Junzhou
    Gan, Chuang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] A posteriori control densities: Imitation learning from partial observations
    Lefebvre, Tom
    Crevecoeur, Guillaume
    PATTERN RECOGNITION LETTERS, 2023, 169 : 87 - 94
  • [8] Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective
    Fan, Youlin
    Jiu, Bo
    Pu, Wenqiang
    Li, Ziniu
    Li, Kang
    Liu, Hongwei
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 4098 - 4114
  • [9] Learning to Follow the Trails
    Bird, Johannah
    CANADIAN LITERATURE, 2023, (253): : 163 - 168
  • [10] LEARNING TO FOLLOW IN PIGEONS
    HOGAN, DE
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 498 - 498