To Follow or not to Follow: Selective Imitation Learning from Observations

被引:0
|
作者
Lee, Youngwoon [1 ]
Hu, Edward S. [1 ]
Yang, Zhengyu [1 ]
Lim, Joseph J. [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
来源
关键词
imitation learning; hierarchical reinforcement learning; deep learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Learning from demonstrations is a useful way to transfer a skill from one agent to another. While most imitation learning methods aim to mimic an expert skill by following the demonstration step-by-step, imitating every step in the demonstration often becomes infeasible when the learner and its environment are different from the demonstration. In this paper, we propose a method that can imitate a demonstration composed solely of observations, which may not be reproducible with the current agent. Our method, dubbed selective imitation learning from observations (SILO), selects reachable states in the demonstration and learns how to reach the selected states. Our experiments on both simulated and real robot environments show that our method reliably performs a new task by following a demonstration. Videos and code are available at https://clvrai.com/silo
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Competitiveness in follow-on drug R&D: a race or imitation?
    DiMasi, Joseph A.
    Faden, Laura B.
    NATURE REVIEWS DRUG DISCOVERY, 2011, 10 (01) : 23 - 27
  • [22] Imitation Learning from Observation
    Torabi, Faraz
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9900 - 9901
  • [23] Follow up ability for GRB observations on Swift
    McLean, K
    Fenimore, E
    Palmer, D
    Barthelmy, S
    Gehrels, N
    Krimm, H
    Markwardt, C
    Parsons, A
    Tueller, J
    Stephens, M
    NUOVO CIMENTO DELLA SOCIETA ITALIANA DI FISICA C-COLLOQUIA ON PHYSICS, 2005, 28 (4-5): : 837 - 840
  • [24] Follow up ability for GRB observations on Swift
    McLean, K.
    Fenimore, E.
    Palmer, D.
    Barthelmy, S.
    Gehrels, N.
    Krimm, H.
    Markwardt, C.
    Parsons, A.
    Tueller, J.
    Stephens, M.
    NUOVO CIMENTO C-COLLOQUIA AND COMMUNICATIONS IN PHYSICS, 2005, 28 (4-5): : 837 - 840
  • [25] Spectroscopic and photometric follow-up observations
    Latham, David W.
    Transiting Extrasolar Planets Workshop, 2007, 366 : 203 - 208
  • [26] Selective Sampling and Imitation Learning via Online Regression
    Sekhari, Ayush
    Sridharan, Karthik
    Sun, Wen
    Wu, Runzhe
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Follow up your unexpected clinical observations!
    Dahl, Olav
    ACTA ONCOLOGICA, 2009, 48 (03) : 325 - 327
  • [28] Cross-domain Imitation from Observations
    Raychaudhuri, Dripta S.
    Paul, Sujoy
    van Baar, Jeroen
    Roy-Chowdhury, Amit K.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [29] To Follow or Not to Follow: Estimating Political Opinion From Twitter Data Using a Network-Based Machine Learning Approach
    Brandenstein, Nils
    Montag, Christian
    Sindermann, Cornelia
    SOCIAL SCIENCE COMPUTER REVIEW, 2024,
  • [30] SELECTIVE AMYGDALOHIPPOCAMPECTOMY - INDICATIONS AND FOLLOW-UP
    WIESER, HG
    CANADIAN JOURNAL OF NEUROLOGICAL SCIENCES, 1991, 18 (04) : 617 - 627