State Aware Imitation Learning

被引:0
|
作者
Schroecker, Yannick [1 ]
Isbell, Charles [1 ]
机构
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
关键词
AVERAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning is the study of learning how to act given a set of demonstrations provided by a human expert. It is intuitively apparent that learning to take optimal actions is a simpler undertaking in situations that are similar to the ones shown by the teacher. However, imitation learning approaches do not tend to use this insight directly. In this paper, we introduce State Aware Imitation Learning (SAIL), an imitation learning algorithm that allows an agent to learn how to remain in states where it can confidently take the correct action and how to recover if it is lead astray. Key to this algorithm is a gradient learned using a temporal difference update rule which leads the agent to prefer states similar to the demonstrated states. We show that estimating a linear approximation of this gradient yields similar theoretical guarantees to online temporal difference learning approaches and empirically show that SAIL can effectively be used for imitation learning in continuous domains with non-linear function approximators used for both the policy representation and the gradient estimate.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] SOCIAL LEARNING AND IMITATION
    WODTKE, KH
    BROWN, BR
    REVIEW OF EDUCATIONAL RESEARCH, 1967, 37 (05) : 514 - 538
  • [42] SOCIAL LEARNING AND IMITATION
    Roheim, Geza
    PSYCHOANALYTIC QUARTERLY, 1943, 12 (02): : 280 - 281
  • [43] Social Learning and Imitation
    Sletto, Raymond F.
    ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 1942, 220 : 267 - 268
  • [44] Social learning and imitation
    Wolfle, Dael
    PSYCHOLOGICAL BULLETIN, 1942, 39 (02) : 128 - 129
  • [45] Social Learning and Imitation
    Mekeel, H. Scudder
    AMERICAN SOCIOLOGICAL REVIEW, 1942, 7 (06) : 872 - 874
  • [46] Imitation: learning and communication
    Andry, P
    Moga, S
    Gaussier, P
    Revel, A
    Nadel, J
    FROM ANIMALS TO ANIMATS 6, 2000, : 353 - 362
  • [47] Social Learning and Imitation
    Young, Kimball
    AMERICAN ANTHROPOLOGIST, 1943, 45 (01) : 144 - 146
  • [48] SOCIAL LEARNING AND IMITATION
    Hilgard, Ernest R.
    CHARACTER AND PERSONALITY, 1942, 10 (03): : 247 - 250
  • [49] Learning birdsong by imitation
    Clayton, David F.
    SCIENCE, 2019, 366 (6461) : 33 - 34
  • [50] SOCIAL LEARNING AND IMITATION
    Blackwell, Gordon W.
    SOCIAL FORCES, 1942, 21 (02) : 256 - 256