Initial State Interventions for Deconfounded Imitation Learning

被引:0
|
作者
Pfrommer, Samuel [1 ]
Bai, Yatong [1 ]
Lee, Hyunin [1 ]
Sojoudi, Somayeh [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CDC49753.2023.10383252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imitation learning suffers from causal confusion. This phenomenon occurs when learned policies attend to features that do not causally influence the expert actions but are instead spuriously correlated. Causally confused agents produce low open-loop supervised loss but poor closed-loop performance upon deployment. We consider the problem of masking observed confounders in a disentangled representation of the observation space. Our novel masking algorithm leverages the usual ability to intervene in the initial system state, avoiding any requirement involving expert querying, expert reward functions, or causal graph specification. Under certain assumptions, we theoretically prove that this algorithm is conservative in the sense that it does not incorrectly mask observations that causally influence the expert; furthermore, intervening on the initial state serves to strictly reduce excess conservatism. The masking algorithm is applied to behavior cloning for two illustrative control systems: CartPole and Reacher.
引用
收藏
页码:2312 / 2319
页数:8
相关论文
共 50 条
  • [41] Social Learning and Imitation
    Mekeel, H. Scudder
    AMERICAN SOCIOLOGICAL REVIEW, 1942, 7 (06) : 872 - 874
  • [42] Imitation: learning and communication
    Andry, P
    Moga, S
    Gaussier, P
    Revel, A
    Nadel, J
    FROM ANIMALS TO ANIMATS 6, 2000, : 353 - 362
  • [43] Social Learning and Imitation
    Young, Kimball
    AMERICAN ANTHROPOLOGIST, 1943, 45 (01) : 144 - 146
  • [44] SOCIAL LEARNING AND IMITATION
    Hilgard, Ernest R.
    CHARACTER AND PERSONALITY, 1942, 10 (03): : 247 - 250
  • [45] Learning birdsong by imitation
    Clayton, David F.
    SCIENCE, 2019, 366 (6461) : 33 - 34
  • [46] SOCIAL LEARNING AND IMITATION
    Blackwell, Gordon W.
    SOCIAL FORCES, 1942, 21 (02) : 256 - 256
  • [47] Social Learning and Imitation
    Johnson, Donald M.
    JOURNAL OF SOCIAL PSYCHOLOGY, 1942, 16 (01): : 163 - 167
  • [48] SOCIAL LEARNING AND IMITATION
    Gault, Robert H.
    JOURNAL OF CRIMINAL LAW & CRIMINOLOGY, 1942, 32 (05): : 569 - 570
  • [49] Social Learning and Imitation
    Young, Kimball
    AMERICAN JOURNAL OF SOCIOLOGY, 1942, 48 (03) : 436 - 439
  • [50] Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning
    Chaudhury, Subhajit
    Kimura, Daiki
    Munawar, Asim
    Tachibana, Ryuki
    2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,