Initial State Interventions for Deconfounded Imitation Learning

被引:0
|
作者
Pfrommer, Samuel [1 ]
Bai, Yatong [1 ]
Lee, Hyunin [1 ]
Sojoudi, Somayeh [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CDC49753.2023.10383252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imitation learning suffers from causal confusion. This phenomenon occurs when learned policies attend to features that do not causally influence the expert actions but are instead spuriously correlated. Causally confused agents produce low open-loop supervised loss but poor closed-loop performance upon deployment. We consider the problem of masking observed confounders in a disentangled representation of the observation space. Our novel masking algorithm leverages the usual ability to intervene in the initial system state, avoiding any requirement involving expert querying, expert reward functions, or causal graph specification. Under certain assumptions, we theoretically prove that this algorithm is conservative in the sense that it does not incorrectly mask observations that causally influence the expert; furthermore, intervening on the initial state serves to strictly reduce excess conservatism. The masking algorithm is applied to behavior cloning for two illustrative control systems: CartPole and Reacher.
引用
收藏
页码:2312 / 2319
页数:8
相关论文
共 50 条
  • [1] State Aware Imitation Learning
    Schroecker, Yannick
    Isbell, Charles
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Backdoor Defense via Deconfounded Representation Learning
    Zhang, Zaixi
    Liu, Qi
    Wang, Zhicai
    Lu, Zepu
    Hu, Qingyong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12228 - 12238
  • [3] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [4] DeVLBert: Learning Deconfounded Visio-Linguistic Representations
    Zhang, Shengyu
    Jiang, Tan
    Wang, Tan
    Kuang, Kun
    Zhao, Zhou
    Zhu, Jianke
    Yu, Jin
    Yang, Hongxia
    Wu, Fei
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4373 - 4382
  • [5] Imitation learning by state-only distribution matching
    Damian Boborzi
    Christoph-Nikolas Straehle
    Jens S. Buchner
    Lars Mikelsons
    Applied Intelligence, 2023, 53 : 30865 - 30886
  • [6] Learning the optimal state-feedback via supervised imitation learning
    Dharmesh Tailor
    Dario Izzo
    Astrodynamics, 2019, 3 : 361 - 374
  • [7] Learning the optimal state-feedback via supervised imitation learning
    Tailor, Dharmesh
    Izzo, Dario
    ASTRODYNAMICS, 2019, 3 (04) : 361 - 374
  • [8] State-Only Imitation Learning for Dexterous Manipulation
    Radosavovic, Ilija
    Wang, Xiaolong
    Pinto, Lerrel
    Malik, Jitendra
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7865 - 7871
  • [9] Imitation Learning as State Matching via Differentiable Physics
    Chen, Siwei
    Ma, Xiao
    Xu, Zhongwen
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7846 - 7855
  • [10] Imitation learning by state-only distribution matching
    Boborzi, Damian
    Straehle, Christoph-Nikolas
    Buchner, Jens S.
    Mikelsons, Lars
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30911 - 30926