Hierarchical Imitation Learning for Stochastic Environments

被引:0
|
作者
Igl, Maximilian [1 ]
Shah, Punit [1 ]
Mougin, Paul [1 ]
Srinivasan, Sirish [1 ]
Gupta, Tarun [1 ]
White, Brandyn [1 ]
Shiarlis, Kyriacos [1 ]
Whiteson, Shimon [1 ]
机构
[1] Waymo Res, Mountain View, CA 94043 USA
关键词
D O I
10.1109/IROS55552.2023.10341451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications of imitation learning require the agent to generate the full distribution of behaviour observed in the training data. For example, to evaluate the safety of autonomous vehicles in simulation, accurate and diverse behaviour models of other road users are paramount. Existing methods that improve this distributional realism typically rely on hierarchical policies. These condition the policy on types such as goals or personas that give rise to multi-modal behaviour. However, such methods are often inappropriate for stochastic environments where the agent must also react to external factors: because agent types are inferred from the observed future trajectory during training, these environments require that the contributions of internal and external factors to the agent behaviour are disentangled and only internal factors, i.e., those under the agent's control, are encoded in the type. Encoding future information about external factors leads to inappropriate agent reactions during testing, when the future is unknown and types must be drawn independently from the actual future. We formalize this challenge as distribution shift in the conditional distribution of agent types under environmental stochasticity. We propose Robust Type Conditioning (RTC), which eliminates this shift with adversarial training under randomly sampled types. Experiments on two domains, including the large-scale Waymo Open Motion Dataset, show improved distributional realism while maintaining or improving task performance compared to state-of-the-art baselines.
引用
收藏
页码:1697 / 1704
页数:8
相关论文
共 50 条
  • [1] Hierarchical Imitation and Reinforcement Learning
    Le, Hoang M.
    Jiang, Nan
    Agarwal, Alekh
    Dudik, Miroslav
    Yue, Yisong
    Daume, Hal, III
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] Learning by imitation: A hierarchical approach
    Byrne, RW
    Russon, AE
    BEHAVIORAL AND BRAIN SCIENCES, 1998, 21 (05) : 667 - +
  • [3] Imitation Learning in Uncertain Environments
    Priesterjahn, Steffen
    Eberling, Markus
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN X, PROCEEDINGS, 2008, 5199 : 950 - 960
  • [4] Active Imitation Learning of Hierarchical Policies
    Hamidi, Mandana
    Tadepalli, Prasad
    Goetschalckx, Robby
    Fern, Alan
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3554 - 3560
  • [5] SHAIL: Safety-Aware Hierarchical Adversarial Imitation Learning for Autonomous Driving in Urban Environments
    Jamgochian, Arec
    Buehrle, Etienne
    Fischer, Johannes
    Kochenderfer, Mykel J.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1530 - 1536
  • [6] Imitation Modelling Of Differential Hydromagnetic Survey In Stochastic Environments
    Kochetov M.V.
    Geology, 2019, 2019 (03) : 99 - 103
  • [7] Provable Hierarchical Imitation Learning via EM
    Zhang, Zhiyu
    Paschalidis, Ioannis Ch
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [8] Spectral Method of Moments for Hierarchical Imitation Learning
    Nguyen Nguyen
    Molloy, Timothy L.
    Nair, Girish N.
    Paschalidis, Ioannis Ch.
    IFAC PAPERSONLINE, 2023, 56 (02): : 10101 - 10106
  • [9] STOCHASTIC IMITATION OF INSTRUMENTAL REFLEX AT PROBABILISTIC LEARNING
    SALTYKOV, AB
    SMIRNOV, IV
    STARSHOV, VP
    ZHURNAL VYSSHEI NERVNOI DEYATELNOSTI IMENI I P PAVLOVA, 1989, 39 (05) : 974 - 981
  • [10] Transferring Hierarchical Structure with Dual Meta Imitation Learning
    Gao, Chongkai
    Jiang, Yizhou
    Chen, Feng
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 762 - 773