Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving

Cited by: 13
Authors
Prakash, Aditya [1]
Behl, Aseem [1, 2]
Ohn-Bar, Eshed [1, 3]
Chitta, Kashyap [1, 2]
Geiger, Andreas [1, 2]
Affiliations
[1] Max Planck Inst Intelligent Syst, Tubingen, Germany
[2] Univ Tubingen, Tubingen, Germany
[3] Boston Univ, Boston, MA USA
Source
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020
DOI
10.1109/CVPR42600.2020.01178
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Data aggregation techniques can significantly improve vision-based policy learning within a training environment, e.g., learning to drive in a specific simulation condition. However, as on-policy data is sequentially sampled and added in an iterative manner, the policy can specialize and overfit to the training conditions. For real-world applications, it is useful for the learned policy to generalize to novel scenarios that differ from the training conditions. To improve policy learning while maintaining robustness when training end-to-end driving policies, we perform an extensive analysis of data aggregation techniques in the CARLA environment. We demonstrate how the majority of them have poor generalization performance, and develop a novel approach with empirically better generalization performance compared to existing techniques. Our two key ideas are (1) to sample critical states from the collected on-policy data based on the utility they provide to the learned policy in terms of driving behavior, and (2) to incorporate a replay buffer which progressively focuses on the high uncertainty regions of the policy's state distribution. We evaluate the proposed approach on the CARLA NoCrash benchmark, focusing on the most challenging driving scenarios with dense pedestrian and vehicle traffic. Our approach improves driving success rate by 16% over state-of-the-art, achieving 87% of the expert performance while also reducing the collision rate by an order of magnitude without the use of any additional modality, auxiliary tasks, architectural modifications or reward from the environment.
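The two key ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how utility or uncertainty is measured, so `utility`, `uncertainty`, `keep_fraction`, and `capacity` are hypothetical placeholders, and the "keep the highest-scoring items" rule is an assumed simplification of both the sampling step and the replay buffer.

```python
import heapq
from typing import Callable, List, Sequence, Tuple


def sample_critical_states(
    states: Sequence[float],
    utility: Callable[[float], float],
    keep_fraction: float = 0.5,
) -> List[float]:
    """Idea (1), simplified: keep the top fraction of on-policy states by utility.

    `utility` is a hypothetical scoring function supplied by the caller.
    """
    if not states:
        return []
    k = max(1, int(len(states) * keep_fraction))
    return heapq.nlargest(k, states, key=utility)


class UncertaintyReplayBuffer:
    """Idea (2), simplified: a bounded buffer that retains high-uncertainty states."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.items: List[Tuple[float, float]] = []  # (uncertainty, state)

    def add(self, scored_states: Sequence[Tuple[float, float]]) -> None:
        self.items.extend(scored_states)
        # Sort by uncertainty, highest first, then drop everything past capacity.
        self.items.sort(key=lambda pair: pair[0], reverse=True)
        del self.items[self.capacity:]


def aggregate(
    rollouts: Sequence[Sequence[float]],
    utility: Callable[[float], float],
    uncertainty: Callable[[float], float],
    capacity: int,
    keep_fraction: float = 0.5,
) -> UncertaintyReplayBuffer:
    """Run one aggregation pass per rollout and return the resulting buffer."""
    buffer = UncertaintyReplayBuffer(capacity)
    for states in rollouts:
        critical = sample_critical_states(states, utility, keep_fraction)
        buffer.add([(uncertainty(s), s) for s in critical])
    return buffer


# Toy usage: states are plain floats, utility is magnitude, uncertainty is the square.
toy_rollouts = [[0.1, -0.9, 0.5, 0.2], [0.8, -0.3]]
toy_buffer = aggregate(toy_rollouts, utility=abs, uncertainty=lambda s: s * s, capacity=2)
print([state for _, state in toy_buffer.items])
```

In a real driving setup the states would be camera frames with expert labels, and a policy retraining step would follow each aggregation pass. Both are omitted here to keep the sketch self-contained.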
Pages: 11760-11770
Number of pages: 11