Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving

Cited by: 13

Authors:

Prakash, Aditya [1]

Behl, Aseem [1,2]

Ohn-Bar, Eshed [1,3]

Chitta, Kashyap [1,2]

Geiger, Andreas [1,2]

Affiliations:

[1] Max Planck Inst Intelligent Syst, Tubingen, Germany

[2] Univ Tubingen, Tubingen, Germany

[3] Boston Univ, Boston, MA USA

Source:

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020

DOI:

10.1109/CVPR42600.2020.01178

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Data aggregation techniques can significantly improve vision-based policy learning within a training environment, e.g., learning to drive in a specific simulation condition. However, as on-policy data is sequentially sampled and added in an iterative manner, the policy can specialize and overfit to the training conditions. For real-world applications, it is useful for the learned policy to generalize to novel scenarios that differ from the training conditions. To improve policy learning while maintaining robustness when training end-to-end driving policies, we perform an extensive analysis of data aggregation techniques in the CARLA environment. We demonstrate how the majority of them have poor generalization performance, and develop a novel approach with empirically better generalization performance compared to existing techniques. Our two key ideas are (1) to sample critical states from the collected on-policy data based on the utility they provide to the learned policy in terms of driving behavior, and (2) to incorporate a replay buffer which progressively focuses on the high uncertainty regions of the policy's state distribution. We evaluate the proposed approach on the CARLA NoCrash benchmark, focusing on the most challenging driving scenarios with dense pedestrian and vehicle traffic. Our approach improves driving success rate by 16% over state-of-the-art, achieving 87% of the expert performance while also reducing the collision rate by an order of magnitude without the use of any additional modality, auxiliary tasks, architectural modifications or reward from the environment.

Pages: 11760-11770

Page count: 11
