MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning

被引：46

作者：

Li, Quanyi ^{[1
]}

Peng, Zhenghao ^{[2
]}

Feng, Lan ^{[4
]}

Zhang, Qihang ^{[3
]}

Xue, Zhenghai ^{[3
]}

Zhou, Bolei ^{[5
]}

机构：

[1] Chinese Univ Hong Kong, Ctr Perceptual & Interact Intelligence, Hong Kong, Peoples R China

[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[3] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China

[4] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[5] Univ Calif Los Angeles, Los Angeles, CA 90095 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 03期

关键词：

Task analysis; Roads; Reinforcement learning; Benchmark testing; Training; Safety; Autonomous vehicles; autonomous driving; simulation;

D O I：

10.1109/TPAMI.2022.3190471

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Driving safely requires multiple capabilities from human and intelligent agents, such as the generalizability to unseen environments, the safety awareness of the surrounding traffic, and the decision-making in complex multi-agent settings. Despite the great success of Reinforcement Learning (RL), most of the RL research works investigate each capability separately due to the lack of integrated environments. In this work, we develop a new driving simulation platform called MetaDrive to support the research of generalizable reinforcement learning algorithms for machine autonomy. MetaDrive is highly compositional, which can generate an infinite number of diverse driving scenarios from both the procedural generation and the real data importing. Based on MetaDrive, we construct a variety of RL tasks and baselines in both single-agent and multi-agent settings, including benchmarking generalizability across unseen scenes, safe exploration, and learning multi-agent traffic. The generalization experiments conducted on both procedurally generated scenarios and real-world scenarios show that increasing the diversity and the size of the training set leads to the improvement of the RL agent's generalizability. We further evaluate various safe reinforcement learning and multi-agent reinforcement learning algorithms in MetaDrive environments and provide the benchmarks. Source code, documentation, and demo video are available at https://metadriverse.github.io/metadrive.

引用

页码：3461 / 3475

页数：15

共 50 条

[31] Generalizable Crowd Counting via Diverse Context Style Learning
Zhao, Wenda
Wang, Mingyue
Liu, Yu
Lu, Huimin
Xu, Congan
Yao, Libo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5399 - 5410
[32] Generalizable control for quantum parameter estimation through reinforcement learning
Han Xu
Junning Li
Liqiang Liu
Yu Wang
Haidong Yuan
Xin Wang
npj Quantum Information, 5
[33] Generalizable control for quantum parameter estimation through reinforcement learning
Xu, Han
Li, Junning
Liu, Liqiang
Wang, Yu
Yuan, Haidong
Wang, Xin
NPJ QUANTUM INFORMATION, 2019, 5 (1)
[34] BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning
Chen, Xuan
Guo, Wenbo
Tao, Guanhong
Zhang, Xiangyu
Song, Dawn
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[35] Investigating Value of Curriculum Reinforcement Learning in Autonomous Driving Under Diverse Road and Weather Conditions
Ozturk, Anil
Gunel, Mustafa Burak
Dagdanov, Resul
Vural, Mira Ekim
Yurdakul, Ferhat
Dal, Melih
Ure, Nazim Kemal
2021 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2021, : 358 - 363
[36] Unsupervised and Generalizable Wireless Positioning via Variational Reinforcement Learning
Zhang, Jiankun
Wang, Hao
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[37] Static and Dynamic Collision Avoidance for Autonomous Robot Navigation in Diverse Scenarios based on Deep Reinforcement Learning
Pico, Nabih
Lee, Beomjoon
Montero, Estrella
Tadese, Meseret
Auh, Eugene
Doh, Myeongyun
Moon, Hyungpil
2023 20TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR, 2023, : 281 - 286
[38] Hierarchical Reinforcement Learning-Based Policy Switching Towards Multi-Scenarios Autonomous Driving
Guo, Youtian
Zhang, Qichao
Wang, Junjie
Liu, Shasha
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[39] DriveSceneGen: Generating Diverse and Realistic Driving Scenarios From Scratch
Sun, Shuo
Gu, Zekai
Sun, Tianchen
Sun, Jiawei
Yuan, Chengran
Han, Yuhang
Li, Dongen
Ang Jr, Marcelo H.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7007 - 7014
[40] Accelerating reinforcement learning by composing solutions of automatically identified subtasks
Drummond, C
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 16 : 59 - 104

← 1 2 3 4 5 →