MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning

Cited by: 46
Authors
Li, Quanyi [1]
Peng, Zhenghao [2]
Feng, Lan [4]
Zhang, Qihang [3]
Xue, Zhenghai [3]
Zhou, Bolei [5]
Affiliations
[1] Chinese Univ Hong Kong, Ctr Perceptual & Interact Intelligence, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China
[4] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[5] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
Keywords
Task analysis; Roads; Reinforcement learning; Benchmark testing; Training; Safety; Autonomous vehicles; autonomous driving; simulation
DOI
10.1109/TPAMI.2022.3190471
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Driving safely requires multiple capabilities from human and intelligent agents, such as the generalizability to unseen environments, the safety awareness of the surrounding traffic, and the decision-making in complex multi-agent settings. Despite the great success of Reinforcement Learning (RL), most of the RL research works investigate each capability separately due to the lack of integrated environments. In this work, we develop a new driving simulation platform called MetaDrive to support the research of generalizable reinforcement learning algorithms for machine autonomy. MetaDrive is highly compositional, which can generate an infinite number of diverse driving scenarios from both the procedural generation and the real data importing. Based on MetaDrive, we construct a variety of RL tasks and baselines in both single-agent and multi-agent settings, including benchmarking generalizability across unseen scenes, safe exploration, and learning multi-agent traffic. The generalization experiments conducted on both procedurally generated scenarios and real-world scenarios show that increasing the diversity and the size of the training set leads to the improvement of the RL agent's generalizability. We further evaluate various safe reinforcement learning and multi-agent reinforcement learning algorithms in MetaDrive environments and provide the benchmarks. Source code, documentation, and demo video are available at https://metadriverse.github.io/metadrive.
Pages: 3461-3475
Number of pages: 15