MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning

被引:46
|
作者
Li, Quanyi [1 ]
Peng, Zhenghao [2 ]
Feng, Lan [4 ]
Zhang, Qihang [3 ]
Xue, Zhenghai [3 ]
Zhou, Bolei [5 ]
机构
[1] Chinese Univ Hong Kong, Ctr Perceptual & Interact Intelligence, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China
[4] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[5] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
关键词
Task analysis; Roads; Reinforcement learning; Benchmark testing; Training; Safety; Autonomous vehicles; autonomous driving; simulation;
D O I
10.1109/TPAMI.2022.3190471
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Driving safely requires multiple capabilities from human and intelligent agents, such as the generalizability to unseen environments, the safety awareness of the surrounding traffic, and the decision-making in complex multi-agent settings. Despite the great success of Reinforcement Learning (RL), most of the RL research works investigate each capability separately due to the lack of integrated environments. In this work, we develop a new driving simulation platform called MetaDrive to support the research of generalizable reinforcement learning algorithms for machine autonomy. MetaDrive is highly compositional, which can generate an infinite number of diverse driving scenarios from both the procedural generation and the real data importing. Based on MetaDrive, we construct a variety of RL tasks and baselines in both single-agent and multi-agent settings, including benchmarking generalizability across unseen scenes, safe exploration, and learning multi-agent traffic. The generalization experiments conducted on both procedurally generated scenarios and real-world scenarios show that increasing the diversity and the size of the training set leads to the improvement of the RL agent's generalizability. We further evaluate various safe reinforcement learning and multi-agent reinforcement learning algorithms in MetaDrive environments and provide the benchmarks. Source code, documentation, and demo video are available at https://metadriverse.github.io/metadrive.
引用
收藏
页码:3461 / 3475
页数:15
相关论文
共 50 条
  • [21] Safety-based Reinforcement Learning Longitudinal Decision for Autonomous Driving in Crosswalk Scenarios
    Xiong, Fangzhou
    Ren, Dongchun
    Fan, Mingyu
    Ding, Shuguang
    Liu, Zhiyong
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [22] Learning Automated Driving in Complex Intersection Scenarios Based on Camera Sensors: A Deep Reinforcement Learning Approach
    Li, Guofa
    Lin, Siyan
    Li, Shen
    Qu, Xingda
    IEEE SENSORS JOURNAL, 2022, 22 (05) : 4687 - 4696
  • [23] Deep Reinforcement Learning for Generalizable Field Development Optimization
    He, Jincong
    Tang, Meng
    Hu, Chaoshun
    Tanaka, Shusei
    Wang, Kainan
    Wen, Xian-Huan
    Nasir, Yusuf
    SPE JOURNAL, 2022, 27 (01): : 226 - 245
  • [24] Generalizable Reinforcement Learning-Based Coarsening Model for Resource Allocation over Large and Diverse Stream Processing Graphs
    Nie, Lanshun
    Qiu, Yuqi
    Meng, Fei
    Yu, Mo
    Li, Jing
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 435 - 445
  • [25] Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
    Lecerf, Ugo U. L.
    Yemdji-Tchassi, Christelle C. Y.
    Aubert, Sebastien S. A.
    Michiardi, Pietro P. M.
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 209 - 215
  • [26] Composing Synergistic Macro Actions for Reinforcement Learning Agents
    Chen, Yu-Ming
    Chang, Kaun-Yu
    Liu, Chien
    Hsiao, Tsu-Ching
    Hong, Zhang-Wei
    Lee, Chun-Yi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 7251 - 7258
  • [27] Parameter Sharing Reinforcement Learning for Modeling Multi-Agent Driving Behavior in Roundabout Scenarios
    Konstantinidis, Fabian
    Sackmann, Moritz
    De Candido, Oliver
    Hofmann, Ulrich
    Thielecke, Joern
    Utschick, Wolfgang
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1974 - 1981
  • [28] Reinforcement Learning Based Multi-objective Eco-driving Strategy in Urban Scenarios
    Li J.
    Wu X.
    Xu M.
    Liu Y.
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (10): : 1791 - 1802
  • [29] Motion Planner With Fixed-Horizon Constrained Reinforcement Learning for Complex Autonomous Driving Scenarios
    Lin, Ke
    Li, Yanjie
    Chen, Shiyu
    Li, Duantengchuan
    Wu, Xinyu
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1577 - 1588
  • [30] Reinforcement Learning for Shared Driving
    Ko, Sangjin
    Langari, Reza
    IFAC PAPERSONLINE, 2023, 56 (03): : 247 - 252