Continual Driving Policy Optimization with Closed-Loop Individualized Curricula

被引:0
|
作者
Ni, Haoyi [1 ]
Xu, Yizhou [1 ]
Jiang, Xingjian [1 ]
Hu, Jianming [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
SAFETY;
D O I
10.1109/ICRA57147.2024.10611578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The safety of autonomous vehicles (AV) has been a long-standing top concern, stemming from the absence of rare and safety-critical scenarios in the long-tail naturalistic driving distribution. To tackle this challenge, a surge of research in scenario-based autonomous driving has emerged, with a focus on generating high-risk driving scenarios and applying them to conduct safety-critical testing of AV models. However, limited work has been explored on the reuse of these extensive scenarios to iteratively improve AV models. Moreover, it remains intractable and challenging to filter through gigantic scenario libraries collected from other AV models with distinct behaviors, attempting to extract transferable information for current AV improvement. Therefore, we develop a continual driving policy optimization framework featuring Closed-Loop Individualized Curricula (CLIC), which we factorize into a set of standardized sub-modules for flexible implementation choices: AV Evaluation, Scenario Selection, and AV Training. CLIC frames AV Evaluation as a collision prediction task, where it estimates the chance of AV failures in these scenarios at each iteration. Subsequently, by re-sampling from historical scenarios based on these failure probabilities, CLIC tailors individualized curricula for downstream training, aligning them with the evaluated capability of AV. Accordingly, CLIC not only maximizes the utilization of the vast pre-collected scenario library for closed-loop driving policy optimization but also facilitates AV improvement by individualizing its training with more challenging cases out of those poorly organized scenarios. Experimental results clearly indicate that CLIC surpasses other curriculum-based training strategies, showing substantial improvement in managing risky scenarios, while still maintaining proficiency in handling simpler cases.
引用
收藏
页码:6850 / 6857
页数:8
相关论文
共 50 条
  • [11] Individualized closed-loop anesthesia through patient model partitioning
    Wahlquist, Ylva
    van Heusden, Klaske
    Dumont, Guy A.
    Soltesz, Kristian
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 361 - 364
  • [12] Individualized closed-loop control of propofol anesthesia: A preliminary study
    Soltesz, Kristian
    Hahn, Jin-Oh
    Hagglund, Tore
    Dumont, Guy A.
    Ansermino, J. Mark
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2013, 8 (06) : 500 - 508
  • [13] Individualized treatment of motor stroke: A perspective on open-loop, closed-loop and adaptive closed-loop brain state-dependent TMS
    Roesch, Johanna
    Vetter, David Emanuel
    Baldassarre, Antonello
    Souza, Victor H.
    Lioumis, Pantelis
    Roine, Timo
    Jooss, Andreas
    Baur, David
    Kozak, Gabor
    Jovellar, D. Blair
    Vaalto, Selja
    Romani, Gian Luca
    Ilmoniemi, Risto J.
    Ziemann, Ulf
    CLINICAL NEUROPHYSIOLOGY, 2024, 158 : 204 - 211
  • [14] CLOSED-LOOP PROBLEMS IN BIOMECHANICS - AN OPTIMIZATION APPROACH
    VAUGHAN, CL
    HAY, JG
    ANDREWS, JG
    JOURNAL OF BIOMECHANICS, 1982, 15 (03) : 201 - 210
  • [15] WHATS WRONG WITH UNIT CLOSED-LOOP OPTIMIZATION
    FRIEDMAN, YZ
    HYDROCARBON PROCESSING, 1995, 74 (10): : 107 - &
  • [16] The optimization of the closed-loop supply chain network
    Yang, Guang-fen
    Wang, Zhi-ping
    Li, Xiao-qiang
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2009, 45 (01) : 16 - 28
  • [17] DISTILLATION - CLOSED-LOOP OPTIMIZATION OF DISTILLATION ENERGY
    MARTIN, GD
    LATOUR, PR
    RICHARD, LA
    CHEMICAL ENGINEERING PROGRESS, 1981, 77 (09) : 33 - 37
  • [18] Successful closed-loop olefins plant optimization
    Brewer, WM
    Lopez, SF
    HYDROCARBON PROCESSING, 1998, 77 (06): : 83 - +
  • [19] Closed-loop experimental optimization of tunable lenses
    Lopez -De -Haro, Angel G.
    Barcala, Xoana
    Martinez-Ibarburu, Ivan
    Marrakchi, Yassine
    Gambra, Enrique
    Rodriguez-Lopez, Victor
    Sawides, Lucie
    Dorronsoro, Carlos
    APPLIED OPTICS, 2022, 61 (27) : 8091 - 8099
  • [20] Production Optimization in Closed-Loop Reservoir Management
    Wang, Chunhong
    Li, Gaoming
    Reynolds, Albert C.
    SPE JOURNAL, 2009, 14 (03): : 506 - 523