Continual Driving Policy Optimization with Closed-Loop Individualized Curricula

被引:0
|
作者
Ni, Haoyi [1 ]
Xu, Yizhou [1 ]
Jiang, Xingjian [1 ]
Hu, Jianming [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
SAFETY;
D O I
10.1109/ICRA57147.2024.10611578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The safety of autonomous vehicles (AV) has been a long-standing top concern, stemming from the absence of rare and safety-critical scenarios in the long-tail naturalistic driving distribution. To tackle this challenge, a surge of research in scenario-based autonomous driving has emerged, with a focus on generating high-risk driving scenarios and applying them to conduct safety-critical testing of AV models. However, limited work has been explored on the reuse of these extensive scenarios to iteratively improve AV models. Moreover, it remains intractable and challenging to filter through gigantic scenario libraries collected from other AV models with distinct behaviors, attempting to extract transferable information for current AV improvement. Therefore, we develop a continual driving policy optimization framework featuring Closed-Loop Individualized Curricula (CLIC), which we factorize into a set of standardized sub-modules for flexible implementation choices: AV Evaluation, Scenario Selection, and AV Training. CLIC frames AV Evaluation as a collision prediction task, where it estimates the chance of AV failures in these scenarios at each iteration. Subsequently, by re-sampling from historical scenarios based on these failure probabilities, CLIC tailors individualized curricula for downstream training, aligning them with the evaluated capability of AV. Accordingly, CLIC not only maximizes the utilization of the vast pre-collected scenario library for closed-loop driving policy optimization but also facilitates AV improvement by individualizing its training with more challenging cases out of those poorly organized scenarios. Experimental results clearly indicate that CLIC surpasses other curriculum-based training strategies, showing substantial improvement in managing risky scenarios, while still maintaining proficiency in handling simpler cases.
引用
收藏
页码:6850 / 6857
页数:8
相关论文
共 50 条
  • [21] Closed-loop feedback control for production optimization
    Dilib, F.A.
    Jackson, M.D.
    Dilib, F.A., 1600, Society of Petroleum Engineers (SPE) (64): : 99 - 101
  • [22] CLOSED-LOOP INDIVIDUALIZED TRANSCRANIAL ALTERNATING CURRENT STIMULATION FOR THE TREATMENT OF DEPRESSION
    Schwippel, Tobias
    Townsend, Leah
    Walker, Christopher
    Rubinow, David
    Froehlich, Flavio
    PSYCHOPHYSIOLOGY, 2023, 60 : S15 - S15
  • [23] Closed-loop Test Systems for highly automated Driving Functions
    Schiefenhoevel, Martin
    ATP MAGAZINE, 2021, (08): : 46 - 48
  • [24] CLOSED-LOOP
    WINTERFLOOD, AH
    ELECTRONICS & WIRELESS WORLD, 1984, 90 (1577): : 51 - 51
  • [25] CLOSED-LOOP
    LEATHER, M
    INDUSTRIAL DISTRIBUTION, 1977, 67 (05): : 294 - 294
  • [26] Closed-loop EEG study on visual recognition during driving
    Aydarkhanov, Ruslan
    Uscumlic, Marija
    Chavarriaga, Ricardo
    Gheorghe, Lucian
    Millan, Jose del R.
    JOURNAL OF NEURAL ENGINEERING, 2021, 18 (02)
  • [27] Double closed-loop driving system for resonant tactile sensor
    Wang, Haodong
    Shen, Jingjin
    Xu, Rongqing
    Kong, Meimei
    Wang, Hongyi
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3152 - 3156
  • [28] Consignment stock policy in a closed-loop supply chain
    Chakraborty, A.
    Maiti, Tarun
    Giri, B. C.
    RAIRO-OPERATIONS RESEARCH, 2021, 55 : S1913 - S1934
  • [29] NeuroNCAP: Photorealistic Closed-Loop Safety Testing for Autonomous Driving
    Ljungbergh, William
    Tonderski, Adam
    Johnander, Joakim
    Caesar, Holger
    Astrom, Kalle
    Felsberg, Michael
    Petersson, Christoffer
    COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088 : 161 - 177
  • [30] Closed-Loop Optimization of Guidance Gain for Constrained Impact
    Liu, Xinfu
    Shen, Zuojun
    Lu, Ping
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (02) : 453 - 460