Continual Driving Policy Optimization with Closed-Loop Individualized Curricula

被引:0
|
作者
Ni, Haoyi [1 ]
Xu, Yizhou [1 ]
Jiang, Xingjian [1 ]
Hu, Jianming [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
SAFETY;
D O I
10.1109/ICRA57147.2024.10611578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The safety of autonomous vehicles (AV) has been a long-standing top concern, stemming from the absence of rare and safety-critical scenarios in the long-tail naturalistic driving distribution. To tackle this challenge, a surge of research in scenario-based autonomous driving has emerged, with a focus on generating high-risk driving scenarios and applying them to conduct safety-critical testing of AV models. However, limited work has been explored on the reuse of these extensive scenarios to iteratively improve AV models. Moreover, it remains intractable and challenging to filter through gigantic scenario libraries collected from other AV models with distinct behaviors, attempting to extract transferable information for current AV improvement. Therefore, we develop a continual driving policy optimization framework featuring Closed-Loop Individualized Curricula (CLIC), which we factorize into a set of standardized sub-modules for flexible implementation choices: AV Evaluation, Scenario Selection, and AV Training. CLIC frames AV Evaluation as a collision prediction task, where it estimates the chance of AV failures in these scenarios at each iteration. Subsequently, by re-sampling from historical scenarios based on these failure probabilities, CLIC tailors individualized curricula for downstream training, aligning them with the evaluated capability of AV. Accordingly, CLIC not only maximizes the utilization of the vast pre-collected scenario library for closed-loop driving policy optimization but also facilitates AV improvement by individualizing its training with more challenging cases out of those poorly organized scenarios. Experimental results clearly indicate that CLIC surpasses other curriculum-based training strategies, showing substantial improvement in managing risky scenarios, while still maintaining proficiency in handling simpler cases.
引用
收藏
页码:6850 / 6857
页数:8
相关论文
共 50 条
  • [41] Optimising sounds for the driving of sleep oscillations by closed-loop auditory stimulation
    Debellemaniere, Eden
    Pinaud, Clemence
    Schneider, Jules
    Arnal, Pierrick J.
    Casson, Alexander J.
    Chennaoui, Mounir
    Galtier, Mathieu
    Navarrete, Miguel
    Lewis, Penelope A.
    JOURNAL OF SLEEP RESEARCH, 2022, 31 (06)
  • [42] A Precision Closed-loop Driving Scheme of Silicon Micromachined Vibratory Gyroscope
    Yang, Bo
    Zhou, Bailing
    Wang, Shourong
    INTERNATIONAL MEMS CONFERENCE 2006, 2006, 34 : 57 - 64
  • [43] Precision closed-loop driving scheme of silicon micromachined vibratory gyroscope
    Yang, Bo
    Zhou, Bai-Ling
    Yuhang Xuebao/Journal of Astronautics, 2006, 27 (03): : 433 - 437
  • [44] Closed-loop Feedback Speed Guidance Method Considering Driving Style
    Li H.-R.
    Chu D.-F.
    Liang D.-C.
    Zhou T.-Q.
    Zhou, Tu-Qiang (3104@ecjtu.edu.cn), 1600, Science Press (21): : 94 - 100
  • [45] APPLICATION OF AN OPTIMAL PREVIEW CONTROL FOR SIMULATION OF CLOSED-LOOP AUTOMOBILE DRIVING
    MACADAM, C
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1981, 11 (06): : 393 - 399
  • [46] MCMSys: Multimodal Data Closed-Loop Management System for Autonomous Driving
    Li, He
    Zhou, Zhaogao
    Chen, Pin-tong
    Yan, Jinjie
    Yu, Rong
    Hu, Ziwei
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 411 - 417
  • [47] Return Policy and Buyback Contract in Closed-Loop Supply Chain
    Seksan, Jariya
    Amaruchkul, Kannapha
    ADVANCED SCIENCE LETTERS, 2018, 24 (11) : 8165 - 8170
  • [48] Approximate policy iteration for closed-loop learning of visual tasks
    Jodogne, Sebastien
    Briquet, Cyril
    Piater, Justus H.
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 210 - 221
  • [49] Closed-loop ventilation
    Arnal, Jean-Michel
    Katayama, Shinshu
    Howard, Christopher
    CURRENT OPINION IN CRITICAL CARE, 2023, 29 (01) : 19 - 25
  • [50] CLOSED-LOOP IN SCHOOL
    Bratina, N.
    DIABETES TECHNOLOGY & THERAPEUTICS, 2020, 22 : A12 - A13