Implications of stop-and-go traffic on training learning-based car-following control

Citations: 0
Authors
Zhou, Anye [1 ]
Peeta, Srinivas [2 ]
Zhou, Hao [3 ]
Laval, Jorge [2 ]
Wang, Zejiang [1 ]
Cook, Adian [1 ]
Affiliations
[1] Oak Ridge Natl Lab, POB 2008, Oak Ridge, TN 37831 USA
[2] Georgia Inst Technol, 790 Atlantic Dr, Atlanta, GA 30332 USA
[3] Univ S Florida, 4202 Fowler Ave, Tampa, FL 33620 USA
Keywords
Car-following control; System identification; Behavior cloning; Deep reinforcement learning; Generalizability; Stability; Validation; Safety
DOI
10.1016/j.trc.2024.104578
Chinese Library Classification (CLC) number
U [Transportation]
Discipline classification code
08; 0823
Abstract
Learning-based car-following control (LCC) of connected and autonomous vehicles (CAVs) is gaining significant attention with the advancement of computing power and data accessibility. While the flexibility and large model capacity of model-free architecture enable LCC to potentially outperform the model-based car-following (CF) model in improving traffic efficiency and mitigating congestion, the generalizability of LCC for traffic conditions different from the training environment/dataset is not well understood. This study seeks to explore the impact of stop-and-go traffic in the training dataset on the generalizability of LCC. It uses the characteristics of lead vehicle trajectories to describe stop-and-go traffic, and links the theory of identifiability (i.e., obtaining a unique parameter estimation result using sensor measurements) to the generalizability of behavior cloning (BC) and policy-based deep reinforcement learning (DRL). Correspondingly, the study shows theoretically that: (i) stop-and-go traffic can enable the property of identifiability and enhance the control performance of BC-based LCC in different traffic conditions; (ii) stop-and-go traffic is not necessary for DRL-based LCC to generalize to different traffic conditions; (iii) DRL-based LCC trained with only constant-speed lead vehicle trajectories (not sufficient to ensure identifiability) can be generalized to different traffic conditions; and (iv) stop-and-go traffic increases variance in the training dataset, which improves the convergence of parameter estimation while negatively impacting the convergence of DRL to the optimal control policy. Numerical experiments validate the above findings, illustrating that BC-based LCC requires comprehensive training datasets to generalize to different traffic conditions, while DRL-based LCC can achieve generalization with simple free-flow traffic training environments. This further suggests DRL as a more promising and cost-effective LCC approach to reduce operational costs, mitigate traffic congestion, and enhance safety and mobility, which can accelerate the deployment and acceptance of CAVs.
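The identifiability argument in finding (i) can be illustrated with a minimal sketch. It is not the paper's method: the linear car-following law, the gains `k1`, `k2`, the time gap `tau`, and the two-sinusoid "stop-and-go" speed profile are all assumptions chosen for illustration. The sketch fits the controller parameters by least squares from data generated with a constant-speed lead vehicle and with an oscillating lead vehicle, and compares the rank of the regressor matrix in each case.

```python
import numpy as np

def simulate(lead_speed, k1=0.3, k2=0.6, tau=1.5, dt=0.1):
    """Follow a lead vehicle with an assumed linear CF law (hypothetical gains).

    a = k1 * (gap - tau * v) + k2 * (v_lead - v)
    Returns the regressor rows [gap, v, v_lead - v] and the accelerations.
    """
    v = lead_speed[0]          # follower starts at the lead speed
    gap = tau * v              # and at the equilibrium gap
    rows, accels = [], []
    for vl in lead_speed:
        dv = vl - v
        a = k1 * (gap - tau * v) + k2 * dv
        rows.append([gap, v, dv])
        accels.append(a)
        gap += dv * dt         # forward-Euler update of the gap
        v += a * dt            # forward-Euler update of the follower speed
    return np.array(rows), np.array(accels)

# True parameters in the linear-in-parameters form a = X @ theta
theta_true = np.array([0.3, -0.3 * 1.5, 0.6])

t = np.arange(600) * 0.1
lead_const = np.full(600, 15.0)                              # constant-speed lead
lead_sg = 10.0 + 6.0 * np.sin(0.2 * t) + 3.0 * np.sin(0.7 * t)  # oscillating lead

X_const, y_const = simulate(lead_const)
X_sg, y_sg = simulate(lead_sg)

rank_const = np.linalg.matrix_rank(X_const)
rank_sg = np.linalg.matrix_rank(X_sg)

# Least-squares estimates; with a rank-deficient X the solution is not unique
# and lstsq returns only the minimum-norm one.
theta_const, *_ = np.linalg.lstsq(X_const, y_const, rcond=None)
theta_sg, *_ = np.linalg.lstsq(X_sg, y_sg, rcond=None)

print("rank (constant speed):", rank_const)
print("rank (stop-and-go):   ", rank_sg)
print("estimate (constant speed):", theta_const)
print("estimate (stop-and-go):   ", theta_sg)
```

Under these assumptions, the constant-speed data keep the follower at equilibrium, so every regressor row is identical and the parameters cannot be uniquely recovered, whereas the oscillating lead trajectory excites all three regressors and the least-squares fit returns the parameters used in the simulation.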
Pages: 29