Implications of stop-and-go traffic on training learning-based car-following control

被引:0
|
作者
Zhou, Anye [1 ]
Peeta, Srinivas [2 ]
Zhou, Hao [3 ]
Laval, Jorge [2 ]
Wang, Zejiang [1 ]
Cook, Adian [1 ]
机构
[1] Oak Ridge Natl Lab, POB 2008, Oak Ridge, TN 37831 USA
[2] Georgia Inst Technol, 790 Atlantic Dr, Atlanta, GA 30332 USA
[3] Univ S Florida, 4202 Fowler Ave, Tampa, FL 33620 USA
关键词
Car-following control; System identification; Behavior cloning; Deep reinforcement learning; Generalizability; STABILITY; VALIDATION; SAFETY;
D O I
10.1016/j.trc.2024.104578
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Learning-based car-following control (LCC) of connected and autonomous vehicles (CAVs) is gaining significant attention with the advancement of computing power and data accessibility. While the flexibility and large model capacity of model-free architecture enable LCC to potentially outperform the model-based car-following (CF) model in improving traffic efficiency and mitigating congestion, the generalizability of LCC for traffic conditions different from the training environment/dataset is not well-understood. This study seeks to explore the impact of stop-andgo traffic in the training dataset on the generalizability of LCC. It uses the characteristics of lead vehicle trajectories to describe stop-and-go traffic, and links the theory of identifiability (i.e., obtaining a unique parameter estimation result using sensor measurements) to the generalizability of behavior cloning (BC) and policy-based deep reinforcement learning (DRL). Correspondingly, the study shows theoretically that: (i) stop-and-go traffic can enable the property of identifiability and enhance the control performance of BC-based LCC in different traffic conditions; (ii) stop-and-go traffic is not necessary for DRL-based LCC to generalize to different traffic conditions; (iii) DRL-based LCC trained with only constant-speed lead vehicle trajectories (not sufficient to ensure identifiability) can be generalized to different traffic conditions; and (iv) stopand-go traffic increases variance in the training dataset, which improves the convergence of parameter estimation while negatively impacting the convergence of DRL to the optimal control policy. Numerical experiments validate the above findings, illustrating that BC-based LCC entails comprehensive training datasets for generalizing to different traffic conditions, while DRL-based LCC can achieve generalization with simple free-flow traffic training environments. This further suggests DRL as a more promising and cost-effective LCC approach to reduce operational costs, mitigate traffic congestion, and enhance safety and mobility, which can accelerate the deployment and acceptance of CAVs.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] Development and Evaluation of Two Learning-Based Personalized Driver Models for Car-Following Behaviors
    Wang, Wenshuo
    Zhao, Ding
    Xi, Junqiang
    LeBlanc, David J.
    Hedrick, J. Karl
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 1133 - 1138
  • [22] Feedback control scheme of traffic jams based on the coupled map car-following model
    周桐
    孙棣华
    赵敏
    李华民
    Chinese Physics B, 2013, (09) : 207 - 212
  • [23] Feedback control scheme of traffic jams based on the coupled map car-following model
    Zhou Tong
    Sun Di-Hua
    Zhao Min
    Li Hua-Min
    CHINESE PHYSICS B, 2013, 22 (09)
  • [24] Feedback control of traffic jam based on the full velocity difference car-following model
    Li, Yongfu
    Sun, Dihua
    Liu, Weining
    Journal of Information and Computational Science, 2012, 9 (03): : 719 - 730
  • [25] An investigation into a string-stable vehicle following control strategy for stop-and-go cruise control
    Yi, K
    Park, Y
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2002, 216 (D12) : 947 - 956
  • [26] Deep Reinforcement Learning Car-Following Control Based on Multivehicle Motion Prediction
    Wang, Tao
    Qu, Dayi
    Wang, Kedong
    Dai, Shouchen
    ELECTRONICS, 2024, 13 (06)
  • [27] Research on Deep Reinforcement Learning-based Intelligent Car-following Control and Energy Management Strategy for Hybrid Electric Vehicles
    Tang X.
    Chen J.
    Liu T.
    Li J.
    Hu X.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2021, 57 (22): : 237 - 246
  • [28] A control method for congested traffic in the coupled map car-following model
    Shen Fei-Ying
    Ge Hong-Xia
    Zhang Hui
    Yu Han-Mei
    Lei Li
    CHINESE PHYSICS B, 2009, 18 (10) : 4208 - 4216
  • [29] A vehicle stop-and-go control strategy based on human drivers driving characteristics
    Yi, K
    Han, D
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2005, 19 (04) : 993 - 1000
  • [30] A control method for congested traffic in the coupled map car-following model
    沈飞英
    葛红霞
    张辉
    余寒梅
    雷丽
    Chinese Physics B, 2009, (10) : 4208 - 4216