SpeedAdv: Enabling Green Light Optimized Speed Advisory for Diverse Traffic Lights

被引:1
|
作者
Ding, Lige [1 ]
Zhao, Dong [1 ]
Zhu, Boqing [1 ]
Wang, Zhaofeng [1 ]
Tan, Chang [2 ]
Tong, Jianjun [1 ]
Ma, Huadong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing Key Lab Intelligent Telecommun Software &, Beijing 100876, Peoples R China
[2] IFLYTEK Co Ltd, iFLYTEK Res, Hefei 230088, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Green products; Vehicle dynamics; Uncertainty; Safety; Urban areas; Trajectory; Cooperative vehicle-infrastructure system; GLOSA; deep reinforcement learning; heterogeneous agents;
D O I
10.1109/TMC.2023.3319697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Green Light Optimized Speed Advisory (GLOSA) systems have emerged to allow drivers to pass traffic lights during a green interval. However, various adaptive and intelligent traffic light control approaches have been adopted in many cities, resulting in the development of current GLOSA technologies lagging behind that of traffic light technologies. When taking diverse dynamic traffic lights into account, it is difficult to model the interactions between vehicles and traffic lights, which is further exacerbated by the hybrid control strategies of traffic lights. To this end, we design a new GLOSA system SpeedAdv to provide optimal speed advisory for addressing diverse traffic lights. We formulate the problem as a Multi-Agent Markov Decision Process (MAMDP) with an implicit common goal and propose a heterogeneous-agent collaborative framework based on reinforcement learning. Three main modules are used in the system: i) a spatio-temporal relation reasoning module based on the phase-aware attention mechanism pays more attention to the traffic rules and traffic flow diversion of adjacent intersections to predict traffic conditions for a few seconds later; ii) a behavior approximating module based on imitation learning is introduced to approximate the phases of diverse traffic lights; iii) a speed advisory module provides the optimal speed advisory based on policy gradient reinforcement learning relying on the above two modules and other information collected by vehicles. We implement and evaluate SpeedAdv with a real-world trajectory dataset, together with a field test based on a prototype system, demonstrating that SpeedAdv improves the overall performance by at least 24.1% in terms of travel time, energy consumption, safety, and comfort compared to the state-of-the-art GreenDrive method.
引用
收藏
页码:6258 / 6271
页数:14
相关论文
共 37 条