Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

被引:28
|
作者
Ning, Lingbin [1 ]
Zhou, Min [1 ]
Hou, Zhuopu [1 ]
Goverde, Rob M. P. [2 ]
Wang, Fei-Yue [3 ]
Dong, Hairong [1 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[2] Delft Univ Technol, Dept Transport & Planning, NL-2628 CN Delft, Netherlands
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail transportation; Training; Heuristic algorithms; Resistance; Optimal control; Trajectory optimization; Switches; High-speed railway; train trajectory optimization; deep deterministic policy gradient; energy efficiency; TRAFFIC MANAGEMENT; LEARNING APPROACH; MODEL; INTEGRATION; OPERATION; ALGORITHM; SYSTEM; DELAY;
D O I
10.1109/TITS.2021.3105380
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper proposes a novel train trajectory optimization approach for high-speed railways. We restrict our attention to single train operation scenarios with different scheduled/rescheduled running times aiming at generating optimal train recommended trajectories in real time, which can ensure punctuality and energy efficiency of train operation. A learning-based approach deep deterministic policy gradient (DDPG) is designed to generate optimal train trajectories based on the offline training from the interaction between the agent and the trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and give a series of discrete operation modes (full traction, cruising, coasting, full braking), and thus to produce a feasible training set for DDPG, which can speed up the training process. Numerical experiments show that an optimized speed profile can be generated by DDPG within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of trained DDPG in solving TTO problems with different running times and line conditions.
引用
收藏
页码:11562 / 11574
页数:13
相关论文
共 50 条
  • [1] High-speed Train Operation Adjustment Strategy Based on Deep Deterministic Policy Gradient
    Rui Luo
    Wei ShangGuan
    Rong, Dingchao
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 549 - 554
  • [2] Trajectory Optimization for High-Speed Train Operation
    He Zhi-yu
    Yang Zhi-jie
    Lv Jing-yang
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2065 - 2070
  • [3] Double Deep Q network-based speed trajectory intelligent optimization for high-speed train
    Zhou, Min
    Zhou, Xueying
    Cao, Yaoguang
    Yang, Bo
    Done, Hairong
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 2436 - 2441
  • [4] An Improved Deep Deterministic Policy Gradient Pantograph Active Control Strategy for High-Speed Railways
    Wang, Ying
    Wang, Yuting
    Chen, Xiaoqiang
    Wang, Yixuan
    Chang, Zhanning
    ELECTRONICS, 2024, 13 (17)
  • [5] Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient
    Luck, Kevin Sebastian
    Vecerik, Mel
    Stepputtis, Simon
    Ben Amor, Heni
    Scholz, Jonathan
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3704 - 3711
  • [6] The path-tracking method based on deep deterministic policy gradient and applied for high-speed driving
    Liu, Hebing
    Sun, Jinhong
    Wane, Heshou
    Chen, Ka Wai Eric
    2024 10TH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS, PESA 2024, 2024,
  • [7] Moving Horizon Optimization of Dynamic Trajectory Planning for High-Speed Train Operation
    Yan, Xi-Hui
    Cai, Bai-Gen
    Ning, Bin
    Wei ShangGuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (05) : 1258 - 1270
  • [8] Multiobjective Optimization for Train Speed Trajectory in CTCS High-Speed Railway With Hybrid Evolutionary Algorithm
    Wei ShangGuan
    Yan, Xi-Hui
    Cai, Bai-Gen
    Wang, Jian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) : 2215 - 2225
  • [9] Optimization method of dynamic trajectory for high-speed train group based on resilience adjustment
    Song H.-Y.
    Shangguan W.
    Sheng Z.
    Zhang R.-F.
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2021, 21 (04): : 235 - 250
  • [10] Aerodynamic drag optimization of a high-speed train
    Munoz-Paniagua, J.
    Garcia, J.
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2020, 204