Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

Cited by: 28
Authors
Ning, Lingbin [1]
Zhou, Min [1]
Hou, Zhuopu [1]
Goverde, Rob M. P. [2]
Wang, Fei-Yue [3]
Dong, Hairong [1]
Affiliations
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[2] Delft Univ Technol, Dept Transport & Planning, NL-2628 CN Delft, Netherlands
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Rail transportation; Training; Heuristic algorithms; Resistance; Optimal control; Trajectory optimization; Switches; High-speed railway; train trajectory optimization; deep deterministic policy gradient; energy efficiency; TRAFFIC MANAGEMENT; LEARNING APPROACH; MODEL; INTEGRATION; OPERATION; ALGORITHM; SYSTEM; DELAY;
DOI
10.1109/TITS.2021.3105380
Chinese Library Classification number
TU [Architectural Science];
Discipline classification code
0813;
Abstract
This paper proposes a novel train trajectory optimization (TTO) approach for high-speed railways. We restrict our attention to single-train operation scenarios with different scheduled or rescheduled running times, aiming to generate optimal recommended train trajectories in real time that ensure the punctuality and energy efficiency of train operation. A learning-based approach, deep deterministic policy gradient (DDPG), is designed to generate optimal train trajectories based on offline training through interaction between the agent and a trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and to produce a sequence of discrete operation modes (full traction, cruising, coasting, full braking), thereby yielding a feasible training set for DDPG that speeds up the training process. Numerical experiments show that DDPG can generate an optimized speed profile within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of the trained DDPG in solving TTO problems with different running times and line conditions.
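To make the four discrete operation modes named in the abstract concrete, the following is a minimal sketch of a point-mass forward simulation that turns a sequence of (mode, distance) segments into a running time and a traction energy. It is not the paper's ARTSOM algorithm or its DDPG agent. The train mass, force limits, Davis resistance coefficients, time step, and the function names `resistance`, `mode_force`, and `simulate` are all hypothetical choices made for illustration only.

```python
# Toy point-mass train model. All numbers below are hypothetical
# illustration values, not parameters taken from the paper.
MASS = 4.0e5            # train mass in kg (assumed)
MAX_TRACTION = 3.0e5    # maximum traction force in N (assumed)
MAX_BRAKE = 4.0e5       # maximum braking force in N (assumed)
A, B, C = 4.0e3, 50.0, 8.0  # assumed Davis coefficients: R(v) = A + B*v + C*v^2


def resistance(v):
    """Running resistance in N at speed v (m/s), Davis-type formula."""
    return A + B * v + C * v * v


def mode_force(mode, v):
    """Control force in N applied by the train in the given operation mode."""
    if mode == "traction":      # full traction
        return MAX_TRACTION
    if mode == "cruising":      # hold speed: traction balances resistance
        return min(resistance(v), MAX_TRACTION)
    if mode == "coasting":      # no traction, no braking
        return 0.0
    if mode == "braking":       # full braking
        return -MAX_BRAKE
    raise ValueError(f"unknown mode: {mode}")


def simulate(segments, dt=0.1):
    """Simulate a list of (mode, distance_m) segments starting from rest.

    Returns (running_time_s, traction_energy_J, final_speed_mps).
    Each segment may overshoot its length by at most one integration step.
    """
    v, t, energy = 0.0, 0.0, 0.0
    for mode, length in segments:
        s = 0.0
        while s < length:
            u = mode_force(mode, v)
            a = (u - resistance(v)) / MASS
            v_next = max(v + a * dt, 0.0)   # the train cannot roll backwards
            ds = 0.5 * (v + v_next) * dt    # trapezoidal distance increment
            if ds <= 0.0:                   # train is at standstill
                break
            if u > 0.0:                     # only traction consumes energy
                energy += u * ds
            s += ds
            v = v_next
            t += dt
    return t, energy, v
```

For example, `simulate([("traction", 5000), ("cruising", 6000), ("coasting", 6000), ("braking", 10000)])` replaces part of the cruising distance with coasting compared with a plan that cruises for 10000 m and coasts for 2000 m, and therefore spends less traction energy in this toy model. Trading running time against energy in this way is the kind of decision that a trajectory optimizer has to make.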
Pages: 11562-11574
Page count: 13
Related papers
50 in total
  • [21] Localization of a high-speed train using a speed model based on the gradient descent algorithm
    Ma, Liwen
    Wu, Jiaji
    Li, Chunyuan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 85 : 201 - 209
  • [22] Optimization of Multiperiod Mixed Train Schedule on High-Speed Railway
    Zhou, Wenliang
    Tian, Junli
    Qin, Jin
    Deng, Lianbo
    Wei, TangJian
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2015, 2015
  • [23] Research on aerodynamic optimization of high-speed train's slipstream
    Sun, Zhenxu
    Yao, Shuanbao
    Yang, Guowei
    ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2020, 14 (01) : 1106 - 1127
  • [24] Optimization on the Dynamic Train Coupling Process in High-Speed Railway
    Cheng Fanglin
    Tang Tao
    Su Shuai
    Meng Jun
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (05) : 1002 - 1010
  • [25] Optimization models for high-speed train unit routing problems
    Wang, Ying
    Gao, Yuan
    Yu, Xiaoyuan
    Hansen, Ingo A.
    Miao, Jianrui
    COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 127 : 1273 - 1281
  • [26] Aerodynamic Shape Optimization of the Pantograph Fairing of a High-Speed Train
    Zhang, Liang
    Zhang, Jiye
    Li, Tian
    RAILWAY DEVELOPMENT, OPERATIONS, AND MAINTENANCE: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON RAIL TRANSPORTATION 2017 (ICRT 2017), 2018, : 977 - 987
  • [27] Maintenance scheduling at high-speed train depots: An optimization approach
    Wang, Jiaxi
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 243
  • [28] Optimization on the Dynamic Train Coupling Process in High-Speed Railway
    CHENG Fanglin
    TANG Tao
    SU Shuai
    MENG Jun
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (05) : 1002 - 1010
  • [29] Policy Space Noise in Deep Deterministic Policy Gradient
    Yan, Yan
    Liu, Quan
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 624 - 634
  • [30] Adaptive Partial Train Speed Trajectory Optimization
    Tan, Zhaoxiang
    Lu, Shaofeng
    Bao, Kai
    Zhang, Shaoning
    Wu, Chaoxian
    Yang, Jie
    Xue, Fei
    ENERGIES, 2018, 11 (12)