DRL Router: Distributional Reinforcement Learning-Based Router for Reliable Shortest Path Problems

Cited by: 4
Authors
Guo, Hongliang [1]
Sheng, Wenda [2]
Gao, Chen [3]
Jin, Yaochu [4]
Affiliations
[1] Sichuan Univ SCU, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu 611731, Peoples R China
[3] Swiss Fed Inst Technol, ZH-8092 Zurich, Switzerland
[4] Bielefeld Univ, D-33619 Bielefeld, Germany
Keywords
Transportation; Reliability; Planning; Decision making; Routing; Navigation; Bibliographies; TRAVEL-TIME; STOCHASTIC NETWORKS; ALGORITHM; PROBABILITY
DOI
10.1109/MITS.2023.3265309
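Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809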
Abstract
This article studies reliable shortest path (RSP) problems in stochastic transportation networks. The term reliability has many definitions in the RSP literature, e.g., 1) maximal stochastic on-time arrival probability, 2) minimal travel time with a high-confidence constraint, 3) minimal mean and standard deviation combination, and 4) minimal expected disutility. To the best of our knowledge, almost all state-of-the-art RSP solutions are designed to target one specific RSP objective, and it is very difficult, if not impossible, to adapt them to other RSP objectives. To bridge the gap, this article develops a distributional reinforcement learning (DRL)-based algorithm, namely, DRL-Router, which serves as a universal solution to the four aforementioned RSP problems. DRL-Router employs the DRL method to approximate the full travel time distribution of a given routing policy and then makes improvements with respect to the user-defined RSP objective through a generalized policy iteration scheme. DRL-Router is 1) universal, i.e., it is applicable to a variety of RSP objectives; 2) model free, i.e., it does not rely on well-calibrated travel time distribution models; 3) adaptive, i.e., it adapts to changes in the navigation objective; and 4) fast, i.e., it performs real-time decision making. Extensive experimental results and comparisons with baseline algorithms in various transportation networks justify both the accuracy and efficiency of DRL-Router.
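The four reliability definitions listed in the abstract can each be read as a different score computed from the same travel time distribution, which is why an estimate of the full distribution can serve all of them. The following is a minimal sketch of that idea only, not the DRL-Router algorithm: it uses a plain list of sampled travel times as a stand-in for a learned distribution, and every function name, the sample data, and the quadratic disutility are assumptions made for illustration.

```python
import math
from statistics import fmean, pstdev

# Hypothetical helpers: each maps travel time samples to one RSP objective.

def on_time_probability(samples, budget):
    """Objective 1: fraction of samples arriving within the time budget (maximize)."""
    return sum(t <= budget for t in samples) / len(samples)

def quantile_time(samples, confidence):
    """Objective 2: smallest sampled time t such that at least a
    `confidence` fraction of the samples is <= t (minimize)."""
    ordered = sorted(samples)
    k = max(1, math.ceil(confidence * len(ordered)))
    return ordered[k - 1]

def mean_std(samples, weight):
    """Objective 3: mean plus weighted population standard deviation (minimize)."""
    return fmean(samples) + weight * pstdev(samples)

def expected_disutility(samples, disutility):
    """Objective 4: average of a user-supplied disutility function (minimize)."""
    return fmean([disutility(t) for t in samples])

def best_path(paths, score, maximize=False):
    """Return the name of the path with the best score under one objective."""
    pick = max if maximize else min
    return pick(paths, key=lambda name: score(paths[name]))

# Assumed toy data: path A is usually fast but occasionally very slow,
# path B is slower on a typical day but fully predictable.
paths = {
    "A": [10, 10, 10, 10, 30],
    "B": [14, 14, 14, 14, 14],
}

choice_on_time = best_path(paths, lambda s: on_time_probability(s, budget=12), maximize=True)
choice_quantile = best_path(paths, lambda s: quantile_time(s, confidence=0.9))
choice_mean_std = best_path(paths, lambda s: mean_std(s, weight=1.0))
choice_disutility = best_path(paths, lambda s: expected_disutility(s, lambda t: t * t))

print(choice_on_time, choice_quantile, choice_mean_std, choice_disutility)
```

In this toy setting both paths have the same mean travel time, yet the on-time objective with a tight budget prefers path A while the other three prefer path B, which illustrates why a solver tied to a single objective does not transfer directly to the others.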
Pages: 91-108
Number of pages: 18