DRL Router: Distributional Reinforcement Learning-Based Router for Reliable Shortest Path Problems

被引:4
|
作者
Guo, Hongliang [1 ]
Sheng, Wenda [2 ]
Gao, Chen [3 ]
Jin, Yaochu [4 ]
机构
[1] Sichuan Univ SCU, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu 611731, Peoples R China
[3] Swiss Fed Inst Technol, ZH-8092 Zurich, Switzerland
[4] Bielefeld Univ, D-33619 Bielefeld, Germany
关键词
Transportation; Reliability; Planning; Decision making; Routing; Navigation; Bibliographies; TRAVEL-TIME; STOCHASTIC NETWORKS; ALGORITHM; PROBABILITY;
D O I
10.1109/MITS.2023.3265309
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article studies reliable shortest path (RSP) problems in stochastic transportation networks. The term reliability in the RSP literature has many definitions, e.g., 1) maximal stochastic on-time arrival probability, 2) minimal travel time with a high-confidence constraint, 3) minimal mean and standard deviation combination, and 4) minimal expected disutility. To the best of our knowledge, almost all state-of-the-art RSP solutions are designed to target one specific RSP objective, and it is very difficult, if not impossible, to adapt them to other RSP objectives. To bridge the gap, this article develops a distributional reinforcement learning (DRL)-based algorithm, namely, DRL-Router, which serves as a universal solution to the four aforementioned RSP problems. DRL-Router employs the DRL method to approximate the full travel time distribution of a given routing policy and then makes improvements with respect to the user-defined RSP objective through a generalized policy iteration scheme. DRL-Router is 1) universal, i.e., it is applicable to a variety of RSP objectives; 2) model free, i.e., it does not rely on well calibrated travel time distribution models; 3) it is adaptive with navigation objective changes; and 4) fast, i.e., it performs real-time decision making. Extensive experimental results and comparisons with baseline algorithms in various transportation networks justify both the accuracy and efficiency of DRL-Router.
引用
收藏
页码:91 / 108
页数:18
相关论文
共 50 条
  • [21] Deep Distributional Reinforcement Learning-Based Adaptive Routing With Guaranteed Delay Bounds
    Liu, Jianmin
    Li, Dan
    Xu, Yongjun
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (06) : 4692 - 4706
  • [22] Reinforcement Learning Based Stochastic Shortest Path Finding in Wireless Sensor Networks
    Xia, Wenwen
    Di, Chong
    Guo, Haonan
    Li, Shenghong
    IEEE ACCESS, 2019, 7 : 157807 - 157817
  • [23] A Heuristically Accelerated Reinforcement Learning-Based Neurosurgical Path Planner
    Ji, Guanglin
    Gao, Qian
    Zhang, Tianwei
    Cao, Lin
    Sun, Zhenglong
    CYBORG AND BIONIC SYSTEMS, 2023, 4
  • [24] A Reinforcement Learning-Based Path Planning Considering Degree of Observability
    Cho, Yong Hyeon
    Park, Chan Gook
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 502 - 505
  • [25] A Heuristically Accelerated Reinforcement Learning-Based Neurosurgical Path Planner
    Ji G.
    Gao Q.
    Zhang T.
    Cao L.
    Sun Z.
    Cyborg and Bionic Systems, 2023, 4
  • [26] Recursive Router Metrics Prediction Using Machine Learning-Based Node Modeling for Network Digital Replica
    Hattori, Kyota
    Korikawa, Tomohiro
    Takasaki, Chikako
    Oowada, Hidenari
    IEEE ACCESS, 2023, 11 : 138638 - 138654
  • [27] RDERL: Reliable deep ensemble reinforcement learning-based recommender system
    Ahmadian, Milad
    Ahmadian, Sajad
    Ahmadi, Mahmood
    KNOWLEDGE-BASED SYSTEMS, 2023, 263
  • [28] DRL-RNP: Deep Reinforcement Learning-Based Optimized RNP Flight Procedure Execution
    Zhu, Longtao
    Wang, Jinlin
    Wang, Yi
    Ji, Yulong
    Ren, Jinchang
    SENSORS, 2022, 22 (17)
  • [29] DRL-OS: A Deep Reinforcement Learning-Based Offloading Scheduler in Mobile Edge Computing
    Lim, Ducsun
    Lee, Wooyeob
    Kim, Won-Tae
    Joe, Inwhee
    SENSORS, 2022, 22 (23)
  • [30] DeepThrottle: Deep Reinforcement Learning for Router Throttling to Defend Against DDoS Attack in SDN
    Chen, Shuhan
    Shen, Congqi
    Wu, Chunming
    Shen, Yi
    2022 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, IPCCC, 2022,