Traffic pattern-aware elevator dispatching via deep reinforcement learning

Cited by: 1
Authors
Wan, Jiansong [1]
Lee, Kanghoon [1]
Shin, Hayong [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Ind & Syst Engn, 291 Daehak Ro, Daejeon 34141, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Elevator dispatching; Semi-Markov decision process; Deep reinforcement learning; Traffic pattern awareness; GENETIC ALGORITHM;
DOI
10.1016/j.aei.2024.102497
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This study addresses the elevator dispatching problem using deep reinforcement learning, with a specific emphasis on traffic pattern awareness. Previous studies on reinforcement learning-based elevator dispatching have largely focused on training separate models for single traffic patterns, such as up-peak, down-peak, lunch-peak, and inter-floor. This separate training approach is computationally burdensome and also introduces practical complexity, because it requires an auxiliary model that predicts the current traffic pattern in order to guide dispatching decisions. In contrast, our goal is to develop a unified, traffic pattern-aware dispatching model. We formulate the elevator dispatching problem as a Semi-Markov Decision Process (SMDP) with novel state representation, action space, and reward function designs. To solve the formulated SMDP, we propose a Dueling Double Deep Q-Network (D3QN) architecture along with an associated training algorithm. To ensure traffic pattern awareness, we train our model in a unified 'All in One' traffic scenario, employing two practical techniques to enhance the training process: (1) temporal grouping with gradient surgery and (2) incorporation of passenger arrival information. Empirical evaluations confirm the superiority of our model over multiple benchmarks, including those relying on separate, pattern-specific models. Remarkably, our unified model demonstrates robust performance across unseen traffic scenarios and performs exceptionally well in single traffic patterns despite being trained solely on the unified 'All in One' scenario. The short inference time for decision-making further solidifies the model's practical viability. We also investigate the incremental benefit contributed by each of the introduced techniques. Our code is available at https://github.com/jswan95/RLbased-traffic-pattern-aware-elevator-dispatching
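For readers unfamiliar with the D3QN and SMDP terms in the abstract, the following is a minimal, generic sketch of the two standard ingredients: the dueling aggregation of a state value with per-action advantages, and a Double DQN target discounted over the elapsed time between decisions. It is not taken from the authors' repository. The function names, the exponential discount form exp(-beta * tau), and the value of beta are illustrative assumptions, and the paper's actual state, action, and reward designs are not reproduced here.

```python
import math


def dueling_q(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_smdp_target(reward, tau, q_online_next, q_target_next,
                           beta=0.01, done=False):
    """Double DQN target with time-based discounting (assumed SMDP form).

    The online network selects the next action; the target network
    evaluates it. tau is the time elapsed until the next decision point,
    and beta is a hypothetical continuous-time discount rate.
    """
    if done:
        return reward
    best = max(range(len(q_online_next)), key=lambda i: q_online_next[i])
    return reward + math.exp(-beta * tau) * q_target_next[best]


# Hypothetical numbers, for illustration only.
q_values = dueling_q(2.0, [1.0, 0.0, -1.0])
target = double_dqn_smdp_target(-1.0, 0.0, [0.1, 0.9], [5.0, 3.0])
```

In the usage lines, the online estimates pick action 1, so the target network's value for action 1 is used rather than the target network's own maximum; this decoupling is what distinguishes Double DQN from a plain max over the target network.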
Pages: 13
Related papers
50 in total
  • [1] Pattern-Aware Intelligent Anti-Jamming Communication: A Sequential Deep Reinforcement Learning Approach
    Liu, Songyi
    Xu, Yifan
    Chen, Xueqiang
    Wang, Ximing
    Wang, Meng
    Li, Wen
    Li, Yangyang
    Xu, Yuhua
    IEEE ACCESS, 2019, 7 : 169204 - 169216
  • [2] Power-Aware Traffic Engineering via Deep Reinforcement Learning
    Pan, Tian
    Peng, Xiaoyu
    Bian, Zizheng
    Lin, Xingchen
    Song, Enge
    Huang, Tao
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM 2019 WKSHPS), 2019, : 1009 - 1010
  • [3] Solving the train dispatching problem via deep reinforcement learning
    Agasucci, Valerio
    Grani, Giorgio
    Lamorgese, Leonardo
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 26
  • [4] GreenTE.ai: Power-Aware Traffic Engineering via Deep Reinforcement Learning
    Pan, Tian
    Peng, Xiaoyu
    Shi, Qianqian
    Bian, Zizheng
    Lin, Xingchen
    Song, Enge
    Li, Fuliang
    Xu, Yang
    Huang, Tao
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [5] Power-Aware Traffic Engineering for Data Center Networks via Deep Reinforcement Learning
    Gao, Minglan
    Pan, Tian
    Song, Enge
    Yang, Mengqi
    Huang, Tao
    Liu, Yunjie
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 6055 - 6060
  • [6] Traffic Signal Timing via Deep Reinforcement Learning
    Li Li
    Yisheng Lv
    Fei-Yue Wang
    IEEE/CAA Journal of Automatica Sinica, 2016, 3 (03) : 247 - 247
  • [7] Traffic signal timing via deep reinforcement learning
    Li L.
    Lv Y.
    Wang F.-Y.
    Li, Li (li-li@tsinghua.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (03): : 247 - 254
  • [8] Traffic Signal Timing via Deep Reinforcement Learning
    Li, Li
    Lv, Yisheng
    Wang, Fei-Yue
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2016, 3 (03) : 247 - 254
  • [9] Delay-aware Cellular Traffic Scheduling with Deep Reinforcement Learning
    Zhang, Ticao
    Shen, Shuyi
    Mao, Shiwen
    Chang, Gee-Kung
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [10] Objective-aware Traffic Simulation via Inverse Reinforcement Learning
    Zheng, Guanjie
    Liu, Hanyang
    Xu, Kai
    Li, Zhenhui
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3771 - 3777