Traffic pattern-aware elevator dispatching via deep reinforcement learning

被引:1
|
作者
Wan, Jiansong [1 ]
Lee, Kanghoon [1 ]
Shin, Hayong [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Ind & Syst Engn, 291 Daehak Ro, Daejeon 34141, South Korea
基金
新加坡国家研究基金会;
关键词
Elevator dispatching; Semi-Markov decision process; Deep reinforcement learning; Traffic pattern awareness; GENETIC ALGORITHM;
D O I
10.1016/j.aei.2024.102497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study addresses the elevator dispatching problem using deep reinforcement learning, with a specific emphasis on traffic pattern awareness. Previous studies on reinforcement learning-based elevator dispatching have largely focused on training separate models for single traffic patterns, such as up-peak, down-peak, lunchpeak, and inter-floor. This separate training approach not only introduces practical complexities by requiring an auxiliary model to predict traffic patterns for guiding dispatching decisions but is also computationally burdensome. In contrast, our goal is to develop a unified, traffic pattern-aware dispatching model. We formulate the elevator dispatching problem as a Semi-Markov Decision Process (SMDP) with novel state representation, action space, and reward function designs. To solve the formulated SMDP, we propose a Dueling Double Deep Q-Network (D3QN) architecture associated with the training algorithm. To ensure traffic pattern awareness, we train our model in a unified 'All in One' traffic scenario, employing two practical techniques to enhance the training process: (1) temporal grouping with gradient surgery and (2) incorporation of passenger arrival information. Empirical evaluations confirm the superiority of our model over multiple benchmarks, including those relying on separate, pattern-specific models. Remarkably, our unified model demonstrates robust performance across unseen traffic scenarios and performs exceptionally well in single traffic patterns despite being trained solely on the unified 'All in One' scenario. The short inference time for decision-making further solidifies the model's practical viability. Additionally, the incremental benefits contributed by each of our introduced techniques are also investigated. Our code is available at https://github.com/jswan95/RLbased-traffic-pattern-aware-elevator-dispatching
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Optimizing Traffic at Intersections With Deep Reinforcement Learning
    Boyko, Nataliya
    Mokryk, Yaroslav
    JOURNAL OF ENGINEERING, 2024, 2024
  • [42] Deep Reinforcement Learning for Traffic Light Optimization
    Coskun, Mustafa
    Baggag, Abdelkader
    Chawla, Sanjay
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 564 - 571
  • [43] Reconfigurable and Traffic-Aware MAC Design for Virtualized Wireless Networks via Reinforcement Learning
    Shoaei, Atoosa Dalili
    Derakhshani, Mahsa
    Tho Le-Ngoc
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (08) : 5490 - 5505
  • [44] AoI-Aware Resource Management for Smart Health Via Deep Reinforcement Learning
    Wu, Beining
    Cai, Zhengkun
    Wu, Wei
    Yin, Xiaobin
    IEEE ACCESS, 2023, 11 : 81180 - 81195
  • [45] Salience-Aware Face Presentation Attack Detection via Deep Reinforcement Learning
    Yu, Bingyao
    Lu, Jiwen
    Li, Xiu
    Zhou, Jie
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 413 - 427
  • [46] Obstacle-Aware Navigation of Soft Growing Robots via Deep Reinforcement Learning
    El-Hussieny, Haitham
    Hameed, Ibrahim A.
    IEEE ACCESS, 2024, 12 : 38192 - 38201
  • [47] An Isolation-aware Online Virtual Network Embedding via Deep Reinforcement Learning
    Gohar, Ali
    Rong, Chunming
    Lee, Sanghwan
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 89 - 95
  • [48] User preference-aware video highlight detection via deep reinforcement learning
    Han Wang
    Kexin Wang
    Yuqing Wu
    Zhongzhi Wang
    Ling Zou
    Multimedia Tools and Applications, 2020, 79 : 15015 - 15024
  • [49] Crowd-Aware Socially Compliant Robot Navigation via Deep Reinforcement Learning
    Bingxin Xue
    Ming Gao
    Chaoqun Wang
    Yao Cheng
    Fengyu Zhou
    International Journal of Social Robotics, 2024, 16 : 197 - 209
  • [50] Haisor: Human-aware Indoor Scene Optimization via Deep Reinforcement Learning
    Sun, Jia-Mu
    Yang, Jie
    Mo, Kaichun
    Lai, Yu-Kun
    Guibas, Leonidas
    Gao, Lin
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (02):