Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引:0
|
作者
Zhao, Lixiang [1 ,2 ]
Zhu, Han [1 ,2 ]
Zhang, Min [1 ,2 ]
Tang, Jiafu [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China
[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;
D O I
10.1080/00207543.2024.2361449
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Large-Scale Dynamic Scheduling for Flexible Job-Shop With Random Arrivals of New Jobs by Hierarchical Reinforcement Learning
    Lei, Kun
    Guo, Peng
    Wang, Yi
    Zhang, Jian
    Meng, Xiangyin
    Qian, Linmao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (01) : 1007 - 1018
  • [2] An End-to-end Hierarchical Reinforcement Learning Framework for Large-scale Dynamic Flexible Job-shop Scheduling Problem
    Lei, Kun
    Guo, Peng
    Wang, Yi
    Xiong, Jianyu
    Zhao, Wenchao
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] Scheduling Large-scale Distributed Training via Reinforcement Learning
    Peng, Zhanglin
    Ren, Jiamin
    Zhang, Ruimao
    Wu, Lingyun
    Wang, Xinjiang
    Luo, Ping
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1797 - 1806
  • [4] A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
    Ma, Yi
    Hao, Xiaotian
    Hao, Jianye
    Lu, Jiawen
    Liu, Xing
    Tong, Xialiang
    Yuan, Mingxuan
    Li, Zhigang
    Tang, Jie
    Meng, Zhaopeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
    Wang, Hongbing
    Huang, Guicheng
    Yu, Qi
    2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64
  • [6] Deep reinforcement learning for scheduling in large-scale networked control systems
    Redder, Adrian
    Ramaswamy, Arunselvan
    Quevedo, Daniel E.
    IFAC PAPERSONLINE, 2019, 52 (20): : 333 - 338
  • [7] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
    Chen, Yuandong
    Ding, Jinliang
    Chen, Qingda
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
  • [8] Multi-task deep reinforcement learning for dynamic scheduling of large-scale fleets in earthmoving operations
    Zhang, Yunuo
    Zhang, Jun
    Wang, Xiaoling
    Zeng, Tuocheng
    AUTOMATION IN CONSTRUCTION, 2025, 174
  • [9] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
    Chen, Yixi
    Zhu, Jizhong
    Liu, Yun
    Zhang, Le
    Zhou, Jialin
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458
  • [10] Large-Scale Retrieval for Reinforcement Learning
    Humphreys, Peter C.
    Guez, Arthur
    Tieleman, Olivier
    Sifre, Laurent
    Weber, Theophane
    Lillicrap, Timothy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,