Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引:0
|
作者
Zhao, Lixiang [1 ,2 ]
Zhu, Han [1 ,2 ]
Zhang, Min [1 ,2 ]
Tang, Jiafu [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China
[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;
D O I
10.1080/00207543.2024.2361449
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.
引用
收藏
页数:32
相关论文
共 50 条
  • [21] Tractable large-scale deep reinforcement learning
    Sarang, Nima
    Poullis, Charalambos
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [22] Surgical scheduling under uncertainty by approximate dynamic programming
    Silva, Thiago A. O.
    de Souza, Mauricio C.
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2020, 95
  • [23] Large-Scale Machine Learning Cluster Scheduling via Multi-Agent Graph Reinforcement Learning
    Zhao, Xiaoyang
    Wu, Chuan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4962 - 4974
  • [24] On stochastic dynamic programming for solving large-scale planning problems under uncertainty
    Pilar Cristobal, M.
    Escudero, Laureano F.
    Monge, Juan F.
    COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (08) : 2418 - 2428
  • [25] Large-scale Distance Metric Learning with Uncertainty
    Qian, Qi
    Tang, Jiasheng
    Li, Hao
    Zhu, Shenghuo
    Jin, Rong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8542 - 8550
  • [26] Large-scale Unit Commitment under uncertainty
    Milad Tahanan
    Wim van Ackooij
    Antonio Frangioni
    Fabrizio Lacalandra
    4OR, 2015, 13 : 115 - 171
  • [27] Large-scale Unit Commitment under uncertainty
    Tahanan, Milad
    van Ackooij, Wim
    Frangioni, Antonio
    Lacalandra, Fabrizio
    4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2015, 13 (02): : 115 - 171
  • [28] Multi-Agent Mix Hierarchical Deep Reinforcement Learning for Large-Scale Fleet Management
    Huang, Xiaohui
    Ling, Jiahao
    Yang, Xiaofei
    Zhang, Xiong
    Yang, Kaiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14294 - 14305
  • [29] Large-Scale Electroplating Scheduling: A Hierarchical Temporal Planning Approach
    Xiao, Yingkai
    Jin, Kebing
    Ma, Renyong
    Zhuo, Hankz Hankui
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 215 - 226
  • [30] Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
    Petsagkourakis, P.
    Sandoval, I. O.
    Bradford, E.
    Zhang, D.
    del Rio-Chanona, E. A.
    IFAC PAPERSONLINE, 2020, 53 (02): : 11264 - 11270