Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引:0
|
作者
Zhao, Lixiang [1 ,2 ]
Zhu, Han [1 ,2 ]
Zhang, Min [1 ,2 ]
Tang, Jiafu [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China
[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;
D O I
10.1080/00207543.2024.2361449
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.
引用
收藏
页数:32
相关论文
共 50 条
  • [41] Representation Learning for Large-Scale Dynamic Networks
    Yu, Yanwei
    Yao, Huaxiu
    Wang, Hongjian
    Tang, Xianfeng
    Li, Zhenhui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 526 - 541
  • [42] Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning
    Ghasemipour, Seyed Kamyar Seyed
    Freeman, Daniel
    David, Byron
    Gu, Shixiang Shane
    Kataoka, Satoshi
    Mordatch, Igor
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [43] A hierarchical optimization neural network for large-scale dynamic systems
    Hou, ZG
    AUTOMATICA, 2001, 37 (12) : 1931 - 1940
  • [44] Reinforcement learning in a large-scale photonic recurrent neural network
    Bueno, J.
    Maktoobi, S.
    Froehly, L.
    Fischer, I.
    Jacquot, M.
    Larger, L.
    Brunner, D.
    OPTICA, 2018, 5 (06): : 756 - 760
  • [45] Large-scale power inspection: A deep reinforcement learning approach
    Guan, Qingshu
    Zhang, Xiangquan
    Xie, Minghui
    Nie, Jianglong
    Cao, Hui
    Chen, Zhao
    He, Zhouqiang
    FRONTIERS IN ENERGY RESEARCH, 2023, 10
  • [46] Efficient and scalable reinforcement learning for large-scale network control
    Ma, Chengdong
    Li, Aming
    Du, Yali
    Dong, Hao
    Yang, Yaodong
    NATURE MACHINE INTELLIGENCE, 2024, 6 (09) : 1006 - 1020
  • [47] Large-Scale Wildfire Mitigation Through Deep Reinforcement Learning
    Altamimi, Abdulelah
    Lagoa, Constantino
    Borges, Jose G.
    McDill, Marc E.
    Andriotis, C. P.
    Papakonstantinou, K. G.
    FRONTIERS IN FORESTS AND GLOBAL CHANGE, 2022, 5
  • [48] Learning hierarchical Bayesian networks for large-scale data analysis
    Hwang, Kyu-Baek
    Kim, Byoung-Hee
    Zhang, Byoung-Tak
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2006, 4232 : 670 - 679
  • [49] Cost-sensitive Learning for Large-scale Hierarchical Classification
    Chen, Jianfu
    Warren, David
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1351 - 1360
  • [50] Delay-aware packet scheduling for massive MIMO beamforming transmission using large-scale reinforcement learning
    Zhang, Xinran
    Sun, Songlin
    PHYSICAL COMMUNICATION, 2019, 32 : 81 - 87