Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引：0

作者：

Zhao, Lixiang ^{[1
,2
]}

Zhu, Han ^{[1
,2
]}

Zhang, Min ^{[1
,2
]}

Tang, Jiafu ^{[1
,2
]}

Wang, Yu ^{[3
]}

机构：

[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China

[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China

[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2024年

基金：

中国国家自然科学基金;

关键词：

SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;

D O I：

10.1080/00207543.2024.2361449

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.

引用

页数：32

共 50 条

[1] Large-Scale Dynamic Scheduling for Flexible Job-Shop With Random Arrivals of New Jobs by Hierarchical Reinforcement Learning
Lei, Kun
Guo, Peng
Wang, Yi
Zhang, Jian
Meng, Xiangyin
Qian, Linmao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (01) : 1007 - 1018
[2] An End-to-end Hierarchical Reinforcement Learning Framework for Large-scale Dynamic Flexible Job-shop Scheduling Problem
Lei, Kun
Guo, Peng
Wang, Yi
Xiong, Jianyu
Zhao, Wenchao
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[3] Scheduling Large-scale Distributed Training via Reinforcement Learning
Peng, Zhanglin
Ren, Jiamin
Zhang, Ruimao
Wu, Lingyun
Wang, Xinjiang
Luo, Ping
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1797 - 1806
[4] A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
Ma, Yi
Hao, Xiaotian
Hao, Jianye
Lu, Jiawen
Liu, Xing
Tong, Xialiang
Yuan, Mingxuan
Li, Zhigang
Tang, Jie
Meng, Zhaopeng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
Wang, Hongbing
Huang, Guicheng
Yu, Qi
2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64
[6] Deep reinforcement learning for scheduling in large-scale networked control systems
Redder, Adrian
Ramaswamy, Arunselvan
Quevedo, Daniel E.
IFAC PAPERSONLINE, 2019, 52 (20): : 333 - 338
[7] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
Chen, Yuandong
Ding, Jinliang
Chen, Qingda
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
[8] Multi-task deep reinforcement learning for dynamic scheduling of large-scale fleets in earthmoving operations
Zhang, Yunuo
Zhang, Jun
Wang, Xiaoling
Zeng, Tuocheng
AUTOMATION IN CONSTRUCTION, 2025, 174
[9] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
Chen, Yixi
Zhu, Jizhong
Liu, Yun
Zhang, Le
Zhou, Jialin
IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458
[10] Large-Scale Retrieval for Reinforcement Learning
Humphreys, Peter C.
Guez, Arthur
Tieleman, Olivier
Sifre, Laurent
Weber, Theophane
Lillicrap, Timothy
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

← 1 2 3 4 5 →