Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引:0
|
作者
Zhao, Lixiang [1 ,2 ]
Zhu, Han [1 ,2 ]
Zhang, Min [1 ,2 ]
Tang, Jiafu [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China
[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;
D O I
10.1080/00207543.2024.2361449
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.
引用
收藏
页数:32
相关论文
共 50 条
  • [31] Reinforcement learning application for dynamic trust modeling in large-scale open distributed systems
    Li, Xiaoyong
    Gui, Xiaolin
    Zhao, Juan
    Zhao, Bo
    Journal of Computational Information Systems, 2008, 4 (06): : 2591 - 2597
  • [32] Algorithms or Actions? A Study in Large-Scale Reinforcement Learning
    Tavares, Anderson Rocha
    Anbalagan, Sivasubramanian
    Marcolino, Leandro Soriano
    Chaimowicz, Luiz
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2717 - 2723
  • [33] Deep Reinforcement Learning for Large-Scale Epidemic Control
    Libin, Pieter J. K.
    Moonens, Arno
    Verstraeten, Timothy
    Perez-Sanjines, Fabian
    Hens, Niel
    Lemey, Philippe
    Nowe, Ann
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 155 - 170
  • [34] Large-scale transit itinerary planning under uncertainty
    Li, Jing-Quan
    Kong, Nan
    Hu, Xiangpei
    Liu, Linlin
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2015, 60 : 397 - 415
  • [35] The economics of large-scale offshore investments under uncertainty
    Trapmann, W
    Forbes, K
    Mitchell, J
    EXPERIMENTING WITH FREER MARKETS: LESSONS FROM THE LAST 20 YEARS AND PROSPECTS FOR THE FUTURE - VOL 2, CONFERENCE PROCEEDINGS, 1998, : 445 - 452
  • [36] An Efficient Model-Free Approach for Controlling Large-Scale Canals via Hierarchical Reinforcement Learning
    Ren, Tao
    Niu, Jianwei
    Liu, Xuefeng
    Wu, Jiyan
    Lei, Xiaohui
    Zhang, Zhao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (06) : 4367 - 4378
  • [37] HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding
    Song, Zhaoyi
    Zhang, Rongqing
    Cheng, Xiang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7318 - 7325
  • [38] Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems
    Li, Jingbo
    Zhang, Xingjun
    Wei, Zheng
    Wei, Jia
    Ji, Zeyu
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (04) : 383 - 392
  • [39] Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems
    Jingbo Li
    Xingjun Zhang
    Zheng Wei
    Jia Wei
    Zeyu Ji
    CCF Transactions on High Performance Computing, 2021, 3 : 383 - 392
  • [40] Constrained large-scale real-time EV scheduling based on recurrent deep reinforcement learning
    Li, Hang
    Li, Guojie
    Lie, Tek Tjing
    Li, Xingzhi
    Wang, Keyou
    Han, Bei
    Xu, Jin
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 144