Teleconsultation dynamic scheduling with a deep reinforcement learning approach

Cited by: 0
Authors
Chen, Wenjia [1]
Li, Jinlin [2]
Affiliations
[1] Beijing Informat Sci & Technol Univ, Sch Econ & Management, Beijing 100192, Peoples R China
[2] Beijing Inst Technol, Sch Management & Econ, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Teleconsultation scheduling; Markov decision process (MDP); Deep reinforcement learning; Deep Q-network (DQN); TELEMEDICINE; MODEL; OPTIMIZATION; UNCERTAINTY; DEMAND;
DOI
10.1016/j.artmed.2024.102806
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This study optimizes the start time of teleconsultations for the clinical departments of class A tertiary hospitals in order to improve service quality and efficiency. First, a general teleconsultation scheduling model is formulated. The number of services (NS) is one of its objectives because of demand intermittency and service mobility. Demand intermittency means that demand is zero in several periods. Service mobility means that specialists move between clinical departments and the National Telemedicine Center of China to provide the service. To solve the problem, the general model is converted into a Markov decision process (MDP) by carefully defining the state, action, and reward. Deep reinforcement learning (DRL) is applied to solve the MDP because it overcomes the problem of inaccurate transition probabilities. To reduce the dimensions of the state-action space, a semi-fixed policy is developed and applied to the deep Q-network (DQN), yielding an algorithm called DQN with a semi-fixed policy (DQN-S). For efficient fitting, an early stop strategy is applied in DQN-S training. To verify the effectiveness of the proposed scheduling model and of DQN-S as its solution method, scheduling experiments are carried out on actual data of teleconsultation demand arrivals and service arrangements. The results show that DQN-S improves the quality and efficiency of teleconsultations by reducing the average demand waiting time by 9%-41%, the number of services by 3%-42%, and the total cost of services by 3%-33%.
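The idea of casting a scheduling decision as an MDP and learning it with an early stop rule can be illustrated with a toy sketch. This is not the paper's DQN-S: tabular Q-learning is swapped in for the deep Q-network to keep the example dependency-free, and the state (queue length), the two actions (wait or hold a session), every constant, and the names `step` and `train` are hypothetical assumptions chosen only for illustration.

```python
import random

# All constants below are hypothetical, not taken from the paper.
MAX_QUEUE = 5        # cap on the number of waiting requests (state space 0..5)
ARRIVAL_PROB = 0.4   # chance of one new request per period; demand is often zero
WAIT_COST = 1.0      # cost per waiting request per period
SERVICE_COST = 3.0   # fixed cost of holding one service session


def step(queue, action, rng):
    """Advance one period. action 1 holds a session (clears the queue), 0 waits."""
    cost = 0.0
    if action == 1:
        cost += SERVICE_COST
        queue = 0
    cost += WAIT_COST * queue          # requests still waiting incur a cost
    if rng.random() < ARRIVAL_PROB:    # intermittent arrival
        queue = min(queue + 1, MAX_QUEUE)
    return queue, -cost                # reward is the negative cost


def train(episodes=2000, horizon=50, alpha=0.1, gamma=0.95,
          eps=0.1, patience=200, tol=1e-3, seed=0):
    """Tabular Q-learning with a simple early stop on small Q-value changes."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(MAX_QUEUE + 1)]
    calm_episodes = 0
    episodes_run = 0
    for _ in range(episodes):
        episodes_run += 1
        state = 0
        max_change = 0.0
        for _ in range(horizon):
            # Epsilon-greedy action selection.
            if rng.random() < eps:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] >= q[state][1] else 1
            nxt, reward = step(state, action, rng)
            # Standard Q-learning update toward the bootstrapped target.
            target = reward + gamma * max(q[nxt])
            delta = alpha * (target - q[state][action])
            q[state][action] += delta
            max_change = max(max_change, abs(delta))
            state = nxt
        # Early stop: quit once updates stay below tol for `patience` episodes.
        calm_episodes = calm_episodes + 1 if max_change < tol else 0
        if calm_episodes >= patience:
            break
    return q, episodes_run


if __name__ == "__main__":
    q_table, n = train()
    policy = ["hold" if row[1] > row[0] else "wait" for row in q_table]
    print(n, policy)
```

The trade-off in this toy setting loosely mirrors the one in the abstract: holding sessions more often lowers waiting time but raises the number of services and their cost. With a constant learning rate and noisy arrivals the early stop condition may never trigger, in which case training simply runs for all episodes.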
Pages: 18
Related papers
50 in total
  • [21] Shortening Passengers' Travel Time: A Dynamic Metro Train Scheduling Approach Using Deep Reinforcement Learning
    Wang, Zhaoyuan
    Pan, Zheyi
    Chen, Shun
    Ji, Shenggong
    Yi, Xiuwen
    Zhang, Junbo
    Wang, Jingyuan
    Gong, Zhiguo
    Li, Tianrui
    Zheng, Yu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5282 - 5295
  • [22] A deep reinforcement learning-based approach for the residential appliances scheduling
    Li, Sichen
    Cao, Di
    Huang, Qi
    Zhang, Zhenyuan
    Chen, Zhe
    Blaabjerg, Frede
    Hu, Weihao
    ENERGY REPORTS, 2022, 8 : 1034 - 1042
  • [23] A HIERARCHICAL DEEP REINFORCEMENT LEARNING APPROACH FOR OUTPATIENT PRIMARY CARE SCHEDULING
    Issabakhsh, Mona
    Lee, Seokgi
    2022 WINTER SIMULATION CONFERENCE (WSC), 2022, : 997 - 1008
  • [24] A DEEP REINFORCEMENT LEARNING APPROACH FOR PRODUCTION SCHEDULING IN COMPUTER SERVER INDUSTRY
    Radman, Azzam
    Aqlan, Faisal
    Parikh, Pratik
    Noor-E-Alam, Md
    PROCEEDINGS OF ASME 2024 19TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE, MSEC2024, VOL 2, 2024,
  • [25] Deep Reinforcement Learning Approach for Resource-Constrained Project Scheduling
    Zhao, Xiaohan
    Song, Wen
    Li, Qiqiang
    Shi, Huadong
    Kang, Zhichao
    Zhang, Chunmei
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1226 - 1234
  • [26] A Deep Reinforcement Learning Approach for Production Scheduling with the Use of Dispatch Rules
    Mavrothalassitis, Panagiotis
    Bakopoulos, Emmanouil
    Siatras, Vasilis
    Nikolakis, Nikolaos
    Alexopoulos, Kosmas
    ADVANCES IN ARTIFICIAL INTELLIGENCE IN MANUFACTURING, ESAIM 2023, 2024, : 43 - 50
  • [27] A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling
    Che, Haiying
    Bai, Zixing
    Zuo, Rong
    Li, Honglei
    COMPLEXITY, 2020, 2020
  • [28] A deep reinforcement learning approach for joint scheduling of cascade reservoir system
    Luo, Wei
    Wang, Chao
    Zhang, Yunhui
    Zhao, Jianshi
    Huang, Zhifeng
    Wang, Jiaqing
    Zhang, Chu
    JOURNAL OF HYDROLOGY, 2025, 651
  • [29] DYNAMIC SCHEDULING OF MAINTENANCE BY A REINFORCEMENT LEARNING APPROACH - A SEMICONDUCTOR SIMULATION STUDY
    Geurtsen, Michael
    Adan, Ivo
    Atan, Zumbul
    2022 WINTER SIMULATION CONFERENCE (WSC), 2022, : 3110 - 3121
  • [30] A reinforcement learning approach to parameter estimation in dynamic job shop scheduling
    Shahrabi, Jamal
    Adibi, Mohammad Amin
    Mahootchi, Masoud
    COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 110 : 75 - 82