Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

被引:0
|
作者
Zhicong Zhang
Li Zheng
Michael X. Weng
机构
[1] Tsinghua University,Department of Industrial Engineering
[2] University of South Florida,Department of Industrial and Management Systems Engineering
关键词
Scheduling; Parallel machine; Reinforcement learning; Q-Learning;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependant setup times and machine–job qualification consideration. To apply the Q-Learning algorithm, we convert the scheduling problem into reinforcement learning problems by constructing a semi-Markov decision process (SMDP), including the definition of state representation, actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. Experiment results show that Q-Learning always outperforms all heuristics remarkably. Averaged over all test problems, the Q-Learning algorithm achieved performance improvements over WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by considerable amounts of 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
引用
收藏
页码:968 / 980
页数:12
相关论文
共 50 条
  • [32] Scheduling parallel machines to minimize total weighted and unweighted tardiness
    Alidaee, B
    Rosa, D
    COMPUTERS & OPERATIONS RESEARCH, 1997, 24 (08) : 775 - 788
  • [33] Scheduling unrelated parallel machines to minimize total weighted tardiness
    Liaw, CF
    Lin, YK
    Cheng, CY
    Chen, MC
    COMPUTERS & OPERATIONS RESEARCH, 2003, 30 (12) : 1777 - 1789
  • [34] Metaheuristics for Identical Parallel Machines Scheduling to Minimize Mean Tardiness
    Kaid, Husam
    Alharkan, Ibrahim
    Ghaleb, Atef
    Ghaleb, Mageed A.
    2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND OPERATIONS MANAGEMENT (IEOM), 2015,
  • [35] A Weighted Smooth Q-Learning Algorithm
    Vijesh, V. Antony
    Shreyas, S. R.
    IEEE CONTROL SYSTEMS LETTERS, 2025, 9 : 21 - 26
  • [36] Clustering state membership-based Q-learning for dynamic scheduling
    Wang, Guolei
    Zhong, Shisheng
    Lin, Lin
    Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): : 428 - 433
  • [37] Dynamic Parallel Machine Scheduling With Deep Q-Network
    Liu, Chien-Liang
    Tseng, Chun-Jan
    Huang, Tzu-Hsuan
    Wang, Jhih-Wun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11): : 6792 - 6804
  • [38] Q-learning for Statically Scheduling DAGs
    Roeder, Julius
    Rouxel, Benjamin
    Grelck, Clemens
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5813 - 5815
  • [39] Parallel Block-Based Simulated Annealing for the Single Machine Total Weighted Tardiness Scheduling Problem
    Bozejko, Wojciech
    Pempera, Jaroslaw
    Uchronski, Mariusz
    Wodecki, Mieczyslaw
    16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021), 2022, 1401 : 758 - 765
  • [40] Dynamic Parallel Machine Scheduling Using the Learning Agent
    Yuan, Biao
    Wang, Lei
    Jiang, Zhibin
    2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013, : 1565 - 1569