Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

被引：0

作者：

Zhicong Zhang

Li Zheng

Michael X. Weng

机构：

[1] Tsinghua University,Department of Industrial Engineering

[2] University of South Florida,Department of Industrial and Management Systems Engineering

来源：

The International Journal of Advanced Manufacturing Technology | 2007年 / 34卷

关键词：

Scheduling; Parallel machine; Reinforcement learning; Q-Learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependant setup times and machine–job qualification consideration. To apply the Q-Learning algorithm, we convert the scheduling problem into reinforcement learning problems by constructing a semi-Markov decision process (SMDP), including the definition of state representation, actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. Experiment results show that Q-Learning always outperforms all heuristics remarkably. Averaged over all test problems, the Q-Learning algorithm achieved performance improvements over WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by considerable amounts of 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.

引用

页码：968 / 980

页数：12

共 50 条

[31] Weighted-tardiness scheduling on parallel machines with proportional weights
1600, : 91 - 05
[32] Scheduling parallel machines to minimize total weighted and unweighted tardiness
Alidaee, B
Rosa, D
COMPUTERS & OPERATIONS RESEARCH, 1997, 24 (08) : 775 - 788
[33] Scheduling unrelated parallel machines to minimize total weighted tardiness
Liaw, CF
Lin, YK
Cheng, CY
Chen, MC
COMPUTERS & OPERATIONS RESEARCH, 2003, 30 (12) : 1777 - 1789
[34] Metaheuristics for Identical Parallel Machines Scheduling to Minimize Mean Tardiness
Kaid, Husam
Alharkan, Ibrahim
Ghaleb, Atef
Ghaleb, Mageed A.
2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND OPERATIONS MANAGEMENT (IEOM), 2015,
[35] A Weighted Smooth Q-Learning Algorithm
Vijesh, V. Antony
Shreyas, S. R.
IEEE CONTROL SYSTEMS LETTERS, 2025, 9 : 21 - 26
[36] Clustering state membership-based Q-learning for dynamic scheduling
Wang, Guolei
Zhong, Shisheng
Lin, Lin
Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): : 428 - 433
[37] Dynamic Parallel Machine Scheduling With Deep Q-Network
Liu, Chien-Liang
Tseng, Chun-Jan
Huang, Tzu-Hsuan
Wang, Jhih-Wun
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11): : 6792 - 6804
[38] Q-learning for Statically Scheduling DAGs
Roeder, Julius
Rouxel, Benjamin
Grelck, Clemens
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5813 - 5815
[39] Parallel Block-Based Simulated Annealing for the Single Machine Total Weighted Tardiness Scheduling Problem
Bozejko, Wojciech
Pempera, Jaroslaw
Uchronski, Mariusz
Wodecki, Mieczyslaw
16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021), 2022, 1401 : 758 - 765
[40] Dynamic Parallel Machine Scheduling Using the Learning Agent
Yuan, Biao
Wang, Lei
Jiang, Zhibin
2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013, : 1565 - 1569

← 1 2 3 4 5 →