Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

Authors
Zhicong Zhang
Li Zheng
Michael X. Weng
Affiliations
[1] Tsinghua University, Department of Industrial Engineering
[2] University of South Florida, Department of Industrial and Management Systems Engineering
Keywords
Scheduling; Parallel machine; Reinforcement learning; Q-Learning
Abstract
In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependent setup times and machine–job qualification constraints. To apply the Q-Learning algorithm, we convert the scheduling problem into a reinforcement learning problem by constructing a semi-Markov decision process (SMDP), which includes defining the state representation, the actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove that the reward function is equivalent to the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. The experimental results show that Q-Learning always outperforms all five heuristics by a wide margin. Averaged over all test problems, the Q-Learning algorithm improved on WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
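As a rough illustration of the ideas in the abstract, the sketch below computes mean weighted tardiness for a set of completed jobs and performs one tabular Q-Learning update in which the five heuristics are the available actions. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the state labels, the learning rate `alpha`, the continuous-time discount rate `beta`, and the textbook SMDP update form (discounting by the sojourn time between decisions) are all hypothetical choices, since the abstract does not specify the state features, the reward definition, or the update rule.

```python
import math
from collections import defaultdict

# The five dispatching heuristics named in the abstract, used as actions.
ACTIONS = ["WSPT", "WMDD", "WCOVERT", "RATCS", "LFJ-WCOVERT"]


def mean_weighted_tardiness(jobs):
    """Return (1/n) * sum of w_j * max(0, C_j - d_j).

    jobs: iterable of (weight, completion_time, due_date) tuples.
    """
    jobs = list(jobs)
    total = sum(w * max(0.0, c - d) for w, c, d in jobs)
    return total / len(jobs)


def smdp_q_update(q, state, action, reward, sojourn, next_state,
                  alpha=0.1, beta=0.01):
    """Apply one tabular SMDP Q-Learning step and return the new Q-value.

    Assumption: the generic SMDP form with discount exp(-beta * sojourn),
    where sojourn is the time elapsed between two decision epochs.
    The paper's actual reward and state design are not given in the abstract.
    """
    discount = math.exp(-beta * sojourn)
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    target = reward + discount * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]
```

For example, with a Q-table created as `q = defaultdict(float)`, calling `smdp_q_update(q, "s0", "WSPT", -4.0, 0.0, "s1", alpha=0.5)` moves the estimate for choosing WSPT in the hypothetical state `"s0"` halfway toward the observed reward. A negative reward is used here only because the objective is a cost to be minimised; that sign convention is also an assumption.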
Pages: 968-980
Page count: 12