A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups

被引:0
|
作者
Funing Li
Sebastian Lang
Bingyuan Hong
Tobias Reggelin
机构
[1] Otto von Guericke University Magdeburg,Institute of Logistics and Material Handling Systems
[2] Fraunhofer Institute for Factory Operation and Automation IFF,National
[3] Zhejiang Ocean University,Local Joint Engineering Laboratory of Harbor Oil & Gas Storage and Transportation Technology/Zhejiang Provincial Key Laboratory of Petrochemical Pollution Control/School of Petrochemical Engineering and Environment
来源
关键词
Deep reinforcement learning; Parallel machine scheduling; Family setups; Recurrent neural network;
D O I
暂无
中图分类号
学科分类号
摘要
As an essential scheduling problem with several practical applications, the parallel machine scheduling problem (PMSP) with family setups constraints is difficult to solve and proven to be NP-hard. To this end, we present a deep reinforcement learning (DRL) approach to solve a PMSP considering family setups, aiming at minimizing the total tardiness. The PMSP is first modeled as a Markov decision process, where we design a novel variable-length representation of states and actions, so that the DRL agent can calculate a comprehensive priority for each job at each decision time point and then select the next job directly according to these priorities. Meanwhile, the variable-length state matrix and action vector enable the trained agent to solve instances of any scales. To handle the variable-length sequence and simultaneously ensure the calculated priority is a global priority among all jobs, we employ a recurrent neural network, particular gated recurrent unit, to approximate the policy of the agent. The agent is trained based on Proximal Policy Optimization algorithm. Moreover, we develop a two-stage training strategy to enhance the training efficiency. In the numerical experiments, we first train the agent on a given instance and then employ it to solve instances with much larger scales. The experimental results demonstrate the strong generalization capability of the trained agent and the comparison with three dispatching rules and two metaheuristics further validates the superiority of this agent.
引用
收藏
页码:1107 / 1140
页数:33
相关论文
共 50 条
  • [21] Two-stage selection of distributed data centers based on deep reinforcement learning
    Li, Qirui
    Peng, Zhiping
    Cui, Delong
    Lin, Jianpeng
    He, Jieguang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04): : 2699 - 2714
  • [22] Two-stage selection of distributed data centers based on deep reinforcement learning
    Li, Qirui
    Peng, Zhiping
    Cui, Delong
    Lin, Jianpeng
    He, Jieguang
    Cluster Computing, 2022, 25 (04) : 2699 - 2714
  • [23] Two-Stage Reinforcement Learning-Based Differential Evolution for Solving Nonlinear Equations
    Liao, Zuowen
    Gong, Wenyin
    Li, Shuijia
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4279 - 4290
  • [24] A two-stage three-machine assembly scheduling problem with a truncation position-based learning effect
    Azzouz, Ameni
    Pan, Po-An
    Hsu, Peng-Hsiang
    Lin, Win-Chin
    Liu, Shangchia
    Ben Said, Lamjed
    Wu, Chin-Chia
    SOFT COMPUTING, 2020, 24 (14) : 10515 - 10533
  • [25] A two-stage three-machine assembly scheduling problem with a truncation position-based learning effect
    Ameni Azzouz
    Po-An Pan
    Peng-Hsiang Hsu
    Win-Chin Lin
    Shangchia Liu
    Lamjed Ben Said
    Chin-Chia Wu
    Soft Computing, 2020, 24 : 10515 - 10533
  • [26] Reinforcement Learning-Based Multi-Objective of Two-Stage Blocking Hybrid Flow Shop Scheduling Problem
    Xu, Ke
    Ye, Caixia
    Gong, Hua
    Sun, Wenjuan
    PROCESSES, 2024, 12 (01)
  • [27] A Two-Stage Machine-Learning-Based Prognostic Approach for Bearing Remaining Useful Prediction Problem
    Zhao, Dongdong
    Feng, Liu
    IAENG International Journal of Computer Science, 2021, 48 (04):
  • [28] Solving Panel Block Assembly Line Scheduling Problem via a Novel Deep Reinforcement Learning Approach
    Zhou, Tao
    Luo, Liang
    He, Yuanxin
    Fan, Zhiwei
    Ji, Shengchen
    APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [29] A new two-stage constraint programming approach for open shop scheduling problem with machine blocking
    Abreu, Levi R.
    Nagano, Marcelo S.
    Prata, Bruno A.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (24) : 8560 - 8579
  • [30] Solving the two-stage hybrid flow shop scheduling problem based on mutant firefly algorithm
    Beibei Fan
    Wenwei Yang
    Zaifang Zhang
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 979 - 990