A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups

被引：0

作者：

Funing Li

Sebastian Lang

Bingyuan Hong

Tobias Reggelin

机构：

[1] Otto von Guericke University Magdeburg,Institute of Logistics and Material Handling Systems

[2] Fraunhofer Institute for Factory Operation and Automation IFF,National

[3] Zhejiang Ocean University,Local Joint Engineering Laboratory of Harbor Oil & Gas Storage and Transportation Technology/Zhejiang Provincial Key Laboratory of Petrochemical Pollution Control/School of Petrochemical Engineering and Environment

来源：

Journal of Intelligent Manufacturing | 2024年 / 35卷

关键词：

Deep reinforcement learning; Parallel machine scheduling; Family setups; Recurrent neural network;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

As an essential scheduling problem with several practical applications, the parallel machine scheduling problem (PMSP) with family setups constraints is difficult to solve and proven to be NP-hard. To this end, we present a deep reinforcement learning (DRL) approach to solve a PMSP considering family setups, aiming at minimizing the total tardiness. The PMSP is first modeled as a Markov decision process, where we design a novel variable-length representation of states and actions, so that the DRL agent can calculate a comprehensive priority for each job at each decision time point and then select the next job directly according to these priorities. Meanwhile, the variable-length state matrix and action vector enable the trained agent to solve instances of any scales. To handle the variable-length sequence and simultaneously ensure the calculated priority is a global priority among all jobs, we employ a recurrent neural network, particular gated recurrent unit, to approximate the policy of the agent. The agent is trained based on Proximal Policy Optimization algorithm. Moreover, we develop a two-stage training strategy to enhance the training efficiency. In the numerical experiments, we first train the agent on a given instance and then employ it to solve instances with much larger scales. The experimental results demonstrate the strong generalization capability of the trained agent and the comparison with three dispatching rules and two metaheuristics further validates the superiority of this agent.

引用

页码：1107 / 1140

页数：33

共 50 条

[21] Two-stage selection of distributed data centers based on deep reinforcement learning
Li, Qirui
Peng, Zhiping
Cui, Delong
Lin, Jianpeng
He, Jieguang
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04): : 2699 - 2714
[22] Two-stage selection of distributed data centers based on deep reinforcement learning
Li, Qirui
Peng, Zhiping
Cui, Delong
Lin, Jianpeng
He, Jieguang
Cluster Computing, 2022, 25 (04) : 2699 - 2714
[23] Two-Stage Reinforcement Learning-Based Differential Evolution for Solving Nonlinear Equations
Liao, Zuowen
Gong, Wenyin
Li, Shuijia
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4279 - 4290
[24] A two-stage three-machine assembly scheduling problem with a truncation position-based learning effect
Azzouz, Ameni
Pan, Po-An
Hsu, Peng-Hsiang
Lin, Win-Chin
Liu, Shangchia
Ben Said, Lamjed
Wu, Chin-Chia
SOFT COMPUTING, 2020, 24 (14) : 10515 - 10533
[25] A two-stage three-machine assembly scheduling problem with a truncation position-based learning effect
Ameni Azzouz
Po-An Pan
Peng-Hsiang Hsu
Win-Chin Lin
Shangchia Liu
Lamjed Ben Said
Chin-Chia Wu
Soft Computing, 2020, 24 : 10515 - 10533
[26] Reinforcement Learning-Based Multi-Objective of Two-Stage Blocking Hybrid Flow Shop Scheduling Problem
Xu, Ke
Ye, Caixia
Gong, Hua
Sun, Wenjuan
PROCESSES, 2024, 12 (01)
[27] A Two-Stage Machine-Learning-Based Prognostic Approach for Bearing Remaining Useful Prediction Problem
Zhao, Dongdong
Feng, Liu
IAENG International Journal of Computer Science, 2021, 48 (04):
[28] Solving Panel Block Assembly Line Scheduling Problem via a Novel Deep Reinforcement Learning Approach
Zhou, Tao
Luo, Liang
He, Yuanxin
Fan, Zhiwei
Ji, Shengchen
APPLIED SCIENCES-BASEL, 2023, 13 (14):
[29] A new two-stage constraint programming approach for open shop scheduling problem with machine blocking
Abreu, Levi R.
Nagano, Marcelo S.
Prata, Bruno A.
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (24) : 8560 - 8579
[30] Solving the two-stage hybrid flow shop scheduling problem based on mutant firefly algorithm
Beibei Fan
Wenwei Yang
Zaifang Zhang
Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 979 - 990

← 1 2 3 4 5 →