A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups

Cited by: 0
Authors
Funing Li
Sebastian Lang
Bingyuan Hong
Tobias Reggelin
Affiliations
[1] Otto von Guericke University Magdeburg, Institute of Logistics and Material Handling Systems
[2] Fraunhofer Institute for Factory Operation and Automation IFF, National
[3] Zhejiang Ocean University, Local Joint Engineering Laboratory of Harbor Oil & Gas Storage and Transportation Technology/Zhejiang Provincial Key Laboratory of Petrochemical Pollution Control/School of Petrochemical Engineering and Environment
Source
JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (03)
Keywords
Deep reinforcement learning; Parallel machine scheduling; Family setups; Recurrent neural network;
DOI
Not available
Abstract
The parallel machine scheduling problem (PMSP) with family setup constraints is an essential scheduling problem with several practical applications, but it is difficult to solve and has been proven to be NP-hard. We present a deep reinforcement learning (DRL) approach to solve a PMSP with family setups, aiming to minimize the total tardiness. The PMSP is first modeled as a Markov decision process, for which we design a novel variable-length representation of states and actions, so that the DRL agent can calculate a comprehensive priority for each job at each decision time point and then select the next job directly according to these priorities. The variable-length state matrix and action vector also enable the trained agent to solve instances of any scale. To handle the variable-length sequence while ensuring that the calculated priority is a global priority among all jobs, we employ a recurrent neural network, specifically a gated recurrent unit, to approximate the policy of the agent. The agent is trained with the Proximal Policy Optimization algorithm. Moreover, we develop a two-stage training strategy to improve training efficiency. In the numerical experiments, we first train the agent on a given instance and then use it to solve instances of much larger scale. The experimental results demonstrate the strong generalization capability of the trained agent, and a comparison with three dispatching rules and two metaheuristics further validates its superiority.
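The decision loop described in the abstract (score every waiting job at each decision time point, pick the highest-priority one, accumulate tardiness) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `Job`, `simulate`, and `edd_priority` are hypothetical, an earliest-due-date dispatching rule stands in for the learned GRU policy, and the sketch assumes that all jobs are available at time zero, that the setup time is a single constant incurred whenever a machine switches family, and that the first job on a machine also requires a setup.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Job:
    job_id: int
    family: int
    processing_time: float
    due_date: float


# A priority rule maps (job, current time, machine's current family) to a score.
# The waiting job with the highest score is scheduled next.
PriorityRule = Callable[[Job, float, int], float]


def edd_priority(job: Job, now: float, machine_family: int) -> float:
    """Earliest-due-date rule, used here as a stand-in for a learned policy."""
    return -job.due_date


def simulate(jobs: Sequence[Job], n_machines: int, setup_time: float,
             priority: PriorityRule = edd_priority) -> float:
    """Dispatch jobs one at a time and return the total tardiness."""
    waiting: List[Job] = list(jobs)
    free_at = [0.0] * n_machines    # time at which each machine becomes idle
    family_on = [-1] * n_machines   # family each machine is set up for (-1 = none yet)
    total_tardiness = 0.0

    while waiting:
        # Decision time point: the earliest-idle machine requests its next job.
        m = min(range(n_machines), key=lambda i: free_at[i])
        now = free_at[m]
        # Score every waiting job; the candidate list shrinks as jobs are scheduled,
        # which is why a fixed-length action space does not fit this problem.
        job = max(waiting, key=lambda j: priority(j, now, family_on[m]))
        waiting.remove(job)
        # Assumption: a setup is incurred whenever the family changes,
        # including for the first job on a machine.
        setup = setup_time if family_on[m] != job.family else 0.0
        finish = now + setup + job.processing_time
        total_tardiness += max(0.0, finish - job.due_date)
        free_at[m] = finish
        family_on[m] = job.family
    return total_tardiness
```

Any function with the `PriorityRule` signature can replace `edd_priority`, so a trained policy that scores jobs could be plugged in at the same point without changing the simulation loop.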
Pages: 1107-1140
Page count: 33
Related papers
50 in total
  • [1] A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups
    Li, Funing
    Lang, Sebastian
    Hong, Bingyuan
    Reggelin, Tobias
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (03) : 1107 - 1140
  • [2] A transformer-based deep reinforcement learning approach for dynamic parallel machine scheduling problem with family setups
    Li, Funing
    Lang, Sebastian
    Tian, Yuan
    Hong, Bingyuan
    Rolf, Benjamin
    Noortwyck, Ruben
    Schulz, Robert
    Reggelin, Tobias
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024,
  • [3] Deep Reinforcement Learning for Minimizing Tardiness in Parallel Machine Scheduling With Sequence Dependent Family Setups
    Paeng, Bohyung
    Park, In-Beom
    Park, Jonghun
    IEEE ACCESS, 2021, 9 : 101390 - 101401
  • [4] Solving the spatial scheduling problem: a two-stage approach
    Zhang, Zhiying
    Chen, Jie
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2012, 50 (10) : 2732 - 2743
  • [5] Two-stage deep reinforcement learning method for agile optical satellite scheduling problem
    Liu, Zheng
    Xiong, Wei
    Jia, Zhuoya
    Han, Chi
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [6] A deep reinforcement learning framework for solving two-stage stochastic programs
    Yilmaz, Dogacan
    Buyuktahtakin, I. Esra
    OPTIMIZATION LETTERS, 2024, 18 (09) : 1993 - 2020
  • [7] Software Fault Prediction Using an RNN-Based Deep Learning Approach and Ensemble Machine Learning Techniques
    Borandag, Emin
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [8] A two-stage stochastic programming model for the parallel machine scheduling problem with machine capacity
    Al-Khamis, Talal
    M'Hallah, Rym
    COMPUTERS & OPERATIONS RESEARCH, 2011, 38 (12) : 1747 - 1759
  • [9] Edge-Enabled Two-Stage Scheduling Based on Deep Reinforcement Learning for Internet of Everything
    Zhou, Xiaokang
    Liang, Wei
    Yan, Ke
    Li, Weimin
    Wang, Kevin I-Kai
    Ma, Jianhua
    Jin, Qun
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (04) : 3295 - 3304
  • [10] A two-stage partial fixing approach for solving the residency block scheduling problem
    Guo, Junhong
    Pozehl, William
    Cohn, Amy
    HEALTH CARE MANAGEMENT SCIENCE, 2023, 26 (02) : 363 - 393