Deep Reinforcement Learning Based Optimization Algorithm for Permutation Flow-Shop Scheduling

Cited by: 65
Authors
Pan, Zixiao [1]
Wang, Ling [1]
Wang, Jingjing [1]
Lu, Jiawen [2]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Optimization; Job shop scheduling; Decoding; Reinforcement learning; Dynamic scheduling; Encoding; Computational intelligence; deep neural network; flow-shop scheduling; optimization algorithm; improvement strategy; HEURISTIC ALGORITHM; MINIMIZE MAKESPAN; M-MACHINE; N-JOB;
DOI
10.1109/TETCI.2021.3098354
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As a paradigm modeled on the human learning process, reinforcement learning (RL) has become an emerging topic in computational intelligence (CI). Combining RL with CI is a promising way to develop efficient algorithms for complex combinatorial optimization (CO) problems such as machine scheduling. In this paper, we propose an efficient optimization algorithm based on deep RL for the permutation flow-shop scheduling problem (PFSP), with the objective of minimizing the maximum completion time (makespan). First, a new deep neural network (PFSPNet) is designed for the PFSP to produce end-to-end output without restriction on problem size. Second, an actor-critic RL method is used to train the PFSPNet without requiring high-quality labelled data. Third, an improvement strategy is designed to refine the solution provided by the PFSPNet. Simulation results and statistical comparisons show that the proposed deep-RL-based algorithm obtains better results than existing heuristics in similar computational time when solving the PFSP.
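To make the objective in the abstract concrete, the following is a minimal Python sketch of how the maximum completion time of a job permutation can be evaluated in a permutation flow shop, followed by a simple refinement pass. The function names (`makespan`, `swap_improve`), the processing-time layout `proc[job][machine]`, and the pairwise-swap local search are all assumptions made for illustration. The abstract does not describe the paper's improvement strategy, so the swap search here is a generic stand-in and should not be read as the authors' method.

```python
from typing import List, Sequence, Tuple


def makespan(perm: Sequence[int], proc: Sequence[Sequence[int]]) -> int:
    """Maximum completion time of a job order in a permutation flow shop.

    proc[j][m] is the processing time of job j on machine m (assumed layout).
    Every job visits the machines in the same order 0, 1, ..., M-1.
    """
    n_machines = len(proc[0])
    # finish[m] = completion time of the most recently scheduled job on machine m
    finish = [0] * n_machines
    for job in perm:
        prev_done = 0  # when this job left the previous machine
        for m in range(n_machines):
            start = max(finish[m], prev_done)
            finish[m] = start + proc[job][m]
            prev_done = finish[m]
    return finish[-1]


def swap_improve(
    perm: Sequence[int], proc: Sequence[Sequence[int]]
) -> Tuple[List[int], int]:
    """Hypothetical refinement step: accept any pairwise swap that lowers the makespan.

    This is a generic local search used only as an illustration; it is not
    the improvement strategy proposed in the paper.
    """
    best = list(perm)
    best_cost = makespan(best, proc)
    improved = True
    while improved:
        improved = False
        for i in range(len(best) - 1):
            for k in range(i + 1, len(best)):
                cand = best[:]
                cand[i], cand[k] = cand[k], cand[i]
                cost = makespan(cand, proc)
                if cost < best_cost:
                    best, best_cost = cand, cost
                    improved = True
    return best, best_cost


if __name__ == "__main__":
    # Toy instance (made up): 3 jobs, 2 machines.
    proc = [[3, 2], [1, 4], [2, 1]]
    print(makespan([0, 1, 2], proc))      # 10
    print(swap_improve([0, 1, 2], proc))  # ([1, 0, 2], 8)
```

In a pipeline of the kind the abstract outlines, an initial permutation would come from the trained network rather than from a fixed order, and a refinement step would then try to reduce its makespan.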
Pages: 983-994
Number of pages: 12