A new solution to distributed permutation flow shop scheduling problem based on NASH Q-Learning

被引:25
|
作者
Ren, J. F. [1 ,2 ]
Ye, C. M. [1 ]
Li, Y. [1 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Business, Shanghai, Peoples R China
[2] Henan Univ Econ & Law, Sch Comp & Informat Engn, Zhengzhou, Peoples R China
来源
关键词
Flow shop scheduling; Distributed scheduling; Permutation flow shop; Reinforcement learning; NASH Q-learning; Mean field (MF); SEARCH ALGORITHM; NEIGHBORHOOD SEARCH; MAKESPAN; TIME;
D O I
10.14743/apem2021.3.399
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Aiming at Distributed Permutation Flow-shop Scheduling Problems (DPFSPs), this study took the minimization of the maximum completion time of the workpieces to be processed in all production tasks as the goal, and took the multi-agent Reinforcement Learning (RL) method as the main frame of the solution model, then, combining with the NASH equilibrium theory and the RL method, it proposed a NASH Q-Learning algorithm for Distributed Flow-shop Scheduling Problem (DFSP) based on Mean Field (MF). In the RL part, this study designed a two-layer online learning mode in which the sample collection and the training improvement proceed alternately, the outer layer collects samples, when the collected samples meet the requirement of batch size, it enters to the inner layer loop, which uses the Q-learning model-free batch processing mode to proceed and adopts neural network to approximate the value function to adapt to large-scale problems. By comparing the Average Relative Percentage Deviation (ARPD) index of the benchmark test questions, the calculation results of the proposed algorithm outperformed other similar algorithms, which proved the feasibility and efficiency of the proposed algorithm.
引用
收藏
页码:269 / 284
页数:16
相关论文
共 50 条
  • [21] Non-permutation flow shop scheduling problem based on deep reinforcement learning
    Xiao P.
    Zhang C.
    Meng L.
    Hong H.
    Dai W.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2021, 27 (01): : 192 - 205
  • [22] Deep Reinforcement Learning Algorithm for Permutation Flow Shop Scheduling Problem
    Yang, Yuanyuan
    Qian, Bin
    Hu, Rong
    Zhang, Dacheng
    INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 473 - 483
  • [24] Q-learning based estimation of distribution algorithm for scheduling distributed heterogeneous flexible flow-shop with mixed buffering limitation
    Xuan, Hua
    Zheng, Qian-Qian
    Lv, Lin
    Li, Bing
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149
  • [25] A memetic discrete differential evolution algorithm for the distributed permutation flow shop scheduling problem
    Fuqing Zhao
    Xiaotong Hu
    Ling Wang
    Zekai Li
    Complex & Intelligent Systems, 2022, 8 : 141 - 161
  • [26] Generic Hybridization of MOEA/D with Learning for Permutation Flow Shop Scheduling Problem
    Pericleous, S.
    Konstantinidis, A.
    Achilleos, A.
    Papadopoulos, G. A.
    2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2017, : 426 - 431
  • [27] A memetic discrete differential evolution algorithm for the distributed permutation flow shop scheduling problem
    Zhao, Fuqing
    Hu, Xiaotong
    Wang, Ling
    Li, Zekai
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) : 141 - 161
  • [28] Solving the Distributed Permutation Flow-Shop Scheduling Problem Using Constrained Programming
    Gogos, Christos
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [29] A Hyper-Heuristic Algorithm with Q-Learning for Distributed Flow Shop-Vehicle Transport-U-Assembly Integrated Scheduling Problem
    Yang, Dong-Lin
    Qian, Bin
    Zhang, Zi-Qi
    Hu, Rong
    Li, Kun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14862 : 300 - 310
  • [30] Hyper-heuristic Q-Learning Algorithm for Flow-Shop Scheduling Problem with Fuzzy Processing Times
    Zhu, Jin-Han
    Hu, Rong
    Li, Zuo-Cheng
    Qian, Bin
    Zhang, Zi-Qi
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 194 - 205