A new solution to distributed permutation flow shop scheduling problem based on NASH Q-Learning

被引:25
|
作者
Ren, J. F. [1 ,2 ]
Ye, C. M. [1 ]
Li, Y. [1 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Business, Shanghai, Peoples R China
[2] Henan Univ Econ & Law, Sch Comp & Informat Engn, Zhengzhou, Peoples R China
来源
关键词
Flow shop scheduling; Distributed scheduling; Permutation flow shop; Reinforcement learning; NASH Q-learning; Mean field (MF); SEARCH ALGORITHM; NEIGHBORHOOD SEARCH; MAKESPAN; TIME;
D O I
10.14743/apem2021.3.399
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Aiming at Distributed Permutation Flow-shop Scheduling Problems (DPFSPs), this study took the minimization of the maximum completion time of the workpieces to be processed in all production tasks as the goal, and took the multi-agent Reinforcement Learning (RL) method as the main frame of the solution model, then, combining with the NASH equilibrium theory and the RL method, it proposed a NASH Q-Learning algorithm for Distributed Flow-shop Scheduling Problem (DFSP) based on Mean Field (MF). In the RL part, this study designed a two-layer online learning mode in which the sample collection and the training improvement proceed alternately, the outer layer collects samples, when the collected samples meet the requirement of batch size, it enters to the inner layer loop, which uses the Q-learning model-free batch processing mode to proceed and adopts neural network to approximate the value function to adapt to large-scale problems. By comparing the Average Relative Percentage Deviation (ARPD) index of the benchmark test questions, the calculation results of the proposed algorithm outperformed other similar algorithms, which proved the feasibility and efficiency of the proposed algorithm.
引用
收藏
页码:269 / 284
页数:16
相关论文
共 50 条
  • [1] Improved Q-learning algorithm for solving permutation flow shop scheduling problems
    He, Zimiao
    Wang, Kunlan
    Li, Hanxiao
    Song, Hong
    Lin, Zhongjie
    Gao, Kaizhou
    Sadollah, Ali
    IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2022, 4 (01) : 35 - 44
  • [2] An improved fruit fly optimization algorithm with Q-learning for solving distributed permutation flow shop scheduling problems
    Zhao, Cai
    Wu, Lianghong
    Zuo, Cili
    Zhang, Hongqiang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 5965 - 5988
  • [3] Dueling double Q-learning based reinforcement learning approach for the flow shop scheduling problem
    Kim S.J.
    Kim B.W.
    Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (10): : 1497 - 1508
  • [4] A Hyperheuristic With Q-Learning for the Multiobjective Energy-Efficient Distributed Blocking Flow Shop Scheduling Problem
    Zhao, Fuqing
    Di, Shilu
    Wang, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) : 3337 - 3350
  • [5] Deep Reinforcement Learning for Solving Distributed Permutation Flow Shop Scheduling Problem
    Wang, Yijun
    Qian, Bin
    Hu, Rong
    Yang, Yuanyuan
    Chen, Wenbo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 333 - 342
  • [6] A Hyper-Heuristic Algorithm with Q-Learning for Distributed Permutation Flowshop Scheduling Problem
    Lan, Ke
    Zhang, Zi-Qi
    Qian, Bi
    Hu, Rong
    Zhang, Da-Cheng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 122 - 131
  • [7] A New Variant of the Distributed Permutation Flow Shop Scheduling Problem with Worker Flexibility
    Mraihi, Tasnim
    Driss, Olfa Belkahla
    EL-Haouzi, Hind Bril
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 1653 : 587 - 597
  • [8] A Parallel Distributed Algorithm for the Permutation Flow Shop Scheduling Problem
    Kouki, Samia
    Ladhari, Talel
    Jemni, Mohamed
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 2, PROCEEDINGS, 2010, 6082 : 328 - +
  • [9] Distributed assembly hybrid flow shop scheduling based on shuffled frog leaping algorithm with Q-learning
    Cai J.
    Wang L.
    Lei D.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (12): : 37 - 44
  • [10] Q-learning based hyper-heuristic with clustering strategy for combinatorial optimization: A case study on permutation flow-shop scheduling problem
    Yang, Yuan-yuan
    Qian, Bin
    Li, Zuocheng
    Hu, Rong
    Wang, Ling
    COMPUTERS & OPERATIONS RESEARCH, 2025, 173