A new solution to distributed permutation flow shop scheduling problem based on NASH Q-Learning

Cited by: 25

Authors:

Ren, J. F. [1,2]

Ye, C. M. [1]

Li, Y. [1]
Affiliations:

[1] Univ Shanghai Sci & Technol, Sch Business, Shanghai, Peoples R China

[2] Henan Univ Econ & Law, Sch Comp & Informat Engn, Zhengzhou, Peoples R China

Source:

ADVANCES IN PRODUCTION ENGINEERING & MANAGEMENT | 2021 / Vol. 16 / No. 3

Keywords:

Flow shop scheduling; Distributed scheduling; Permutation flow shop; Reinforcement learning; NASH Q-learning; Mean field (MF); SEARCH ALGORITHM; NEIGHBORHOOD SEARCH; MAKESPAN; TIME;

DOI:

10.14743/apem2021.3.399

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Subject classification code:

08;

Abstract:

This study addresses the Distributed Permutation Flow-shop Scheduling Problem (DPFSP) with the objective of minimizing the maximum completion time (makespan) of the workpieces across all production tasks. Using multi-agent Reinforcement Learning (RL) as the main framework of the solution model and combining it with NASH equilibrium theory, it proposes a NASH Q-Learning algorithm based on Mean Field (MF) for the Distributed Flow-shop Scheduling Problem (DFSP). In the RL part, the study designs a two-layer online learning mode in which sample collection and training improvement alternate: the outer layer collects samples and, once the collected samples reach the batch size, control passes to the inner loop, which runs model-free Q-learning in batch mode and uses a neural network to approximate the value function so that the method scales to large problems. On benchmark instances, the proposed algorithm achieved a better Average Relative Percentage Deviation (ARPD) than other similar algorithms, which the authors take as evidence of its feasibility and efficiency.
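The abstract names two quantities without giving formulas: the maximum completion time (makespan) and the ARPD index. The sketch below illustrates both using the standard textbook definitions, which are assumed here and are not taken from the paper. The function names (`flowshop_makespan`, `dpfsp_makespan`, `arpd`), the data layout (`proc[job][machine]`, one job permutation per factory), and the use of a best-known makespan as the ARPD reference value are all hypothetical choices made for illustration.

```python
from typing import Sequence


def flowshop_makespan(perm: Sequence[int], proc: Sequence[Sequence[int]]) -> int:
    """Makespan of one factory that processes the jobs in `perm` in order.

    proc[j][k] is the processing time of job j on machine k (assumed layout).
    """
    if not perm:
        return 0
    n_machines = len(proc[0])
    # done[k]: completion time of the most recently scheduled job on machine k
    done = [0] * n_machines
    for job in perm:
        prev = 0  # completion time of this job on the previous machine
        for k in range(n_machines):
            # A job starts on machine k once it has left machine k-1
            # and machine k has finished the preceding job.
            prev = max(prev, done[k]) + proc[job][k]
            done[k] = prev
    return done[-1]


def dpfsp_makespan(assignment: Sequence[Sequence[int]],
                   proc: Sequence[Sequence[int]]) -> int:
    """Distributed makespan: the largest makespan over all factories."""
    return max(flowshop_makespan(perm, proc) for perm in assignment)


def arpd(results: Sequence[float], best_known: float) -> float:
    """Average Relative Percentage Deviation of several runs from a reference."""
    return 100.0 * sum((c - best_known) / best_known for c in results) / len(results)


# Hypothetical toy instance: 3 jobs, 2 machines, 2 factories.
proc_times = [[3, 2], [1, 4], [2, 1]]
schedule = [[1, 0], [2]]  # factory 0 runs job 1 then job 0; factory 1 runs job 2
print(dpfsp_makespan(schedule, proc_times))
```

Under these definitions a lower ARPD means the makespans found are closer to the reference value, which is the sense in which the abstract compares algorithms.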


Pages: 269-284

Page count: 16

