A clustering-aided multi-agent deep reinforcement learning for multi-objective parallel batch processing machines scheduling in semiconductor manufacturing

被引：0

作者：

Zhang, Peng ^{[1
]}

Jin, Mengyu ^{[1
]}

Wang, Ming ^{[2
]}

Zhang, Jie ^{[1
]}

He, Junjie ^{[1
]}

Zheng, Peng ^{[3
]}

机构：

[1] Donghua Univ, Inst Artificial Intelligence, Shanghai Engn Res Ctr Ind Big Data & Intelligent S, 2999 North Renmin Rd, Shanghai 201620, Peoples R China

[2] Donghua Univ, Coll Mech Engn, Shanghai, Peoples R China

[3] Shanghai Maritime Univ, Coll Logist Engn, Shanghai, Peoples R China

来源：

MEASUREMENT & CONTROL | 2024年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Parallel batch processing machines; dynamic scheduling; multi-objective optimization; parameter sharing strategy; reinforcement learning; SHOP; OPTIMIZATION; ALGORITHMS; SEARCH;

D O I：

10.1177/00202940241269643

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Batch processing machines are often the bottleneck in semiconductor manufacturing and their scheduling plays a key role in production management. Pioneer researches on multi-objective batch machines scheduling mainly focus on evolutionary algorithms, failing to meet the online scheduling demand. To deal with the challenges confronted by incompatible job families, dynamic job arrivals, capacitated machines and multiple objectives, we propose a clustering-aided multi-agent deep reinforcement learning approach (CA-MADRL) for the scheduling problem. Specifically, to achieve diverse nondominated solutions, an offline multi-objective scheduling algorithm named Multi-Subpopulation fast elitist Non-Dominated Sorting Genetic Algorithm (MS-NSGA-II) is firstly developed to obtain the Pareto Fronts, and a clustering algorithm based on cosine distance is employed to analyze the distribution of Pareto frontier solution, which would be used to guide reward functions design in multi-agent deep reinforcement learning. To realize multi-objective optimization, several reinforcement learning base models are trained for different optimization directions, each of which composed of batch forming agent and batch scheduling agent. To alleviate time complexity of model training, a parameter sharing strategy is introduced between different reinforcement learning base model. By validating the proposed approach with 16 instances designed based on actual production data from a semiconductor manufacturing company, it has been demonstrated that the approach not only meets the high-frequency scheduling requirements of manufacturing systems for parallel batch processing machines but also effectively reduces the total job tardiness and machine energy consumption.

引用

页数：18

共 50 条

[11] Multi-objective reinforcement learning for designing ethical multi-agent environments
Rodriguez-Soto, Manel
Lopez-Sanchez, Maite
Rodriguez-Aguilar, Juan A.
NEURAL COMPUTING & APPLICATIONS, 2023,
[12] Multi-agent deep reinforcement learning for dynamic reconfigurable shop scheduling considering batch processing and worker cooperation
Li, Yuxin
Li, Xinyu
Gao, Liang
Lu, Zhibing
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 91
[13] Multi-objective optimization of the textile manufacturing process using deep-Q-network based multi-agent reinforcement learning
He, Zhenglei
Thomassey, Sebastien
Zeng, Xianyi
Xu, Jie
Yi, Changhai
JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 939 - 949
[14] Multi-objective Reconfigurable Manufacturing System Scheduling Optimisation: A Deep Reinforcement Learning Approach
Tang, Jiecheng
Haddad, Yousef
Patsavellas, John
Salonitis, Konstantinos
IFAC PAPERSONLINE, 2023, 56 (02): : 11082 - 11087
[15] Heuristically accelerated reinforcement learning modularization for multi-agent multi-objective problems
Ferreira, Leonardo Anjoletto
Costa Ribeiro, Carlos Henrique
da Costa Bianchi, Reinaldo Augusto
APPLIED INTELLIGENCE, 2014, 41 (02) : 551 - 562
[16] Distributed multi-agent reinforcement learning for multi-objective optimal dispatch of microgrids
Wang, Xiaowen
Liu, Shuai
Xu, Qianwen
Shao, Xinquan
ISA TRANSACTIONS, 2025, 158 : 130 - 140
[17] Heuristically accelerated reinforcement learning modularization for multi-agent multi-objective problems
Leonardo Anjoletto Ferreira
Carlos Henrique Costa Ribeiro
Reinaldo Augusto da Costa Bianchi
Applied Intelligence, 2014, 41 : 551 - 562
[18] Multi-Objective Dynamic Dispatch Optimisation using Multi-Agent Reinforcement Learning
Mannion, Patrick
Mason, Karl
Devlin, Sam
Duggan, Jim
Howley, Enda
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1345 - 1346
[19] Multi-agent Deep Reinforcement Learning for Microgrid Energy Scheduling
Zuo, Zhiqiang
Li, Zhi
Wang, Yijing
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6184 - 6189
[20] SOLVING MULTI-AGENT SCHEDULING PROBLEMS ON PARALLEL MACHINES WITH A GLOBAL OBJECTIVE FUNCTION
Sadi, F.
Soukhal, A.
Billaut, J. -C.
RAIRO-OPERATIONS RESEARCH, 2014, 48 (02) : 255 - 269

← 1 2 3 4 5 →