A clustering-aided multi-agent deep reinforcement learning for multi-objective parallel batch processing machines scheduling in semiconductor manufacturing

被引:0
|
作者
Zhang, Peng [1 ]
Jin, Mengyu [1 ]
Wang, Ming [2 ]
Zhang, Jie [1 ]
He, Junjie [1 ]
Zheng, Peng [3 ]
机构
[1] Donghua Univ, Inst Artificial Intelligence, Shanghai Engn Res Ctr Ind Big Data & Intelligent S, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
[2] Donghua Univ, Coll Mech Engn, Shanghai, Peoples R China
[3] Shanghai Maritime Univ, Coll Logist Engn, Shanghai, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Parallel batch processing machines; dynamic scheduling; multi-objective optimization; parameter sharing strategy; reinforcement learning; SHOP; OPTIMIZATION; ALGORITHMS; SEARCH;
D O I
10.1177/00202940241269643
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Batch processing machines are often the bottleneck in semiconductor manufacturing and their scheduling plays a key role in production management. Pioneer researches on multi-objective batch machines scheduling mainly focus on evolutionary algorithms, failing to meet the online scheduling demand. To deal with the challenges confronted by incompatible job families, dynamic job arrivals, capacitated machines and multiple objectives, we propose a clustering-aided multi-agent deep reinforcement learning approach (CA-MADRL) for the scheduling problem. Specifically, to achieve diverse nondominated solutions, an offline multi-objective scheduling algorithm named Multi-Subpopulation fast elitist Non-Dominated Sorting Genetic Algorithm (MS-NSGA-II) is firstly developed to obtain the Pareto Fronts, and a clustering algorithm based on cosine distance is employed to analyze the distribution of Pareto frontier solution, which would be used to guide reward functions design in multi-agent deep reinforcement learning. To realize multi-objective optimization, several reinforcement learning base models are trained for different optimization directions, each of which composed of batch forming agent and batch scheduling agent. To alleviate time complexity of model training, a parameter sharing strategy is introduced between different reinforcement learning base model. By validating the proposed approach with 16 instances designed based on actual production data from a semiconductor manufacturing company, it has been demonstrated that the approach not only meets the high-frequency scheduling requirements of manufacturing systems for parallel batch processing machines but also effectively reduces the total job tardiness and machine energy consumption.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Multi-objective reinforcement learning for designing ethical multi-agent environments
    Rodriguez-Soto, Manel
    Lopez-Sanchez, Maite
    Rodriguez-Aguilar, Juan A.
    NEURAL COMPUTING & APPLICATIONS, 2023,
  • [12] Multi-agent deep reinforcement learning for dynamic reconfigurable shop scheduling considering batch processing and worker cooperation
    Li, Yuxin
    Li, Xinyu
    Gao, Liang
    Lu, Zhibing
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 91
  • [13] Multi-objective optimization of the textile manufacturing process using deep-Q-network based multi-agent reinforcement learning
    He, Zhenglei
    Thomassey, Sebastien
    Zeng, Xianyi
    Xu, Jie
    Yi, Changhai
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 939 - 949
  • [14] Multi-objective Reconfigurable Manufacturing System Scheduling Optimisation: A Deep Reinforcement Learning Approach
    Tang, Jiecheng
    Haddad, Yousef
    Patsavellas, John
    Salonitis, Konstantinos
    IFAC PAPERSONLINE, 2023, 56 (02): : 11082 - 11087
  • [15] Heuristically accelerated reinforcement learning modularization for multi-agent multi-objective problems
    Ferreira, Leonardo Anjoletto
    Costa Ribeiro, Carlos Henrique
    da Costa Bianchi, Reinaldo Augusto
    APPLIED INTELLIGENCE, 2014, 41 (02) : 551 - 562
  • [16] Distributed multi-agent reinforcement learning for multi-objective optimal dispatch of microgrids
    Wang, Xiaowen
    Liu, Shuai
    Xu, Qianwen
    Shao, Xinquan
    ISA TRANSACTIONS, 2025, 158 : 130 - 140
  • [17] Heuristically accelerated reinforcement learning modularization for multi-agent multi-objective problems
    Leonardo Anjoletto Ferreira
    Carlos Henrique Costa Ribeiro
    Reinaldo Augusto da Costa Bianchi
    Applied Intelligence, 2014, 41 : 551 - 562
  • [18] Multi-Objective Dynamic Dispatch Optimisation using Multi-Agent Reinforcement Learning
    Mannion, Patrick
    Mason, Karl
    Devlin, Sam
    Duggan, Jim
    Howley, Enda
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1345 - 1346
  • [19] Multi-agent Deep Reinforcement Learning for Microgrid Energy Scheduling
    Zuo, Zhiqiang
    Li, Zhi
    Wang, Yijing
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6184 - 6189
  • [20] SOLVING MULTI-AGENT SCHEDULING PROBLEMS ON PARALLEL MACHINES WITH A GLOBAL OBJECTIVE FUNCTION
    Sadi, F.
    Soukhal, A.
    Billaut, J. -C.
    RAIRO-OPERATIONS RESEARCH, 2014, 48 (02) : 255 - 269