A clustering-aided multi-agent deep reinforcement learning for multi-objective parallel batch processing machines scheduling in semiconductor manufacturing

被引:0
|
作者
Zhang, Peng [1 ]
Jin, Mengyu [1 ]
Wang, Ming [2 ]
Zhang, Jie [1 ]
He, Junjie [1 ]
Zheng, Peng [3 ]
机构
[1] Donghua Univ, Inst Artificial Intelligence, Shanghai Engn Res Ctr Ind Big Data & Intelligent S, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
[2] Donghua Univ, Coll Mech Engn, Shanghai, Peoples R China
[3] Shanghai Maritime Univ, Coll Logist Engn, Shanghai, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Parallel batch processing machines; dynamic scheduling; multi-objective optimization; parameter sharing strategy; reinforcement learning; SHOP; OPTIMIZATION; ALGORITHMS; SEARCH;
D O I
10.1177/00202940241269643
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Batch processing machines are often the bottleneck in semiconductor manufacturing and their scheduling plays a key role in production management. Pioneer researches on multi-objective batch machines scheduling mainly focus on evolutionary algorithms, failing to meet the online scheduling demand. To deal with the challenges confronted by incompatible job families, dynamic job arrivals, capacitated machines and multiple objectives, we propose a clustering-aided multi-agent deep reinforcement learning approach (CA-MADRL) for the scheduling problem. Specifically, to achieve diverse nondominated solutions, an offline multi-objective scheduling algorithm named Multi-Subpopulation fast elitist Non-Dominated Sorting Genetic Algorithm (MS-NSGA-II) is firstly developed to obtain the Pareto Fronts, and a clustering algorithm based on cosine distance is employed to analyze the distribution of Pareto frontier solution, which would be used to guide reward functions design in multi-agent deep reinforcement learning. To realize multi-objective optimization, several reinforcement learning base models are trained for different optimization directions, each of which composed of batch forming agent and batch scheduling agent. To alleviate time complexity of model training, a parameter sharing strategy is introduced between different reinforcement learning base model. By validating the proposed approach with 16 instances designed based on actual production data from a semiconductor manufacturing company, it has been demonstrated that the approach not only meets the high-frequency scheduling requirements of manufacturing systems for parallel batch processing machines but also effectively reduces the total job tardiness and machine energy consumption.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A multi-objective multi-agent deep reinforcement learning approach to residential appliance scheduling
    Lu, Junlin
    Mannion, Patrick
    Mason, Karl
    IET SMART GRID, 2022, 5 (04) : 260 - 280
  • [2] Multi-objective scheduling of cloud-edge cooperation in distributed manufacturing via multi-agent deep reinforcement learning
    Guo, Peng
    Shi, Haichao
    Wang, Yi
    Xiong, Jianyu
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024,
  • [3] Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning
    Wang, Yuandou
    Liu, Hang
    Zheng, Wanbo
    Xia, Yunni
    Li, Yawen
    Chen, Peng
    Guo, Kunyin
    Xie, Hong
    IEEE ACCESS, 2019, 7 : 39974 - 39982
  • [4] Multi-Agent Deep Reinforcement Learning based Multi-Objective Resource Optimization in a Distributed Manufacturing System
    Shen, Xinchang
    Tham, Chen-Khong
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [5] Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet
    Nie, Hongrui
    Li, Shaosheng
    Liu, Yong
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 116 - 121
  • [6] Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning
    Tao, Mengxue
    Li, Qiang
    Yu, Junxi
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2025, 13 (01)
  • [7] Multi-agent Deep Reinforcement Learning Based Integrated Scheduling of Machines and AGVs in Discrete Manufacturing Workshop
    Geng, Sai
    Guo, Yu
    Huang, Shaohua
    Sitahong, Adilanmu
    2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024, 2024, : 59 - 64
  • [8] Multi-agent deep reinforcement learning based Predictive Maintenance on parallel machines
    Rodriguez, Marcelo Luis Ruiz
    Kubler, Sylvain
    de Giorgio, Andrea
    Cordy, Maxime
    Robert, Jeremy
    Le Traon, Yves
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
  • [9] Multi-objective Mathematical Modeling for Scheduling Machines in Parallel with Batch Processors
    Ampry, Evy Segarawati
    Komariah, Aan
    Kurniady, Dedy Achmad
    Rafiq, Muhammad
    Priatna, Asep
    Ali, Muneam Hussein
    Marhoon, Haydar Abdulameer
    Thangavelu, Lakshmi
    Chaudhary, Purnima
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2022, 21 (02): : 366 - 380
  • [10] Multi-objective reinforcement learning for designing ethical multi-agent environments
    Rodriguez-Soto, Manel
    Lopez-Sanchez, Maite
    Rodriguez-Aguilar, Juan A.
    NEURAL COMPUTING & APPLICATIONS, 2023,