A clustering-aided multi-agent deep reinforcement learning for multi-objective parallel batch processing machines scheduling in semiconductor manufacturing

被引:0
|
作者
Zhang, Peng [1 ]
Jin, Mengyu [1 ]
Wang, Ming [2 ]
Zhang, Jie [1 ]
He, Junjie [1 ]
Zheng, Peng [3 ]
机构
[1] Donghua Univ, Inst Artificial Intelligence, Shanghai Engn Res Ctr Ind Big Data & Intelligent S, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
[2] Donghua Univ, Coll Mech Engn, Shanghai, Peoples R China
[3] Shanghai Maritime Univ, Coll Logist Engn, Shanghai, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Parallel batch processing machines; dynamic scheduling; multi-objective optimization; parameter sharing strategy; reinforcement learning; SHOP; OPTIMIZATION; ALGORITHMS; SEARCH;
D O I
10.1177/00202940241269643
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Batch processing machines are often the bottleneck in semiconductor manufacturing and their scheduling plays a key role in production management. Pioneer researches on multi-objective batch machines scheduling mainly focus on evolutionary algorithms, failing to meet the online scheduling demand. To deal with the challenges confronted by incompatible job families, dynamic job arrivals, capacitated machines and multiple objectives, we propose a clustering-aided multi-agent deep reinforcement learning approach (CA-MADRL) for the scheduling problem. Specifically, to achieve diverse nondominated solutions, an offline multi-objective scheduling algorithm named Multi-Subpopulation fast elitist Non-Dominated Sorting Genetic Algorithm (MS-NSGA-II) is firstly developed to obtain the Pareto Fronts, and a clustering algorithm based on cosine distance is employed to analyze the distribution of Pareto frontier solution, which would be used to guide reward functions design in multi-agent deep reinforcement learning. To realize multi-objective optimization, several reinforcement learning base models are trained for different optimization directions, each of which composed of batch forming agent and batch scheduling agent. To alleviate time complexity of model training, a parameter sharing strategy is introduced between different reinforcement learning base model. By validating the proposed approach with 16 instances designed based on actual production data from a semiconductor manufacturing company, it has been demonstrated that the approach not only meets the high-frequency scheduling requirements of manufacturing systems for parallel batch processing machines but also effectively reduces the total job tardiness and machine energy consumption.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Dynamic scheduling of tasks in cloud manufacturing with multi-agent reinforcement learning
    Wang, Xiaohan
    Zhang, Lin
    Liu, Yongkui
    Li, Feng
    Chen, Zhen
    Zhao, Chun
    Bai, Tian
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 65 : 130 - 145
  • [22] Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems
    Zhang, Yi
    Zhu, Haihua
    Tang, Dunbing
    Zhou, Tong
    Gui, Yong
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
  • [23] Multi-Objective Optimization in Air-to-Air Communication System Based on Multi-Agent Deep Reinforcement Learning
    Lin, Shaofu
    Chen, Yingying
    Li, Shuopeng
    SENSORS, 2023, 23 (23)
  • [24] Exploring multi-agent reinforcement learning for unrelated parallel machine scheduling
    Zampella, Maria
    Otamendi, Urtzi
    Belaunzaran, Xabier
    Artetxe, Arkaitz
    Olaizola, Igor G.
    Sierra, Basilio
    Longo, Giuseppe
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [25] Multi-objective multi-agent deep reinforcement learning to reduce bus bunching for multiline services with a shared corridor
    Wang, Jiawei
    Sun, Lijun
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 155
  • [26] Joint Multi-Objective Optimization for Radio Access Network Slicing Using Multi-Agent Deep Reinforcement Learning
    Zhou, Guorong
    Zhao, Liqiang
    Zheng, Gan
    Xie, Zhijie
    Song, Shenghui
    Chen, Kwang-Cheng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11828 - 11843
  • [27] MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning
    Hu, Tianmeng
    Luo, Biao
    Yang, Chunhua
    Huang, Tingwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12098 - 12112
  • [28] A novel multi-objective optimization based multi-agent deep reinforcement learning approach for microgrid resources planning
    Abid, Md. Shadman
    Apon, Hasan Jamil
    Hossain, Salman
    Ahmed, Ashik
    Ahshan, Razzaqul
    Lipu, M. S. Hossain
    APPLIED ENERGY, 2024, 353
  • [29] Multi-objective optimization of turbine blade profiles based on multi-agent reinforcement learning
    Li, Lele
    Zhang, Weihao
    Li, Ya
    Jiang, Chiju
    Wang, Yufan
    ENERGY CONVERSION AND MANAGEMENT, 2023, 297
  • [30] ACO-based multi-objective scheduling of parallel batch processing machines with advanced process control constraints
    Li Li
    F. Qiao
    Q. D. Wu
    The International Journal of Advanced Manufacturing Technology, 2009, 44 : 985 - 994