Out-of-order execution enabled deep reinforcement learning for dynamic additive manufacturing scheduling

被引:3
|
作者
Sun, Mingyue [1 ]
Ding, Jiyuchen [2 ]
Zhao, Zhiheng [1 ,2 ,3 ]
Chen, Jian [4 ]
Huang, George Q. [1 ,2 ]
Wang, Lihui [5 ]
机构
[1] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Res Inst Adv Mfg, Hong Kong, Peoples R China
[3] Huazhong Univ Sci & Technol, State Key Lab Intelligent Mfg Equipment & Technol, Wuhan, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Dept Econ Management, Nanjing, Peoples R China
[5] KTH Royal Inst Technol, Dept Prod Engn, Stockholm, Sweden
基金
中国国家自然科学基金;
关键词
Dynamic scheduling; Additive manufacturing; Dynamic order arrival; Dueling DQN; ALGORITHM; EFFICIENT; MACHINE;
D O I
10.1016/j.rcim.2024.102841
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Additive Manufacturing (AM) has revolutionized the production landscape by enabling on-demand customized manufacturing. However, the efficient management of dynamic AM orders poses significant challenges for production planning and scheduling. This paper addresses the dynamic scheduling problem considering batch processing, random order arrival and machine eligibility constraints, aiming to minimize total tardiness in a parallel non-identical AM machine environment. To tackle this problem, we propose the out-of-order enabled dueling deep Q network (O3-DDQN) approach. In the proposed approach, the problem is formulated as a Markov decision process (MDP). Three-dimensional features, encompassing dynamic orders, AM machines, and delays, are extracted using a 'look around' method to represent the production status at a rescheduling point. Additionally, five novel composite scheduling rules based on the out-of-order principle are introduced for selection when an AM machine completes processing or a new order arrives. Moreover, we design a reward function that is strongly correlated with the objective to evaluate the agent's chosen action. Experimental results demonstrate the superiority of the O3-DDQN approach over single scheduling rules, randomly selected rules, and the classic DQN method. The average improvement rate of performance reaches 13.09% compared to composite scheduling rules and random rules. Additionally, the O3-DDQN outperforms the classic DQN agent with a 6.54% improvement rate. The O3-DDQN algorithm improves scheduling in dynamic AM environments, enhancing productivity and on-time delivery. This research contributes to advancing AM production and offers insights into efficient resource allocation.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Deep reinforcement learning-based dynamic scheduling for resilient and sustainable manufacturing: A systematic review
    Zhang, Chao
    Juraschek, Max
    Herrmann, Christoph
    JOURNAL OF MANUFACTURING SYSTEMS, 2024, 77 : 962 - 989
  • [42] Job shop smart manufacturing scheduling by deep reinforcement learning
    Serrano-Ruiz, Julio C.
    Mula, Josefa
    Poler, Raul
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2024, 38
  • [43] Scheduling of Customized Tasks in Cloud Manufacturing with Deep Reinforcement Learning
    Lv, Ming
    Cao, Yu
    Qiu, Xingbo
    Liu, Yongkui
    Zhang, Lin
    INTELLIGENT NETWORKED THINGS, CINT 2024, PT II, 2024, 2139 : 241 - 252
  • [44] Dynamic VNF Scheduling: A Deep Reinforcement Learning Approach
    Zhang, Zixiao
    He, Fujun
    Oki, Eiji
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (07) : 557 - 570
  • [45] Geometric deep reinforcement learning for dynamic DAG scheduling
    Grinsztajn, Nathan
    Beaumont, Olivier
    Jeannot, Emmanuel
    Preux, Philippe
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 258 - 265
  • [46] Teleconsultation dynamic scheduling with a deep reinforcement learning approach
    Chen, Wenjia
    Li, Jinlin
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 149
  • [47] Data-driven out-of-order model for synchronized planning, scheduling, and execution in modular construction fit-out management
    Jiang, Yishuo
    Li, Mingxing
    Ma, Benedict Jun
    Zhong, Ray Y.
    Huang, George Q.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (16) : 2457 - 2480
  • [48] Analysis and simulation of an out-of-order execution model in vector multiprocessor systems
    Univ Coll Dublin, Dublin, United Kingdom
    Parallel Comput, 13 (1963-1986):
  • [49] Can out-of-order instruction execution in multiprocessors be made sequentially consistent?
    Higham, L
    Kawash, J
    NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 261 - 265
  • [50] Solving a Dynamic Scheduling Problem for a Manufacturing System with Reinforcement Learning
    Heik, David
    Bahrpeyma, Fouad
    Reichelt, Dirk
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 413 - 432