A Deep Reinforcement Advantage Actor-Critic-Based Co-Evolution Algorithm for Energy-Aware Distributed Heterogeneous Flexible Job Shop Scheduling

被引:0
|
作者
Xu, Hua [1 ]
Tao, Juntai [1 ]
Huang, Lingxiang [1 ]
Zhang, Chenjie [1 ]
Zheng, Jianlu [1 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, 1800 Li Hu Ave, Wuxi 214122, Peoples R China
关键词
deep reinforcement learning (DRL); co-evolution; dueling deep Q-networks; distributed heterogeneous flexible job shop scheduling problem (DHF[!text type='JS']JS[!/text]P); advantage actor-critic (AAC); FLOW-SHOP; MINIMIZING MAKESPAN; GENETIC ALGORITHM; OPTIMIZATION; SEARCH;
D O I
10.3390/pr13010095
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
With the rapid advancement of the manufacturing industry and the widespread implementation of intelligent manufacturing systems, the energy-aware distributed heterogeneous flexible job shop scheduling problem (DHFJSP) has emerged as a critical challenge in optimizing modern production systems. This study introduces an innovative method to reduce both the makespan and the total energy consumption (TEC) in the context of the DHFJSP. A deep reinforcement advantage Actor-Critic-based co-evolution algorithm (DRAACCE) is proposed to address the issue, which leverages the powerful decision-making and perception abilities of the advantage Actor-Critic (AAC) method. The DRAACCE algorithm consists of three main components: First, to ensure a balance between global and local search capabilities, we propose a new co-evolutionary strategy. This enables the algorithm to explore the solution space efficiently while maintaining robust exploration and exploitation. Next, a novel evolution strategy is introduced to improve the algorithm's convergence rate and solution diversity, ensuring that the search process is both fast and effective. Finally, we integrate deep reinforcement learning with the advantage Actor-Critic framework to select elite solutions, enhancing the optimization process and leading to superior performance in minimizing both TEC and makespan. Extensive experiments validate the effectiveness of the proposed DRAACCE algorithm. The experimental results show that DRAACCE significantly outperforms existing state-of-the-art methods on all 20 instances and a real-world case, achieving better solutions in terms of both makespan and TEC.
引用
收藏
页数:23
相关论文
共 35 条
  • [31] Intelligent learning-based cooperative and competitive multi-objective optimization for energy-aware distributed heterogeneous welding shop scheduling
    Fayong Zhang
    Caixian Li
    Rui Li
    Wenyin Gong
    Complex & Intelligent Systems, 2024, 10 : 3459 - 3471
  • [32] Matheuristic and learning-oriented multi-objective artificial bee colony algorithm for energy-aware flexible assembly job shop scheduling problem
    Hu, Yifan
    Zhang, Liping
    Zhang, Zikai
    Li, Zixiang
    Tang, Qiuhua
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [33] Multi-objective fitness landscape-based estimation of distribution algorithm for distributed heterogeneous flexible job shop scheduling problem
    Zhao, Fuqing
    Li, Mengjie
    Zhu, Ningning
    Xu, Tianpeng
    Jonrinaldi
    APPLIED SOFT COMPUTING, 2025, 171
  • [34] A multidimensional probabilistic model based evolutionary algorithm for the energy-efficient distributed flexible job-shop scheduling problem
    Zhang Z.-Q.
    Li Y.
    Qian B.
    Hu R.
    Yang J.-B.
    Engineering Applications of Artificial Intelligence, 2024, 135
  • [35] A Q-learning-based improved multi-objective genetic algorithm for solving distributed heterogeneous assembly flexible job shop scheduling problems with transfers
    Yang, Zhijie
    Hu, Xinkai
    Li, Yibing
    Liang, Muxi
    Wang, Kaipu
    Wang, Lei
    Tang, Hongtao
    Guo, Shunsheng
    JOURNAL OF MANUFACTURING SYSTEMS, 2025, 79 : 398 - 418