A Deep Reinforcement Advantage Actor-Critic-Based Co-Evolution Algorithm for Energy-Aware Distributed Heterogeneous Flexible Job Shop Scheduling

被引：0

作者：

Xu, Hua ^{[1
]}

Tao, Juntai ^{[1
]}

Huang, Lingxiang ^{[1
]}

Zhang, Chenjie ^{[1
]}

Zheng, Jianlu ^{[1
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, 1800 Li Hu Ave, Wuxi 214122, Peoples R China

来源：

PROCESSES | 2025年 / 13卷 / 01期

关键词：

deep reinforcement learning (DRL); co-evolution; dueling deep Q-networks; distributed heterogeneous flexible job shop scheduling problem (DHF[!text type='JS']JS[!/text]P); advantage actor-critic (AAC); FLOW-SHOP; MINIMIZING MAKESPAN; GENETIC ALGORITHM; OPTIMIZATION; SEARCH;

D O I：

10.3390/pr13010095

中图分类号：

TQ [化学工业];

学科分类号：

0817 ;

摘要：

With the rapid advancement of the manufacturing industry and the widespread implementation of intelligent manufacturing systems, the energy-aware distributed heterogeneous flexible job shop scheduling problem (DHFJSP) has emerged as a critical challenge in optimizing modern production systems. This study introduces an innovative method to reduce both the makespan and the total energy consumption (TEC) in the context of the DHFJSP. A deep reinforcement advantage Actor-Critic-based co-evolution algorithm (DRAACCE) is proposed to address the issue, which leverages the powerful decision-making and perception abilities of the advantage Actor-Critic (AAC) method. The DRAACCE algorithm consists of three main components: First, to ensure a balance between global and local search capabilities, we propose a new co-evolutionary strategy. This enables the algorithm to explore the solution space efficiently while maintaining robust exploration and exploitation. Next, a novel evolution strategy is introduced to improve the algorithm's convergence rate and solution diversity, ensuring that the search process is both fast and effective. Finally, we integrate deep reinforcement learning with the advantage Actor-Critic framework to select elite solutions, enhancing the optimization process and leading to superior performance in minimizing both TEC and makespan. Extensive experiments validate the effectiveness of the proposed DRAACCE algorithm. The experimental results show that DRAACCE significantly outperforms existing state-of-the-art methods on all 20 instances and a real-world case, achieving better solutions in terms of both makespan and TEC.

引用

页数：23

共 35 条

[31] Intelligent learning-based cooperative and competitive multi-objective optimization for energy-aware distributed heterogeneous welding shop scheduling
Fayong Zhang
Caixian Li
Rui Li
Wenyin Gong
Complex & Intelligent Systems, 2024, 10 : 3459 - 3471
[32] Matheuristic and learning-oriented multi-objective artificial bee colony algorithm for energy-aware flexible assembly job shop scheduling problem
Hu, Yifan
Zhang, Liping
Zhang, Zikai
Li, Zixiang
Tang, Qiuhua
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[33] Multi-objective fitness landscape-based estimation of distribution algorithm for distributed heterogeneous flexible job shop scheduling problem
Zhao, Fuqing
Li, Mengjie
Zhu, Ningning
Xu, Tianpeng
Jonrinaldi
APPLIED SOFT COMPUTING, 2025, 171
[34] A multidimensional probabilistic model based evolutionary algorithm for the energy-efficient distributed flexible job-shop scheduling problem
Zhang Z.-Q.
Li Y.
Qian B.
Hu R.
Yang J.-B.
Engineering Applications of Artificial Intelligence, 2024, 135
[35] A Q-learning-based improved multi-objective genetic algorithm for solving distributed heterogeneous assembly flexible job shop scheduling problems with transfers
Yang, Zhijie
Hu, Xinkai
Li, Yibing
Liang, Muxi
Wang, Kaipu
Wang, Lei
Tang, Hongtao
Guo, Shunsheng
JOURNAL OF MANUFACTURING SYSTEMS, 2025, 79 : 398 - 418

← 1 2 3 4 →