Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

被引：43

作者：

Yang, Shengluo ^{[1
,2
,3
,4
]}

Xu, Zhigang ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Peoples R China

[2] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China

[3] Inst Intelligent Mfg, Shenyang, Peoples R China

[4] Univ Chinese Acad Sci, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2022年 / 60卷 / 16期

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning; dynamic scheduling and reconfiguration; A2C; reconfigurable manufacturing system (RMS); intelligent scheduling; dynamic job arrival; ITERATED GREEDY ALGORITHM; PERMUTATION FLOW-SHOP; TOTAL TARDINESS; OPTIMIZATION; MINIMIZATION; HEURISTICS; EARLINESS;

D O I：

10.1080/00207543.2021.1943037

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 parts per thousand and 2.20 parts per thousand CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.

引用

页码：4936 / 4953

页数：18

共 50 条

[31] Self-repair of smart manufacturing systems by deep reinforcement learning
Epureanu, Bogdan, I
Li, Xingyu
Nassehi, Aydin
Koren, Yoram
CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2020, 69 (01) : 421 - 424
[32] An intelligent scheduling control method for smart grid based on deep learning
Tong, Zhanying
Zhou, Yingying
Xu, Ke
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (05) : 7679 - 7695
[33] Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Gracla, Steffen
Beck, Edgar
Bockelmann, Carsten
Dekorsy, Armin
IEEE COMMUNICATIONS LETTERS, 2023, 27 (01) : 210 - 213
[34] Dynamic Job Shop Scheduling via Deep Reinforcement Learning
Liang, Xinjie
Song, Wen
Wei, Pengfei
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 369 - 376
[35] Optimal Order Acceptance and Scheduling via Deep Reinforcement Learning
Qian, Jing
Chen, Chao
Wu, Keyu
Yu, Li
2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 63 - 68
[36] Intelligent Reflecting Surface Configurations for Smart Radio Using Deep Reinforcement Learning
Wang, Wei
Zhang, Wei
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) : 2335 - 2346
[37] Distributed Real-Time Scheduling in Cloud Manufacturing by Deep Reinforcement Learning
Zhang, Lixiang
Yang, Chen
Yan, Yan
Hu, Yaoguang
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (12) : 8999 - 9007
[38] Study on deep reinforcement learning for multi-task scheduling in cloud manufacturing
Xiao, Jiuhong
Cai, Yishuai
Chen, Yong
INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2025,
[39] Deep Reinforcement Learning for Multiobjective Scheduling in Industry 5.0 Reconfigurable Manufacturing Systems
Bezoui, Madani
Kermali, Abdelfatah
Bounceur, Ahcene
Qaisar, Saeed Mian
Almaktoom, Abdulaziz Turki
MACHINE LEARNING FOR NETWORKING, MLN 2023, 2024, 14525 : 90 - 107
[40] Platform-enterprise collaborative scheduling in cloud manufacturing with deep reinforcement learning
Niu, Wenbo
Liu, Yongkui
Ping, Yaoyao
Zhang, Lin
INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2025,

← 1 2 3 4 5 →