Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

被引：43

作者：

Yang, Shengluo ^{[1
,2
,3
,4
]}

Xu, Zhigang ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Peoples R China

[2] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China

[3] Inst Intelligent Mfg, Shenyang, Peoples R China

[4] Univ Chinese Acad Sci, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2022年 / 60卷 / 16期

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning; dynamic scheduling and reconfiguration; A2C; reconfigurable manufacturing system (RMS); intelligent scheduling; dynamic job arrival; ITERATED GREEDY ALGORITHM; PERMUTATION FLOW-SHOP; TOTAL TARDINESS; OPTIMIZATION; MINIMIZATION; HEURISTICS; EARLINESS;

D O I：

10.1080/00207543.2021.1943037

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 parts per thousand and 2.20 parts per thousand CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.

引用

页码：4936 / 4953

页数：18

共 50 条

[21] Verification of intelligent scheduling based on deep reinforcement learning for distributed workshops via discrete event simulation
Yang, S. L.
Wang, J. Y.
Xin, L. M.
Xu, Z. G.
ADVANCES IN PRODUCTION ENGINEERING & MANAGEMENT, 2022, 17 (04): : 401 - 412
[22] Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning
Zhang, Cong
Song, Wen
Cao, Zhiguang
Zhang, Jie
Tan, Puay Siew
Xu, Chi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[23] Intelligent Control of Construction Manufacturing Processes using Deep Reinforcement Learning
Flood, Ian
Flood, Paris D. L.
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS (SIMULTECH), 2022, : 112 - 122
[24] Deep Reinforcement Learning for Intelligent Migration of Fog Services in Smart Cities
Lan, Dapeng
Taherkordi, Amir
Eliassen, Frank
Chen, Zhuang
Liu, Lei
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 230 - 244
[25] Deep Reinforcement Learning Empowered Smart Control of Intelligent Reflecting Surface
Wang, Wei
Zhang, Wei
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 5856 - 5861
[26] Task scheduling based on deep reinforcement learning in a cloud manufacturing environment
Dong, Tingting
Xue, Fei
Xiao, Chuangbai
Li, Juntao
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (11):
[27] Smart Master Production Scheduling by Deep Reinforcement Learning: An Exploratory Analysis
Serrano-Ruiz, Julio C.
Mula, Josefa
Poler, Raul
Diaz-Madronero, Manuel
NAVIGATING UNPREDICTABILITY: COLLABORATIVE NETWORKS IN NON-LINEAR WORLDS, PRO-VE 2024, PT II, 2024, 727 : 228 - 244
[28] Scheduling of decentralized robot services in cloud manufacturing with deep reinforcement learning
Liu, Yongkui
Ping, Yaoyao
Zhang, Lin
Wang, Lihui
Xu, Xun
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 80
[29] SCHEDULING MISSION RECONFIGURATION FOR AN INTERFEROMETRY SYNTHETIC APERTURE RADAR USING DEEP REINFORCEMENT LEARNING
Viros-i-Martin, Antoni
Selva, Daniel
Alimo, Ryan
IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 6941 - 6944
[30] SCHED2 : Scheduling Deep Learning Training via Deep Reinforcement Learning
Luan, Yunteng
Chen, Xukun
Zhao, Hanyu
Yang, Zhi
Dai, Yafei
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,

← 1 2 3 4 5 →