Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

被引:43
|
作者
Yang, Shengluo [1 ,2 ,3 ,4 ]
Xu, Zhigang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Peoples R China
[2] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China
[3] Inst Intelligent Mfg, Shenyang, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; dynamic scheduling and reconfiguration; A2C; reconfigurable manufacturing system (RMS); intelligent scheduling; dynamic job arrival; ITERATED GREEDY ALGORITHM; PERMUTATION FLOW-SHOP; TOTAL TARDINESS; OPTIMIZATION; MINIMIZATION; HEURISTICS; EARLINESS;
D O I
10.1080/00207543.2021.1943037
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 parts per thousand and 2.20 parts per thousand CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.
引用
收藏
页码:4936 / 4953
页数:18
相关论文
共 50 条
  • [21] Verification of intelligent scheduling based on deep reinforcement learning for distributed workshops via discrete event simulation
    Yang, S. L.
    Wang, J. Y.
    Xin, L. M.
    Xu, Z. G.
    ADVANCES IN PRODUCTION ENGINEERING & MANAGEMENT, 2022, 17 (04): : 401 - 412
  • [22] Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning
    Zhang, Cong
    Song, Wen
    Cao, Zhiguang
    Zhang, Jie
    Tan, Puay Siew
    Xu, Chi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] Intelligent Control of Construction Manufacturing Processes using Deep Reinforcement Learning
    Flood, Ian
    Flood, Paris D. L.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS (SIMULTECH), 2022, : 112 - 122
  • [24] Deep Reinforcement Learning for Intelligent Migration of Fog Services in Smart Cities
    Lan, Dapeng
    Taherkordi, Amir
    Eliassen, Frank
    Chen, Zhuang
    Liu, Lei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 230 - 244
  • [25] Deep Reinforcement Learning Empowered Smart Control of Intelligent Reflecting Surface
    Wang, Wei
    Zhang, Wei
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 5856 - 5861
  • [26] Task scheduling based on deep reinforcement learning in a cloud manufacturing environment
    Dong, Tingting
    Xue, Fei
    Xiao, Chuangbai
    Li, Juntao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (11):
  • [27] Smart Master Production Scheduling by Deep Reinforcement Learning: An Exploratory Analysis
    Serrano-Ruiz, Julio C.
    Mula, Josefa
    Poler, Raul
    Diaz-Madronero, Manuel
    NAVIGATING UNPREDICTABILITY: COLLABORATIVE NETWORKS IN NON-LINEAR WORLDS, PRO-VE 2024, PT II, 2024, 727 : 228 - 244
  • [28] Scheduling of decentralized robot services in cloud manufacturing with deep reinforcement learning
    Liu, Yongkui
    Ping, Yaoyao
    Zhang, Lin
    Wang, Lihui
    Xu, Xun
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 80
  • [29] SCHEDULING MISSION RECONFIGURATION FOR AN INTERFEROMETRY SYNTHETIC APERTURE RADAR USING DEEP REINFORCEMENT LEARNING
    Viros-i-Martin, Antoni
    Selva, Daniel
    Alimo, Ryan
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 6941 - 6944
  • [30] SCHED2 : Scheduling Deep Learning Training via Deep Reinforcement Learning
    Luan, Yunteng
    Chen, Xukun
    Zhao, Hanyu
    Yang, Zhi
    Dai, Yafei
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,