Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

被引:43
|
作者
Yang, Shengluo [1 ,2 ,3 ,4 ]
Xu, Zhigang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Peoples R China
[2] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China
[3] Inst Intelligent Mfg, Shenyang, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; dynamic scheduling and reconfiguration; A2C; reconfigurable manufacturing system (RMS); intelligent scheduling; dynamic job arrival; ITERATED GREEDY ALGORITHM; PERMUTATION FLOW-SHOP; TOTAL TARDINESS; OPTIMIZATION; MINIMIZATION; HEURISTICS; EARLINESS;
D O I
10.1080/00207543.2021.1943037
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 parts per thousand and 2.20 parts per thousand CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.
引用
收藏
页码:4936 / 4953
页数:18
相关论文
共 50 条
  • [31] Self-repair of smart manufacturing systems by deep reinforcement learning
    Epureanu, Bogdan, I
    Li, Xingyu
    Nassehi, Aydin
    Koren, Yoram
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2020, 69 (01) : 421 - 424
  • [32] An intelligent scheduling control method for smart grid based on deep learning
    Tong, Zhanying
    Zhou, Yingying
    Xu, Ke
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (05) : 7679 - 7695
  • [33] Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
    Gracla, Steffen
    Beck, Edgar
    Bockelmann, Carsten
    Dekorsy, Armin
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (01) : 210 - 213
  • [34] Dynamic Job Shop Scheduling via Deep Reinforcement Learning
    Liang, Xinjie
    Song, Wen
    Wei, Pengfei
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 369 - 376
  • [35] Optimal Order Acceptance and Scheduling via Deep Reinforcement Learning
    Qian, Jing
    Chen, Chao
    Wu, Keyu
    Yu, Li
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 63 - 68
  • [36] Intelligent Reflecting Surface Configurations for Smart Radio Using Deep Reinforcement Learning
    Wang, Wei
    Zhang, Wei
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) : 2335 - 2346
  • [37] Distributed Real-Time Scheduling in Cloud Manufacturing by Deep Reinforcement Learning
    Zhang, Lixiang
    Yang, Chen
    Yan, Yan
    Hu, Yaoguang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (12) : 8999 - 9007
  • [38] Study on deep reinforcement learning for multi-task scheduling in cloud manufacturing
    Xiao, Jiuhong
    Cai, Yishuai
    Chen, Yong
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2025,
  • [39] Deep Reinforcement Learning for Multiobjective Scheduling in Industry 5.0 Reconfigurable Manufacturing Systems
    Bezoui, Madani
    Kermali, Abdelfatah
    Bounceur, Ahcene
    Qaisar, Saeed Mian
    Almaktoom, Abdulaziz Turki
    MACHINE LEARNING FOR NETWORKING, MLN 2023, 2024, 14525 : 90 - 107
  • [40] Platform-enterprise collaborative scheduling in cloud manufacturing with deep reinforcement learning
    Niu, Wenbo
    Liu, Yongkui
    Ping, Yaoyao
    Zhang, Lin
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2025,