Dynamic Selection of Priority Rules Based on Deep Reinforcement Learning for Rescheduling of RCPSP

被引:4
|
作者
Wang, Teng [1 ]
Cheng, Wei [1 ]
Zhang, Yahui [1 ]
Hu, Xiaofeng [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai 200240, Peoples R China
来源
IFAC PAPERSONLINE | 2022年 / 55卷 / 10期
关键词
reinforcement learning; project rescheduling; priority rule; transfer learning;
D O I
10.1016/j.ifacol.2022.10.025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the uncertainties in the project execution process, the original plan often cannot be carried out correctly and needs to be rescheduled to repair the plan. In this case, rescheduling is required to repair the plan. Priority rules are the most common method for rescheduling because of their known advantages such as simplicity and fast. Although numerous papers have conducted comparative studies on different priority rules, managers often do not know which rules should be used for project rescheduling in specific situations. In this paper, we propose a reinforcement learning based approach for adaptive selection of priority rules in dynamic environments, which includes off-line phase and on-line phase. Reinforcement learning is used to learn scheduling knowledge and obtain the scheduling model in the off-line phase. Transfer learning can be used to reuse scheduling models between different cases in this phase. In the online phase, the scheduling model is used to adaptively select appropriate rules for rescheduling when the initial plan is infeasible due to unexpected disturbance. Experiments show that the proposed method has better rescheduling performance than other heuristic algorithms based on priority rules under different disturbances. Besides, we find that the time consumption of off-line training can be greatly reduced by using transfer learning, which also proves that our method can indeed learn some essential scheduling knowledge. Copyright (C) 2022 The Authors.
引用
收藏
页码:2144 / 2149
页数:6
相关论文
共 50 条
  • [41] Deep Reinforcement Learning based Antenna Selection for Cell Outage Compensation
    Iwamoto, Masayoshi
    Suzuki, Akito
    Kobayashi, Masahiro
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3945 - 3950
  • [42] Signal Priority Control for Trams Using Deep Reinforcement Learning
    Wang Y.-P.
    Guo G.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (12): : 2366 - 2377
  • [43] Transit Signal Priority for Arterial Road with Deep Reinforcement Learning
    Long, Meng
    Chung, Edward
    2023 8TH INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS, MT-ITS, 2023,
  • [44] Deep reinforcement learning for transit signal priority in a connected environment
    Long, Meng
    Zou, Xiexin
    Zhou, Yue
    Chung, Edward
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 142
  • [45] Research on parameter optimisation of dynamic priority scheduling algorithm based on improved reinforcement learning
    Meng, Shanshan
    Zhu, Qiang
    Xia, Fei
    Lu, Jianfeng
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2020, 14 (16) : 3171 - 3178
  • [46] Experience selection in deep reinforcement learning for control
    De Bruin, Tim
    Kober, Jens
    Tuyls, Karl
    Babuška, Robert
    Journal of Machine Learning Research, 2018, 19 : 1 - 56
  • [47] Experience Selection in Deep Reinforcement Learning for Control
    de Bruin, Tim
    Kober, Jens
    Tuyls, Karl
    Babuska, Robert
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 19
  • [48] Dynamic node selection in camera networks based on approximate reinforcement learning
    Li, Qian
    Sun, Zhengxing
    Chen, Songle
    Xia, Shiming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (24) : 17393 - 17419
  • [49] Deep Reinforcement Learning for Dynamic Radio Access Selection over Future Wireless Networks
    Carballo Gonzalez, Claudia
    Fontes Pupo, Ernesto
    Pereira-Ruisanchez, Dariel
    Atzori, Luigi
    Murroni, Maurizio
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
  • [50] Dynamic node selection in camera networks based on approximate reinforcement learning
    Qian Li
    Zhengxing Sun
    Songle Chen
    Shiming Xia
    Multimedia Tools and Applications, 2016, 75 : 17393 - 17419