Dynamic Selection of Priority Rules Based on Deep Reinforcement Learning for Rescheduling of RCPSP

Cited by: 4

Authors:

Wang, Teng [1]

Cheng, Wei [1]

Zhang, Yahui [1]

Hu, Xiaofeng [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai 200240, Peoples R China

Source:

IFAC PAPERSONLINE | 2022 / Vol. 55 / Issue 10

Keywords:

reinforcement learning; project rescheduling; priority rule; transfer learning

DOI:

10.1016/j.ifacol.2022.10.025

Chinese Library Classification:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Because of uncertainties during project execution, the original plan often cannot be carried out as intended and must be rescheduled to repair it. Priority rules are the most common rescheduling method because of their known advantages, such as simplicity and speed. Although numerous papers have compared different priority rules, managers often do not know which rules to use for project rescheduling in a specific situation. In this paper, we propose a reinforcement learning based approach for adaptive selection of priority rules in dynamic environments, which consists of an offline phase and an online phase. In the offline phase, reinforcement learning is used to learn scheduling knowledge and obtain the scheduling model; transfer learning can also be used in this phase to reuse scheduling models between different cases. In the online phase, the scheduling model adaptively selects appropriate rules for rescheduling when the initial plan becomes infeasible due to an unexpected disturbance. Experiments show that the proposed method achieves better rescheduling performance than other priority-rule-based heuristic algorithms under different disturbances. In addition, we find that transfer learning greatly reduces the time consumed by offline training, which also indicates that our method can indeed learn some essential scheduling knowledge. Copyright (C) 2022 The Authors.


Pages: 2144-2149

Page count: 6
