Deep Reinforcement Learning Model for Blood Bank Vehicle Routing Multi-Objective Optimization

被引：2

作者：

Altaf, Meteb M. ^{[1
]}

Roshdy, Ahmed Samir ^{[2
]}

AlSagri, Hatoon S. ^{[3
]}

机构：

[1] King Abdul Aziz City Sci & Technol Riyadh, Adv Mfg & Ind 4 0 Ctr, Riyadh, Saudi Arabia

[2] Data Sci & AI Senior Manager Vodafone, Cairo, Egypt

[3] Al Imam Mohammad Ibn Saud Islamic Univ, Informat Syst Dept, Riyadh, Saudi Arabia

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 02期

关键词：

Optimization; blood bank; deep neural network; reinforcement learning; blood centers; multi-objective optimization; LOCATION; MOEA/D;

D O I：

10.32604/cmc.2022.019448

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The overall healthcare system has been prioritized within development top lists worldwide. Since many national populations are aging, combined with the availability of sophisticated medical treatments, healthcare expenditures are rapidly growing. Blood banks are a major component of any healthcare system, which store and provide the blood products needed for organ transplants, emergency medical treatments, and routine surgeries. Timely delivery of blood products is vital, especially in emergency settings. Hence, blood delivery process parameters such as safety and speed have received attention in the literature, as well as other parameters such as delivery cost. In this paper, delivery time and cost are modeled mathematically and marked as objective functions requiring simultaneous optimization. A solution is proposed based on Deep Reinforcement Learning (DRL) to address the formulated delivery functions as Multi-objective Optimization Problems (MOPs). The basic concept of the solution is to decompose the MOP into a scalar optimization sub-problems set, where each one of these sub-problems is modeled as a separate Neural Network (NN). The overall model parameters for each sub-problem are optimized based on a neighborhood parameter transfer and DRL training algorithm. The optimization step for the subproblems is undertaken collaboratively to optimize the overall model. Paretooptimal solutions can be directly obtained using the trained NN. Specifically, the multi-objective blood bank delivery problem is addressed in this research. One major technical advantage of this approach is that once the trained model is available, it can be scaled without the need for model retraining. The scoring can be obtained directly using a straightforward computation of the NN layers in a limited time. The proposed technique provides a set of technical strength points such as the ability to generalize and solve rapidly compared to other multi-objective optimization methods. The model was trained and tested on 5 major hospitals in Saudi Arabia's Riyadh region, and the simulation results indicated that time and cost decreased by 35% and 30%, respectively. In particular, the proposed model outperformed other state-of-the-art MOP solutions such as Genetic Algorithms and Simulated Annealing.

引用

页码：3955 / 3967

页数：13

共 50 条

[21] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
Abels, Axel
Roijers, Diederik M.
Lenaerts, Tom
Nowe, Ann
Steckelmacher, Denis
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[22] Robust Dynamic Multi-Objective Vehicle Routing Optimization Method
Guo, Yi-Nan
Cheng, Jian
Luo, Sha
Gong, Dunwei
Xue, Yu
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 1891 - 1903
[23] A reinforcement learning approach for dynamic multi-objective optimization
Zou, Fei
Yen, Gary G.
Tang, Lixin
Wang, Chunfeng
INFORMATION SCIENCES, 2021, 546 : 815 - 834
[24] Multi-Objective Optimization in Disaster Backup with Reinforcement Learning
Yi, Shanwen
Qin, Yao
Wang, Hua
MATHEMATICS, 2025, 13 (03)
[25] Multi-objective Adaptive Dynamics Attention Model to Solve Multi-objective Vehicle Routing Problem
Luo, Guang
Luo, Jianping
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
[26] Multi-objective vehicle routing problems
Jozefowiez, Nicolas
Semet, Frederic
Talbi, El-Ghazali
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2008, 189 (02) : 293 - 309
[27] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
Horie, Naoto
Matsui, Tohgoroh
Moriyama, Koichi
Mutoh, Atsuko
Inuzuka, Nobuhiro
ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
[28] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
Naoto Horie
Tohgoroh Matsui
Koichi Moriyama
Atsuko Mutoh
Nobuhiro Inuzuka
Artificial Life and Robotics, 2019, 24 : 352 - 359
[29] Allocation of English Remote Guiding based on Deep Reinforcement Learning and Multi-Objective Optimization
Jia Zhiyong
Tian Jing
Zhao Jing
PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 414 - 417
[30] Deep Reinforcement Learning for Adaptive Parameter Control in Differential Evolution for Multi-Objective Optimization
Reijnen, Robbert
Zhang, Yingqian
Bukhsh, Zaharah
Guzek, Mateusz
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 804 - 811

← 1 2 3 4 5 →