Deep Reinforcement Learning Model for Blood Bank Vehicle Routing Multi-Objective Optimization

被引:2
|
作者
Altaf, Meteb M. [1 ]
Roshdy, Ahmed Samir [2 ]
AlSagri, Hatoon S. [3 ]
机构
[1] King Abdul Aziz City Sci & Technol Riyadh, Adv Mfg & Ind 4 0 Ctr, Riyadh, Saudi Arabia
[2] Data Sci & AI Senior Manager Vodafone, Cairo, Egypt
[3] Al Imam Mohammad Ibn Saud Islamic Univ, Informat Syst Dept, Riyadh, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 02期
关键词
Optimization; blood bank; deep neural network; reinforcement learning; blood centers; multi-objective optimization; LOCATION; MOEA/D;
D O I
10.32604/cmc.2022.019448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The overall healthcare system has been prioritized within development top lists worldwide. Since many national populations are aging, combined with the availability of sophisticated medical treatments, healthcare expenditures are rapidly growing. Blood banks are a major component of any healthcare system, which store and provide the blood products needed for organ transplants, emergency medical treatments, and routine surgeries. Timely delivery of blood products is vital, especially in emergency settings. Hence, blood delivery process parameters such as safety and speed have received attention in the literature, as well as other parameters such as delivery cost. In this paper, delivery time and cost are modeled mathematically and marked as objective functions requiring simultaneous optimization. A solution is proposed based on Deep Reinforcement Learning (DRL) to address the formulated delivery functions as Multi-objective Optimization Problems (MOPs). The basic concept of the solution is to decompose the MOP into a scalar optimization sub-problems set, where each one of these sub-problems is modeled as a separate Neural Network (NN). The overall model parameters for each sub-problem are optimized based on a neighborhood parameter transfer and DRL training algorithm. The optimization step for the subproblems is undertaken collaboratively to optimize the overall model. Paretooptimal solutions can be directly obtained using the trained NN. Specifically, the multi-objective blood bank delivery problem is addressed in this research. One major technical advantage of this approach is that once the trained model is available, it can be scaled without the need for model retraining. The scoring can be obtained directly using a straightforward computation of the NN layers in a limited time. The proposed technique provides a set of technical strength points such as the ability to generalize and solve rapidly compared to other multi-objective optimization methods. The model was trained and tested on 5 major hospitals in Saudi Arabia's Riyadh region, and the simulation results indicated that time and cost decreased by 35% and 30%, respectively. In particular, the proposed model outperformed other state-of-the-art MOP solutions such as Genetic Algorithms and Simulated Annealing.
引用
收藏
页码:3955 / 3967
页数:13
相关论文
共 50 条
  • [21] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
    Abels, Axel
    Roijers, Diederik M.
    Lenaerts, Tom
    Nowe, Ann
    Steckelmacher, Denis
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [22] Robust Dynamic Multi-Objective Vehicle Routing Optimization Method
    Guo, Yi-Nan
    Cheng, Jian
    Luo, Sha
    Gong, Dunwei
    Xue, Yu
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 1891 - 1903
  • [23] A reinforcement learning approach for dynamic multi-objective optimization
    Zou, Fei
    Yen, Gary G.
    Tang, Lixin
    Wang, Chunfeng
    INFORMATION SCIENCES, 2021, 546 : 815 - 834
  • [24] Multi-Objective Optimization in Disaster Backup with Reinforcement Learning
    Yi, Shanwen
    Qin, Yao
    Wang, Hua
    MATHEMATICS, 2025, 13 (03)
  • [25] Multi-objective Adaptive Dynamics Attention Model to Solve Multi-objective Vehicle Routing Problem
    Luo, Guang
    Luo, Jianping
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [26] Multi-objective vehicle routing problems
    Jozefowiez, Nicolas
    Semet, Frederic
    Talbi, El-Ghazali
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2008, 189 (02) : 293 - 309
  • [27] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [28] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [29] Allocation of English Remote Guiding based on Deep Reinforcement Learning and Multi-Objective Optimization
    Jia Zhiyong
    Tian Jing
    Zhao Jing
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 414 - 417
  • [30] Deep Reinforcement Learning for Adaptive Parameter Control in Differential Evolution for Multi-Objective Optimization
    Reijnen, Robbert
    Zhang, Yingqian
    Bukhsh, Zaharah
    Guzek, Mateusz
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 804 - 811