Deep Reinforcement Learning Model for Blood Bank Vehicle Routing Multi-Objective Optimization

被引:2
|
作者
Altaf, Meteb M. [1 ]
Roshdy, Ahmed Samir [2 ]
AlSagri, Hatoon S. [3 ]
机构
[1] King Abdul Aziz City Sci & Technol Riyadh, Adv Mfg & Ind 4 0 Ctr, Riyadh, Saudi Arabia
[2] Data Sci & AI Senior Manager Vodafone, Cairo, Egypt
[3] Al Imam Mohammad Ibn Saud Islamic Univ, Informat Syst Dept, Riyadh, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 02期
关键词
Optimization; blood bank; deep neural network; reinforcement learning; blood centers; multi-objective optimization; LOCATION; MOEA/D;
D O I
10.32604/cmc.2022.019448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The overall healthcare system has been prioritized within development top lists worldwide. Since many national populations are aging, combined with the availability of sophisticated medical treatments, healthcare expenditures are rapidly growing. Blood banks are a major component of any healthcare system, which store and provide the blood products needed for organ transplants, emergency medical treatments, and routine surgeries. Timely delivery of blood products is vital, especially in emergency settings. Hence, blood delivery process parameters such as safety and speed have received attention in the literature, as well as other parameters such as delivery cost. In this paper, delivery time and cost are modeled mathematically and marked as objective functions requiring simultaneous optimization. A solution is proposed based on Deep Reinforcement Learning (DRL) to address the formulated delivery functions as Multi-objective Optimization Problems (MOPs). The basic concept of the solution is to decompose the MOP into a scalar optimization sub-problems set, where each one of these sub-problems is modeled as a separate Neural Network (NN). The overall model parameters for each sub-problem are optimized based on a neighborhood parameter transfer and DRL training algorithm. The optimization step for the subproblems is undertaken collaboratively to optimize the overall model. Paretooptimal solutions can be directly obtained using the trained NN. Specifically, the multi-objective blood bank delivery problem is addressed in this research. One major technical advantage of this approach is that once the trained model is available, it can be scaled without the need for model retraining. The scoring can be obtained directly using a straightforward computation of the NN layers in a limited time. The proposed technique provides a set of technical strength points such as the ability to generalize and solve rapidly compared to other multi-objective optimization methods. The model was trained and tested on 5 major hospitals in Saudi Arabia's Riyadh region, and the simulation results indicated that time and cost decreased by 35% and 30%, respectively. In particular, the proposed model outperformed other state-of-the-art MOP solutions such as Genetic Algorithms and Simulated Annealing.
引用
收藏
页码:3955 / 3967
页数:13
相关论文
共 50 条
  • [31] Deep Reinforcement Learning Based Adaptive Operator Selection for Evolutionary Multi-Objective Optimization
    Tian, Ye
    Li, Xiaopeng
    Ma, Haiping
    Zhang, Xingyi
    Tan, Kay Chen
    Jin, Yaochu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04): : 1051 - 1064
  • [32] A Multi-Objective Optimization Method for Shelter Site Selection Based on Deep Reinforcement Learning
    Zhang, Di
    Meng, Huan
    Wang, Moyang
    Xu, Xianrui
    Yan, Jianhai
    Li, Xiang
    TRANSACTIONS IN GIS, 2024, 28 (08) : 2722 - 2741
  • [33] Evolutionary Multi-objective Optimization for Multi-depot Vehicle Routing in Logistics
    Bi, Xiaowen
    Han, Zeyu
    Tang, Wallace K. S.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2017, 10 (01) : 1337 - 1344
  • [34] Evolutionary Multi-objective Optimization for Multi-depot Vehicle Routing in Logistics
    Xiaowen Bi
    Zeyu Han
    Wallace K. S. Tang
    International Journal of Computational Intelligence Systems, 2017, 10 : 1337 - 1344
  • [35] Wireless Resource Allocation Algorithm Based on Multi-Objective Deep Reinforcement Learning for Vehicle-to-Vehicle Communications
    Li, Ke
    Ma, Sai
    Dai, Penglin
    Ren, Jing
    Fan, Pingzhi
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (09): : 2229 - 2245
  • [36] Multi-Objective Optimization for the Vehicle Routing Problem With Outsourcing and Profit Balancing
    Zhang, Zizhen
    Qin, Hu
    Li, Yanzhi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (05) : 1987 - 2001
  • [37] Multi-Objective Joint Optimization of Loading and Capacity Vehicle Routing Problem
    Wang, Chao
    Jin, Chun
    Han, Jim
    2013 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2013, : 251 - 255
  • [38] Optimization of Vehicle Routing Problem Based on Multi-objective Genetic Algorithm
    Zhong, Ru
    Wu, Jianping
    Du, Yiman
    SUSTAINABLE DEVELOPMENT OF URBAN INFRASTRUCTURE, PTS 1-3, 2013, 253-255 : 1356 - +
  • [39] A novel particle swarm optimization for multi-objective vehicle routing problem
    Qin, Guihe, 2016, Xi'an Jiaotong University (50):
  • [40] Multi-objective path planning based on deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Cui, Yunfei
    Du, Xue
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3273 - 3279