Artificial Pancreas Control for Diabetes using TD3 Deep Reinforcement Learning

被引:4
|
作者
Mackey, Alan [1 ]
Furey, Eoghan [1 ]
机构
[1] Letterkenny Inst Technol, Dept Comp, Letterkenny, Ireland
关键词
Artificial Pancreas; Diabetes; Deep Reinforcement learning; TD3; MINIMAL MODEL; BLOOD-GLUCOSE;
D O I
10.1109/ISSC55427.2022.9826219
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Diabetes Mellitus is a chronic condition that affects approximately 6.5% of the population in Ireland. As well as being a burden on those who suffer from it, it is a huge burden to the state and accounts for approximately 10% of total global health spend. Diabetes cannot be managed from a clinical setting so there is a requirement for self-management with a constant need to understand what current blood glucose values are and responding by treatment with an appropriate dose of insulin. Fortunately, diabetes technology has improved dramatically in the last number of years with the invention of the continuous glucose monitor (CGM) that can report a blood glucose reading as frequently as every five minutes and insulin pumps that infuse insulin in frequent small doses mimicking endogenous insulin. Currently humans are still required to manage these devices, but it is every patient's (and clinicians) wish to close the loop and automate control. This study looks at control algorithms and asks if deep reinforcement learning (DRL) can be used as a potential solution for devising patient specific policies for control. A Twin Delayed Deep Deterministic Policy Gradient (TD3) model is implemented in a simulated environment and tested on three in-silico patients. The results show promise in controlling blood glucose profiles for the patients but in a limited setting. It concludes that while DRL is capable of learning to control blood glucose further research is required before it could be considered for human use.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Manipulator Control using Federated Deep Reinforcement Learning
    Shivkumar, S.
    Kumaar, A. A. Nippun
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [32] Dynamic metasurface control using Deep Reinforcement Learning
    Zhao, Ying
    Li, Liang
    Lanteri, Stephane
    Viquerat, Jonathan
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2022, 197 : 377 - 395
  • [33] Basal Glucose Control in Type 1 Diabetes Using Deep Reinforcement Learning: An In Silico Validation
    Zhu, Taiyu
    Li, Kezhi
    Herrero, Pau
    Georgiou, Pantelis
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (04) : 1223 - 1232
  • [34] Advancing Active Suspension Control With TD3-PSC: Integrating Physical Safety Constraints Into Deep Reinforcement Learning
    Deng, Mingxing
    Sun, Dongxu
    Zhan, Liu
    Xu, Xiaowei
    Zou, Junyi
    IEEE ACCESS, 2024, 12 : 115628 - 115641
  • [35] Genetically optimized TD3 algorithm for efficient access control in the internet of vehicles
    Al-Atawi, Abdullah A.
    WIRELESS NETWORKS, 2024, 30 (09) : 7581 - 7601
  • [36] Speed Optimization Control of a Permanent Magnet Synchronous Motor Based on TD3
    Hu, Zuolei
    Zhang, Yingjie
    Li, Ming
    Liao, Yuhua
    ENERGIES, 2025, 18 (04)
  • [37] Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
    Schaff, Charles
    Yunis, David
    Chakrabarti, Ayan
    Walter, Matthew R.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 9798 - 9805
  • [38] Learning Control for Air Hockey Striking using Deep Reinforcement Learning
    Taitler, Ayal
    Shimkin, Nahum
    2017 INTERNATIONAL CONFERENCE ON CONTROL, ARTIFICIAL INTELLIGENCE, ROBOTICS & OPTIMIZATION (ICCAIRO), 2017, : 22 - 27
  • [39] Artificial pancreas provides better diabetes control
    Skodvin, Torbjorn Oygard
    TIDSSKRIFT FOR DEN NORSKE LAEGEFORENING, 2023, 143 (01) : 36 - 36
  • [40] Nonlinear control strategies for 3-DOF control moment gyroscope using deep reinforcement learning
    Yan Xiong
    Siyuan Liu
    Jianxiang Zhang
    Mingxing Xu
    Liang Guo
    Neural Computing and Applications, 2024, 36 : 6441 - 6465