Identifying Continuous Glucose Monitoring Data Using Machine Learning

被引:5
|
作者
Herrero, Pau [1 ,3 ]
Reddy, Monika [2 ]
Georgiou, Pantelis [1 ]
Oliver, Nick S. [2 ]
机构
[1] Imperial Coll London, Ctr Bioinspired Technol, Dept Elect & Elect Engn, London, England
[2] Fac Med Imperial Coll, Dept Med, Div Diabet Endocrinol & Metab, London, England
[3] Imperial Coll London, Ctr Bioinspired Technol, Dept Elect & Elect Engn, South Kensington Campus, London SW7 2AZ, England
关键词
Type; 1; diabetes; Continuous glucose monitoring; Cybersecurity; Data privacy; Machine learning; CHALLENGES; CARE;
D O I
10.1089/dia.2021.0498
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and Aims: The recent increase in wearable devices for diabetes care, and in particular the use of continuous glucose monitoring (CGM), generates large data sets and associated cybersecurity challenges. In this study, we demonstrate that it is possible to identify CGM data at an individual level by using standard machine learning techniques.Methods: The publicly available REPLACE-BG data set (NCT02258373) containing 226 adult participants with type 1 diabetes (T1D) wearing CGM over 6 months was used. A support vector machine (SVM) binary classifier aiming to determine if a CGM data stream belongs to an individual participant was trained and tested for each of the subjects in the data set. To generate the feature vector used for classification, 12 standard glycemic metrics were selected and evaluated at different time periods of the day (24 h, day, night, breakfast, lunch, and dinner). Different window lengths of CGM data (3, 7, 15, and 30 days) were chosen to evaluate their impact on the classification performance. A recursive feature selection method was employed to select the minimum subset of features that did not significantly degrade performance.Results: A total of 40 features were generated as a result of evaluating the glycemic metrics over the selected time periods (24 h, day, night, breakfast, lunch, and dinner). A window length of 15 days was found to perform the best in terms of accuracy (86.8% +/- 12.8%) and F1 score (0.86 +/- 0.16). The corresponding sensitivity and specificity were 85.7% +/- 19.5% and 87.9% +/- 17.5%, respectively. Through recursive feature selection, a subset of 9 features was shown to perform similarly to the 40 features.Conclusion: It is possible to determine with a relatively high accuracy if a CGM data stream belongs to an individual. The proposed approach can be used as a digital CGM "fingerprint" or for detecting glycemic changes within an individual, for example during intercurrent illness.
引用
收藏
页码:403 / 408
页数:6
相关论文
共 50 条
  • [31] Machine learning-based glucose prediction with use of continuous glucose and physical activity data: The Maastricht Study
    Foreman, Y. D.
    van Doorn, W. P. T.
    Schaper, N. C.
    Savelberg, H. H. C.
    Koster, A.
    van der Kallen, C. J. H.
    Wesselius, A.
    Eussen, S. J. P.
    Schram, M. T.
    Dagnelie, P. C.
    de Galan, B. E.
    Bekers, O.
    Stehouwer, C. D. A.
    Meex, S. J. R.
    Brouwers, M. C. G.
    DIABETOLOGIA, 2020, 63 (SUPPL 1) : S351 - S352
  • [32] Model Identification using Continuous Glucose Monitoring Data for Type 1 Diabetes
    Boiroux, Dimitri
    Hagdrup, Morten
    Mahmoudi, Zeinab
    Poulsen, Niels Kjolstad
    Madsen, Henrik
    Jorgensen, John Bagterp
    IFAC PAPERSONLINE, 2016, 49 (07): : 759 - 764
  • [33] A Machine Learning Test Data Set for Continuous Security Monitoring of Industrial Control Systems
    Francia, Guillermo A., III
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 1043 - 1048
  • [34] Continuous monitoring with machine learning and interactive data visualization: An application to a healthcare payroll process
    Zhang, Guangyue
    Atasoy, Hilal
    Vasarhelyi, Miklos A.
    INTERNATIONAL JOURNAL OF ACCOUNTING INFORMATION SYSTEMS, 2022, 46
  • [35] Developing an Algorithm for Identifying Mortality in MarketScan Claims Data Using Machine Learning
    Xie, Fenglong
    Zhao, Hong
    Yun, Huifeng
    Bernatsky, Sasha
    Curtis, Jeffrey R.
    ARTHRITIS & RHEUMATOLOGY, 2020, 72
  • [36] Identifying multiple sclerosis subtypes using unsupervised machine learning and MRI data
    Eshaghi, Arman
    Young, Alexandra L.
    Wijeratne, Peter A.
    Prados, Ferran
    Arnold, Douglas L.
    Narayanan, Sridar
    Guttmann, Charles R. G.
    Barkhof, Frederik
    Alexander, Daniel C.
    Thompson, Alan J.
    Chard, Declan
    Ciccarelli, Olga
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [37] Data Gap Modeling in Continuous Glucose Monitoring Sensor Data
    Drecogna, Martina
    Vettoretti, Martina
    Del Favero, Simone
    Facchinetti, Andrea
    Sparacino, Giovanni
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 4379 - 4382
  • [38] Discrete glucose profiles identified using continuous glucose monitoring data and their association with adverse pregnancy outcomes
    Battarbee, Ashley N.
    Sauer, Sara M.
    Sanusi, Ayodeji
    Fulcher, Isabel
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2024, 231 (01) : 122e1 - 122e9
  • [39] Data Driven Network Monitoring and Intrusion Detection using Machine Learning
    Williams, Brandon
    Dong, Xishuang
    Qian, Lijun
    2020 SEVENTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORK ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2020, : 262 - 268
  • [40] Using machine learning techniques for Data Quality Monitoring in CMS and ALICE
    Deja, Kamil
    7TH ANNUAL CONFERENCE ON LARGE HADRON COLLIDER PHYSICS, LHCP2019, 2019,