Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data

Cited by: 12
Authors
Park, Jaram [1]
Kim, Jeong-Whun [2, 3]
Ryu, Borim [1]
Heo, Eunyoung [1]
Jung, Se Young [1, 4]
Yoo, Sooyoung [1]
Affiliations
[1] Seoul Natl Univ, Off eHlth Res & Business, Bundang Hosp, 173Beon Gil 82, Seongnam 13620, South Korea
[2] Seoul Natl Univ, Dept Otorhinolaryngol, Bundang Hosp, Seongnam, South Korea
[3] Seoul Natl Univ, Dept Otorhinolaryngol, Coll Med, Seoul, South Korea
[4] Seoul Natl Univ, Dept Family Med, Bundang Hosp, Seongnam, South Korea
Keywords
health risk appraisal; risk; hypertension; chronic disease; clustering and classification; decision support systems; ELECTRONIC HEALTH RECORDS; MEDICATION ADHERENCE; RISK; HOSPITALIZATION; PREVALENCE; MORTALITY; OUTCOMES; DISEASE; MODELS;
DOI
10.2196/11757
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)];
Abstract
Background: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in achieving this goal by limitations such as their static characteristics, accessibility, and generalizability. Hypertension is one of the most important chronic diseases requiring management via the nationwide health maintenance program, and health care providers should inform patients about their risk of complications caused by hypertension.

Objective: Our goal was to develop and compare machine learning models predicting high-risk vascular diseases for hypertensive patients so that they can manage their blood pressure based on their risk level.

Methods: We used a 12-year longitudinal dataset of the nationwide sample cohort, which contains the data of 514,866 patients and allows tracking of patients' medical history across all health care providers in Korea (N=51,920). To ensure the generalizability of our models, we conducted an external validation using another national sample cohort dataset, comprising one million different patients, published by the National Health Insurance Service. From these two datasets, we obtained the data of 74,535 and 59,738 patients with essential hypertension, respectively, and developed machine learning models for predicting cardiovascular and cerebrovascular events. Six machine learning models were developed, and their performance was compared using validation metrics.

Results: Machine learning algorithms enabled us to detect high-risk patients based on their medical history. The long short-term memory-based algorithm performed best in the within test (within test F1-score=.772, external test F1-score=.613), whereas the random forest-based risk prediction algorithm generalized better than the other machine learning algorithms (within test F1-score=.757, external test F1-score=.705). In the within test, the long short-term memory-based algorithm outperformed the others regardless of the number of features. In the external test, however, the random forest-based algorithm was the best, irrespective of the number of features.

Conclusions: We developed and compared machine learning models predicting high-risk vascular diseases in hypertensive patients so that they may manage their blood pressure based on their risk level. By relying on the prediction model, a government can identify high-risk patients at the nationwide level and establish health care policies in advance.
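The F1-scores reported in the abstract can be read as follows. This is a minimal sketch, not the authors' code: `f1_score` and `generalization_drop` are hypothetical helper names, the confusion counts are invented for illustration, and only the four F1 values in `reported` come from the abstract.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 is the harmonic mean of precision and recall."""
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def generalization_drop(within: float, external: float) -> float:
    """Loss in F1 when moving from the within test to the external test."""
    return round(within - external, 3)


# Hypothetical confusion counts (NOT from the paper), to show the formula.
example_f1 = f1_score(tp=70, fp=20, fn=30)

# F1-scores as reported in the abstract: (within test, external test).
reported = {
    "long short-term memory": (0.772, 0.613),
    "random forest": (0.757, 0.705),
}

# A smaller drop means the model transfers better to the external cohort.
drops = {name: generalization_drop(w, e) for name, (w, e) in reported.items()}
best_generalizer = min(drops, key=drops.get)
```

Under these definitions, the random forest loses less F1 between the two tests than the long short-term memory model, which matches the abstract's statement that it generalizes better.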
Pages: 13