Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data

被引:12
|
作者
Park, Jaram [1 ]
Kim, Jeong-Whun [2 ,3 ]
Ryu, Borim [1 ]
Heo, Eunyoung [1 ]
Jung, Se Young [1 ,4 ]
Yoo, Sooyoung [1 ]
机构
[1] Seoul Natl Univ, Off eHlth Res & Business, Bundang Hosp, 173Beon Gil 82, Seongnam 13620, South Korea
[2] Seoul Natl Univ, Dept Otorhinolaryngol, Bundang Hosp, Seongnam, South Korea
[3] Seoul Natl Univ, Dept Otorhinolaryngol, Coll Med, Seoul, South Korea
[4] Seoul Natl Univ, Dept Family Med, Bundang Hosp, Seongnam, South Korea
关键词
health risk appraisal; risk; hypertension; chronic disease; clustering and classification; decision support systems; ELECTRONIC HEALTH RECORDS; MEDICATION ADHERENCE; RISK; HOSPITALIZATION; PREVALENCE; MORTALITY; OUTCOMES; DISEASE; MODELS;
D O I
10.2196/11757
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in their achievement this goal due to their limitations, such as static characteristics, accessibility, and generalizability. Hypertension is one of the most important chronic diseases requiring management via the nationwide health maintenance program, and health care providers should inform patients about their risks of a complication caused by hypertension. Objective: Our goal was to develop and compare machine learning models predicting high-risk vascular diseases for hypertensive patients so that they can manage their blood pressure based on their risk level. Methods: We used a 12-year longitudinal dataset of the nationwide sample cohort, which contains the data of 514,866 patients and allows tracking of patients' medical history across all health care providers in Korea (N= 51,920). To ensure the generalizability of our models, we conducted an external validation using another national sample cohort dataset, comprising one million different patients, published by the National Health Insurance Service. From each dataset, we obtained the data of 74,535 and 59,738 patients with essential hypertension and developed machine learning models for predicting cardiovascular and cerebrovascular events. Six machine learning models were developed and compared for evaluating performances based on validation metrics. Results: Machine learning algorithms enabled us to detect high-risk patients based on their medical history. The long short-term memory-based algorithm outperformed in the within test (F1-score=. 772, external test F1-score=. 613), and the random forest-based algorithm of risk prediction showed better performance over other machine learning algorithms concerning generalization (within test F1-score=.757, external test F1-score=.705). Concerning the number of features, in the within test, the long short-term memory-based algorithms outperformed regardless of the number of features. However, in the external test, the random forest-based algorithm was the best, irrespective of the number of features it encountered. Conclusions: We developed and compared machine learning models predicting high-risk vascular diseases in hypertensive patients so that they may manage their blood pressure based on their risk level. By relying on the prediction model, a government can predict high-risk patients at the nationwide level and establish health care policies in advance.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] POLLEN EXPOSURE AND ASTHMA-RELATED HEALTHCARE RESOURCE UTILIZATION IN CHILDREN WITH ASTHMA: AN ANALYSIS OF PATIENT-LEVEL CLAIMS AND LINKED WEATHER DATA
    Packnett, E.
    Henriques, C.
    Irwin, D.
    VALUE IN HEALTH, 2019, 22 : S885 - S885
  • [32] The Impact of Drug Vintage on Patient Survival: A Patient-Level Analysis Using Quebec's Provincial Health Plan Data
    Lichtenberg, Frank R.
    Grootendorst, Paul
    Van Audenrode, Marc
    Latremouille-Viau, Dominick
    Lefebvre, Patrick
    VALUE IN HEALTH, 2009, 12 (06) : 847 - 856
  • [33] Fluconazole Prophylaxis for the Prevention of Candidiasis in Premature Infants: A Meta-analysis Using Patient-level Data
    Ericson, Jessica E.
    Kaufman, David A.
    Kicklighter, Stephen D.
    Bhatia, Jatinder
    Testoni, Daniela
    Gao, Jamie
    Smith, P. Brian
    Prather, Kristi O.
    Benjamin, Daniel K., Jr.
    CLINICAL INFECTIOUS DISEASES, 2016, 63 (05) : 604 - 610
  • [34] Aortic valve neocuspidization using the Ozaki technique: A meta-analysis of reconstructed patient-level data
    Mylonas, Konstantinos S.
    Tasoudis, Panagiotis T.
    Pavlopoulos, Dionysios
    Kanakis, Meletios
    Stavridis, George T.
    V. Avgerinos, Dimitrios
    AMERICAN HEART JOURNAL, 2023, 255 : 1 - 11
  • [35] Accurately Reflecting Uncertainty When Using Patient-Level Simulation Models to Extrapolate Clinical Trial Data
    Dakin, Helen A.
    Leal, Jose
    Briggs, Andrew
    Clarke, Philip
    Holman, Rury R.
    Gray, Alastair
    MEDICAL DECISION MAKING, 2020, 40 (04) : 460 - 473
  • [36] Cerclage for short cervix on ultrasonography - Meta-analysis of trials using individual patient-level data
    Berghella, V
    Odibo, AO
    To, MS
    Rust, OA
    Althuisius, SM
    OBSTETRICS AND GYNECOLOGY, 2005, 106 (01): : 181 - 189
  • [37] Identifying the limitations of cardiopulmonary exercise testing prior to esophagectomy using a pooled analysis of patient-level data
    Sivakumar, Jonathan
    Forshaw, Matthew J.
    Lam, Stephen
    Peters, Christopher J.
    Allum, William H.
    Whibley, Jessica
    Sinclair, Rhona C. F.
    Snowden, Christopher P.
    Hii, Michael W.
    Sivakumar, Harry
    Read, Matthew
    DISEASES OF THE ESOPHAGUS, 2022, 35 (11)
  • [38] Descriptive Correlates of Urban Pediatric Violent Injury Using Emergency Medical Service Patient-Level Data
    Walthall, Jennifer D. H.
    Burgess, Aaron
    Weinstein, Elizabeth
    Miramonti, Charles
    Arkins, Thomas
    Wiehe, Sarah
    PEDIATRIC EMERGENCY CARE, 2018, 34 (02) : 69 - 75
  • [39] TREATMENT SWITCHING ANALYSES ON PATIENT-LEVEL DATA TO INFORM TRANSFERABILITY OF A TRIAL-BASED HEALTH ECONOMIC ANALYSIS IN METASTATIC COLORECTAL CANCER: A CASE STUDY USING PATIENT-LEVEL DATA FROM THE FIRE-3 TRIAL
    van Oostrum, I
    Schlichting, M.
    Stintzing, S.
    Heeg, B.
    Heinemann, V
    Pescott, C.
    VALUE IN HEALTH, 2020, 23 : S8 - S8
  • [40] POPCORN: A web service for individual PrognOsis prediction based on multi-center clinical data CollabORatioN without patient-level data sharing
    Tian, Yu
    Shang, Yong
    Tong, Dan-Yang
    Chi, Sheng-Qiang
    Li, Jun
    Kong, Xiang-Xing
    Ding, Ke-Feng
    Li, Jing-Song
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 86 : 1 - 14