Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data

被引：12

作者：

Park, Jaram ^{[1
]}

Kim, Jeong-Whun ^{[2
,3
]}

Ryu, Borim ^{[1
]}

Heo, Eunyoung ^{[1
]}

Jung, Se Young ^{[1
,4
]}

Yoo, Sooyoung ^{[1
]}

机构：

[1] Seoul Natl Univ, Off eHlth Res & Business, Bundang Hosp, 173Beon Gil 82, Seongnam 13620, South Korea

[2] Seoul Natl Univ, Dept Otorhinolaryngol, Bundang Hosp, Seongnam, South Korea

[3] Seoul Natl Univ, Dept Otorhinolaryngol, Coll Med, Seoul, South Korea

[4] Seoul Natl Univ, Dept Family Med, Bundang Hosp, Seongnam, South Korea

来源：

JOURNAL OF MEDICAL INTERNET RESEARCH | 2019年 / 21卷 / 02期

关键词：

health risk appraisal; risk; hypertension; chronic disease; clustering and classification; decision support systems; ELECTRONIC HEALTH RECORDS; MEDICATION ADHERENCE; RISK; HOSPITALIZATION; PREVALENCE; MORTALITY; OUTCOMES; DISEASE; MODELS;

D O I：

10.2196/11757

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in their achievement this goal due to their limitations, such as static characteristics, accessibility, and generalizability. Hypertension is one of the most important chronic diseases requiring management via the nationwide health maintenance program, and health care providers should inform patients about their risks of a complication caused by hypertension. Objective: Our goal was to develop and compare machine learning models predicting high-risk vascular diseases for hypertensive patients so that they can manage their blood pressure based on their risk level. Methods: We used a 12-year longitudinal dataset of the nationwide sample cohort, which contains the data of 514,866 patients and allows tracking of patients' medical history across all health care providers in Korea (N= 51,920). To ensure the generalizability of our models, we conducted an external validation using another national sample cohort dataset, comprising one million different patients, published by the National Health Insurance Service. From each dataset, we obtained the data of 74,535 and 59,738 patients with essential hypertension and developed machine learning models for predicting cardiovascular and cerebrovascular events. Six machine learning models were developed and compared for evaluating performances based on validation metrics. Results: Machine learning algorithms enabled us to detect high-risk patients based on their medical history. The long short-term memory-based algorithm outperformed in the within test (F1-score=. 772, external test F1-score=. 613), and the random forest-based algorithm of risk prediction showed better performance over other machine learning algorithms concerning generalization (within test F1-score=.757, external test F1-score=.705). Concerning the number of features, in the within test, the long short-term memory-based algorithms outperformed regardless of the number of features. However, in the external test, the random forest-based algorithm was the best, irrespective of the number of features it encountered. Conclusions: We developed and compared machine learning models predicting high-risk vascular diseases in hypertensive patients so that they may manage their blood pressure based on their risk level. By relying on the prediction model, a government can predict high-risk patients at the nationwide level and establish health care policies in advance.

引用

页数：13

共 50 条

[21] Logistic regression models for patient-level prediction based on massive observational data: Do we need all data?
John, Luis H.
Kors, Jan A.
Reps, Jenna M.
Ryan, Patrick B.
Rijnbeek, Peter R.
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 163
[22] A comparative patient-level prediction study in OMOP CDM: applicative potential and insights from synthetic data
Najia Ahmadi
Quang Vu Nguyen
Martin Sedlmayr
Markus Wolfien
Scientific Reports, 14
[23] A comparative patient-level prediction study in OMOP CDM: applicative potential and insights from synthetic data
Ahmadi, Najia
Nguyen, Quang Vu
Sedlmayr, Martin
Wolfien, Markus
SCIENTIFIC REPORTS, 2024, 14 (01)
[24] Cost-Effectiveness Analysis of Intensive Treatment of Systolic Hypertension: Results From Sprint Patient-Level Data
Zhang, Zugui
Bellows, Brandon K.
Bress, Adam P.
Moran, Andrew E.
Zhang, Yiyi
Derington, Catherine
Kolm, Paul
Weintraub, William S.
CIRCULATION, 2023, 148
[25] Association between cardio-cerebrovascular disease and systemic antipsoriatic therapy in psoriasis patients using population-based data: A nested case-control study
Kim, Bo Ri
Lee, Kun Hee
Kim, Jinseob
Kim, Jee Woo
Paik, Kyungho
Myung, Woojae
Lee, Hyewon
Choi, Chong Won
Youn, Sang Woong
JOURNAL OF DERMATOLOGY, 2023, 50 (11): : 1442 - 1449
[26] Patient-Level Fall Risk Prediction Using the Observational Medical Outcomes Partnership's Common Data Model: Pilot Feasibility Study
Jung, Hyesil
Yoo, Sooyoung
Kim, Seok
Heo, Eunjeong
Kim, Borham
Lee, Ho-Young
Hwang, Hee
JMIR MEDICAL INFORMATICS, 2022, 10 (03)
[27] Cardiovascular and cerebrovascular events among patients receiving omalizumab: Pooled analysis of patient-level data from 25 randomized, double-blind, placebo-controlled clinical trials
Iribarren, Carlos
Rothman, Kenneth J.
Bradley, Mary S.
Carrigan, Gillis
Eisner, Mark D.
Chen, Hubert
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2017, 139 (05) : 1678 - 1680
[28] Precision Drug Repurposing (PDR): Patient-level modeling and prediction combining foundational knowledge graph with biobank data
Oguztuzun, Cerag
Gao, Zhenxiang
Li, Hui
Xu, Rong
JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 163
[29] The impact of drug vintage on patient survival: A patient-level approach using Quebec's provincial health plan data
Lichtenberg, F.
Van Audenrode, M.
Grootendorst, P.
Latremouille-Viau, D.
Lefebvre, P.
VALUE IN HEALTH, 2008, 11 (03) : A4 - A5
[30] SELECTING A PATIENT CHARACTERISTICS INDEX FOR THE PREDICTION OF MEDICAL OUTCOMES USING ADMINISTRATIVE CLAIMS DATA
MELFI, C
HOLLEMAN, E
ARTHUR, D
KATZ, B
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (07) : 917 - 926

← 1 2 3 4 5 →