Automated risk assessment of newly detected atrial fibrillation poststroke from electronic health record data using machine learning and natural language processing

被引:2
|
作者
Sung, Sheng-Feng [1 ,2 ]
Sung, Kuan-Lin [3 ]
Pan, Ru-Chiou [4 ]
Lee, Pei-Ju [5 ,6 ]
Hu, Ya-Han [7 ]
机构
[1] Ditmanson Med Fdn, Dept Internal Med, Div Neurol, Chiayi Christian Hosp, Chiayi, Taiwan
[2] Min Hwei Jr Coll Hlth Care Management, Dept Nursing, Tainan, Taiwan
[3] Natl Taiwan Univ, Sch Med, Taipei, Taiwan
[4] Ditmanson Med Fdn, Clin Data Ctr, Chiayi Christian Hosp, Dept Med Res, Chiayi, Taiwan
[5] Natl Chung Cheng Univ, Dept Informat Management, Minxiong Township, Chiayi County, Taiwan
[6] Natl Chung Cheng Univ, Inst Healthcare Informat Management, Minxiong Township, Chiayi County, Taiwan
[7] Natl Cent Univ, Dept Informat Management, Taoyuan, Taiwan
来源
关键词
atrial fibrillation; electronic health records; ischemic stroke; natural language processing; prediction; TRANSIENT ISCHEMIC ATTACK; TEXT CLASSIFICATION; FEATURE-SELECTION; VASCULAR EVENTS; STROKE CARE; SCORE; VALIDATION; RECURRENCE; PREDICTION; TAIWAN;
D O I
10.3389/fcvm.2022.941237
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BackgroundTimely detection of atrial fibrillation (AF) after stroke is highly clinically relevant, aiding decisions on the optimal strategies for secondary prevention of stroke. In the context of limited medical resources, it is crucial to set the right priorities of extended heart rhythm monitoring by stratifying patients into different risk groups likely to have newly detected AF (NDAF). This study aimed to develop an electronic health record (EHR)-based machine learning model to assess the risk of NDAF in an early stage after stroke. MethodsLinked data between a hospital stroke registry and a deidentified research-based database including EHRs and administrative claims data was used. Demographic features, physiological measurements, routine laboratory results, and clinical free text were extracted from EHRs. The extreme gradient boosting algorithm was used to build the prediction model. The prediction performance was evaluated by the C-index and was compared to that of the AS5F and CHASE-LESS scores. ResultsThe study population consisted of a training set of 4,064 and a temporal test set of 1,492 patients. During a median follow-up of 10.2 months, the incidence rate of NDAF was 87.0 per 1,000 person-year in the test set. On the test set, the model based on both structured and unstructured data achieved a C-index of 0.840, which was significantly higher than those of the AS5F (0.779, p = 0.023) and CHASE-LESS (0.768, p = 0.005) scores. ConclusionsIt is feasible to build a machine learning model to assess the risk of NDAF based on EHR data available at the time of hospital admission. Inclusion of information derived from clinical free text can significantly improve the model performance and may outperform risk scores developed using traditional statistical methods. Further studies are needed to assess the clinical usefulness of the prediction model.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data
    Tedeschi, Sara K.
    Cai, Tianrun
    He, Zeling
    Ahuja, Yuri
    Hong, Chuan
    Yates, Katherine A.
    Dahal, Kumar
    Xu, Chang
    Lyu, Houchen
    Yoshida, Kazuki
    Solomon, Daniel H.
    Cai, Tianxi
    Liao, Katherine P.
    ARTHRITIS CARE & RESEARCH, 2021, 73 (03) : 442 - 448
  • [42] Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data
    Gianfrancesco, Milena A.
    Tamang, Suzanne
    Yazdany, Jinoos
    Schmajuk, Gabriela
    JAMA INTERNAL MEDICINE, 2018, 178 (11) : 1544 - 1547
  • [43] Classifying Pseudogout Using Machine Learning Approaches with Electronic Health Record Data
    Tedeschi, Sara K.
    Cai, Tianrun
    He, Zeling
    Ahuja, Yuri
    Hong, Chuan
    Yates, Katherine
    Dahal, Kumar
    Xu, Chang
    Lyu, Houchen
    Yoshida, Kazuki
    Solomon, Daniel
    Cai, Tianxi
    Liao, Katherine
    ARTHRITIS & RHEUMATOLOGY, 2019, 71
  • [44] Identification of postoperative complications using electronic health record data and machine learning
    Bronsert, Michael
    Singh, Abhinav B.
    Henderson, William G.
    Hammermeister, Karl
    Meguid, Robert A.
    Colborn, Kathryn L.
    AMERICAN JOURNAL OF SURGERY, 2020, 220 (01): : 114 - 119
  • [45] Safety and Effectiveness of Oral Anticoagulants in Atrial Fibrillation: Real-World Insights Using Natural Language Processing and Machine Learning
    Cosin-Sales, Juan
    Sanchez, Manuel Anguita
    Suarez, Carmen
    Arias-Cabrales, Carlos
    Martinez-Sanchez, Luisa
    Arumi, Daniel
    de Cabo, Susana Fernandez
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (20)
  • [46] Oral anticoagulant treatment in atrial fibrillation: the AFIRMA real-world study using natural language processing and machine learning
    Cosin-Sales, J.
    Anguita, M.
    Suarez, C.
    Arias-Cabrales, C.
    Martinez-Sanchez, L.
    Group, S. R.
    Arumi, D.
    de Cabo, S. Fernandez
    REVISTA CLINICA ESPANOLA, 2024, 224 (07): : 428 - 436
  • [47] Prediction and evaluation of combination pharmacotherapy using natural language processing, machine learning and patient electronic health records
    Ding, Pingjian
    Pan, Yiheng
    Wang, Quanqiu
    Xu, Rong
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 133
  • [48] Textual data transformations using natural language processing for risk assessment
    Kamil, Mohammad Zaid
    Taleb-Berrouane, Mohammed
    Khan, Faisal
    Amyotte, Paul
    Ahmed, Salim
    RISK ANALYSIS, 2023, 43 (10) : 2033 - 2052
  • [49] Using Natural Language Processing and Machine Learning To Identify Gout Flares From Electronic Clinical Notes
    Zheng, Chengyi
    Rashid, Nazia
    Cheetham, T. Craig
    Wu, Yi-Lin
    Levy, Gerald D.
    ARTHRITIS AND RHEUMATISM, 2013, 65 : S856 - S857
  • [50] Using Natural Language Processing and Machine Learning to Identify Gout Flares From Electronic Clinical Notes
    Zheng, Chengyi
    Rashid, Nazia
    Wu, Yi-Lin
    Koblick, River
    Lin, Antony T.
    Levy, Gerald D.
    Cheetham, T. Craig
    ARTHRITIS CARE & RESEARCH, 2014, 66 (11) : 1740 - 1748