Predicting future falls in older people using natural language processing of general practitioners' clinical notes

Cited by: 7
Authors
Dormosh, Noman [1, 2]
Schut, Martijn C. [1, 3, 4]
Heymans, Martijn W. [5, 6]
Maarsingh, Otto [7, 8]
Bouman, Jonathan [9]
van der Velde, Nathalie [10, 11]
Abu-Hanna, Ameen [1, 2]
Affiliations
[1] Univ Amsterdam, Amsterdam UMC, Dept Med Informat, Amsterdam, Netherlands
[2] Amsterdam Publ Hlth, Aging & Later Life & Methodol Amsterdam, Amsterdam, Netherlands
[3] Vrije Univ Amsterdam, Amsterdam UMC, Dept Clin Chem, Amsterdam, Netherlands
[4] Amsterdam Publ Hlth, Methodol & Qual Care, Amsterdam, Netherlands
[5] Vrije Univ Amsterdam, Amsterdam UMC, Dept Epidemiol & Data Sci, Amsterdam, Netherlands
[6] Amsterdam Publ Hlth, Methodol & Personalized Med, Amsterdam, Netherlands
[7] Vrije Univ Amsterdam, Amsterdam UMC, Dept Gen Practice, Amsterdam, Netherlands
[8] Amsterdam Publ Hlth, Aging & Later Life & Mental Hlth, Amsterdam, Netherlands
[9] Univ Amsterdam, Amsterdam UMC, Dept Gen Practice, Amsterdam, Netherlands
[10] Univ Amsterdam, Amsterdam UMC, Dept Internal Med, Sect Geriatr Med, Amsterdam, Netherlands
[11] Amsterdam Publ Hlth, Aging & Later Life, Amsterdam, Netherlands
Keywords
accidental falls; fall prediction; natural language processing; electronic health records; free text; topic modelling; older people; RISK-FACTORS; ADULTS; CARE; CONSEQUENCES; INFORMATION; CHALLENGES; INJURIES; MODELS;
DOI
10.1093/ageing/afad046
Chinese Library Classification
R592 [Geriatrics]; C [Social Sciences, General];
Discipline classification codes
03 ; 0303 ; 100203 ;
Abstract
Background: Falls in older people are common and morbid. Prediction models can help identify individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance of unstructured clinical notes and their incremental value over structured data in predicting falls.
Methods: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one using topics extracted from unstructured clinical notes (Topic-based) and one combining the clinical variables with the extracted topics (Combi). Model performance was assessed in terms of discrimination, using the area under the receiver operating characteristic curve (AUC), and calibration, using calibration plots. We used 10-fold cross-validation to validate the approach.
Results: Data of 35,357 individuals were analysed, of whom 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700-0.719), 0.685 (0.676-0.694) and 0.718 (0.708-0.727), respectively. All the models showed good calibration.
Conclusions: Unstructured clinical notes are a viable additional data source for developing and improving fall prediction models compared with traditional prediction models, but the clinical relevance remains limited.
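The abstract reports discrimination as the AUC. As a minimal illustration of what that number measures, the sketch below computes the AUC as the probability that a randomly chosen individual who fell receives a higher predicted risk than a randomly chosen individual who did not, with ties counted as one half. The function name `auc` and the toy data are assumptions made for illustration; this is not the authors' code, and it does not reproduce their LASSO models, topic extraction or cross-validation.

```python
def auc(labels, scores):
    """Pairwise AUC: fraction of (faller, non-faller) pairs in which the
    faller has the higher predicted risk; tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]  # scores of fallers
    neg = [s for y, s in zip(labels, scores) if y == 0]  # scores of non-fallers
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = fell during follow-up, 0 = did not.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
print(auc(toy_labels, toy_scores))  # 3 of 4 pairs are ordered correctly
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, which gives context for the reported values of roughly 0.69 to 0.72.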
Pages: 11
Related papers
50 in total
  • [21] Identifying Type II workplace violence from clinical notes using natural language processing
    Byon, Ha Do
    Harris, Catherine
    Crandall, Mary
    Song, Jiyoun
    Topaz, Maxim
    WORKPLACE HEALTH & SAFETY, 2023, 71 (10) : 484 - 490
  • [22] Severity score extraction from clinical notes using natural language processing: Applications to dermatology
    Kumar, Vikas
    Rasouliyan, Lawrence
    Althoff, Amanda G.
    Chang, Stella
    Long, Stacey
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 125 - 125
  • [23] Using natural language processing methods to classify use status of dietary supplements in clinical notes
    Yadan Fan
    Rui Zhang
    BMC Medical Informatics and Decision Making, 18
  • [24] Extracting cancer concepts from clinical notes using natural language processing: a systematic review
    Gholipour, Maryam
    Khajouei, Reza
    Amiri, Parastoo
    Gohari, Sadrieh Hajesmaeel
    Ahmadian, Leila
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [25] Extracting cancer concepts from clinical notes using natural language processing: a systematic review
    Maryam Gholipour
    Reza Khajouei
    Parastoo Amiri
    Sadrieh Hajesmaeel Gohari
    Leila Ahmadian
    BMC Bioinformatics, 24
  • [26] Prevalence of Sensitive Terms in Clinical Notes Using Natural Language Processing Techniques: Observational Study
    Lee, Jennifer
    Yang, Samuel
    Holland-Hall, Cynthia
    Sezgin, Emre
    Gill, Manjot
    Linwood, Simon
    Huang, Yungui
    Hoffman, Jeffrey
    JMIR MEDICAL INFORMATICS, 2022, 10 (06)
  • [27] Relation Detection to Identify Stroke Assertions from Clinical Notes Using Natural Language Processing
    Yang, Audrey
    Kamien, Sam
    Davoudi, Anahita
    Hwang, Sy
    Gandhi, Meet
    Urbanowicz, Ryan
    Mowery, Danielle
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 619 - 623
  • [28] FEASIBILITY OF USING NATURAL LANGUAGE PROCESSING TO EXTRACT CANCER PAIN SCORE FROM CLINICAL NOTES
    Naseri, Hossien
    RADIOTHERAPY AND ONCOLOGY, 2019, 139 : S65 - S65
  • [29] Using natural language processing to automatically extract cancer outcomes data from clinical notes
    Liptrot, Tom
    Karystianis, George
    Nenadic, Goran
    Keane, John
    Livsey, Jacqueline
    Barker-Hewitt, Matthew
    O'Hara, Catherine
    EUROPEAN JOURNAL OF CANCER CARE, 2015, 24 : 11 - 11
  • [30] Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review
    Sheikhalishahi, Seyedmostafa
    Miotto, Riccardo
    Dudley, Joel T.
    Lavelli, Alberto
    Rinaldi, Fabio
    Osmani, Venet
    JMIR MEDICAL INFORMATICS, 2019, 7 (02) : 15 - 32