Extracting social determinants of health from electronic health records using natural language processing: a systematic review

Cited by: 101
Authors
Patra, Braja G. [1 ]
Sharma, Mohit M. [1 ]
Vekaria, Veer [1 ]
Adekkanattu, Prakash [2 ]
Patterson, Olga, V [3 ,4 ]
Glicksberg, Benjamin [5 ]
Lepow, Lauren A. [5 ]
Ryu, Euijung [6 ]
Biernacka, Joanna M. [6 ]
Furmanchuk, Al'ona [7 ]
George, Thomas J. [8 ]
Hogan, William [9 ]
Wu, Yonghui [8 ]
Yang, Xi [8 ]
Bian, Jiang [8 ]
Weissman, Myrna [10 ]
Wickramaratne, Priya [10 ]
Mann, J. John [10 ]
Olfson, Mark [10 ]
Campion, Thomas R., Jr. [1 ,2 ]
Weiner, Mark [1 ]
Pathak, Jyotishman [1 ]
Affiliations
[1] Weill Cornell Med, Dept Populat Hlth Sci, 425 E 61st St,Suite 301, New York, NY 10065 USA
[2] Weill Cornell Med, Informat Technol & Serv, New York, NY 10065 USA
[3] Univ Utah, Dept Internal Med, Div Epidemiol, Salt Lake City, UT 84112 USA
[4] US Dept Vet Affairs, Salt Lake City, UT USA
[5] Icahn Sch Med Mt Sinai, New York, NY 10029 USA
[6] Mayo Clin, Dept Quantitat Hlth Sci, Rochester, MN USA
[7] Northwestern Univ, Chicago, IL 60611 USA
[8] Univ Florida, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[9] Univ Florida, Coll Med, Dept Med, Div Hematol & Oncol, Gainesville, FL USA
[10] Columbia Univ, Vagelos Coll Phys & Surg, New York, NY USA
Keywords
social determinants of health; population health outcomes; electronic health records; natural language processing; information extraction; machine learning; PROBLEM OPIOID USE; BINGE-EATING DISORDER; AUTOMATED IDENTIFICATION; UNSTRUCTURED DATA; CARE; VALIDATION; ABUSE; RISK
DOI
10.1093/jamia/ocab170
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs.
Materials and Methods: A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review.
Results: Smoking status (n=27), substance use (n=21), homelessness (n=20), and alcohol use (n=15) are the most frequently studied SDoH categories. Homelessness (n=7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n=13), substance use (n=9), and alcohol use (n=9).
Conclusion: NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.
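
The abstract notes that homelessness and several less-studied SDoH categories are mostly identified with rule-based approaches. As a rough illustration of what such an approach can look like, the sketch below matches keyword patterns in a note and checks a short preceding window for a negation cue. Everything in it is an assumption made for illustration: the category names, the keyword lists, the negation cues, the 40-character window, and the function name `extract_sdoh` are hypothetical and are not taken from the review or from any of the studies it covers.

```python
import re

# Hypothetical keyword patterns, chosen for illustration only; they are not
# taken from the review or from any study it covers.
SDOH_PATTERNS = {
    "smoking_status": re.compile(
        r"\b(smok(?:er|es|ing)|tobacco|cigarettes?)\b", re.IGNORECASE
    ),
    "alcohol_use": re.compile(
        r"\b(alcohol|etoh|drinks? (?:daily|heavily))\b", re.IGNORECASE
    ),
    "homelessness": re.compile(
        r"\b(homeless(?:ness)?|living in (?:a )?shelter|no fixed address)\b",
        re.IGNORECASE,
    ),
}

# Assumed negation cues: a cue counts only if nothing but non-sentence-ending
# characters (at most 30) separates it from the end of the window.
NEGATION = re.compile(
    r"\b(no|not|denies|denied|never|without)\b[^.;]{0,30}$", re.IGNORECASE
)


def extract_sdoh(note: str) -> list[dict]:
    """Return one record per keyword match, flagged as negated or not."""
    findings = []
    for category, pattern in SDOH_PATTERNS.items():
        for match in pattern.finditer(note):
            # Inspect up to 40 characters preceding the match for a negation cue.
            window = note[max(0, match.start() - 40):match.start()]
            findings.append({
                "category": category,
                "text": match.group(0),
                "negated": bool(NEGATION.search(window)),
            })
    return findings


if __name__ == "__main__":
    example = "Patient denies tobacco use. Currently homeless, living in a shelter."
    for finding in extract_sdoh(example):
        print(finding)
```

A sketch like this trades recall for transparency: every decision traces back to a visible pattern, but phrasing outside the keyword lists is missed. That trade-off is one plausible reason machine learning is more common for heavily studied categories such as smoking status.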
Pages: 2716-2727
Page count: 12