Extracting social determinants of health from electronic health records using natural language processing: a systematic review

Cited by: 101
Authors
Patra, Braja G. [1]
Sharma, Mohit M. [1]
Vekaria, Veer [1]
Adekkanattu, Prakash [2]
Patterson, Olga V. [3, 4]
Glicksberg, Benjamin [5]
Lepow, Lauren A. [5]
Ryu, Euijung [6]
Biernacka, Joanna M. [6]
Furmanchuk, Al'ona [7]
George, Thomas J. [8]
Hogan, William [9]
Wu, Yonghui [8]
Yang, Xi [8]
Bian, Jiang [8]
Weissman, Myrna [10]
Wickramaratne, Priya [10]
Mann, J. John [10]
Olfson, Mark [10]
Campion, Thomas R., Jr. [1, 2]
Weiner, Mark [1]
Pathak, Jyotishman [1]
Affiliations
[1] Weill Cornell Med, Dept Populat Hlth Sci, 425 E 61st St,Suite 301, New York, NY 10065 USA
[2] Weill Cornell Med, Informat Technol & Serv, New York, NY 10065 USA
[3] Univ Utah, Dept Internal Med, Div Epidemiol, Salt Lake City, UT 84112 USA
[4] US Dept Vet Affairs, Salt Lake City, UT USA
[5] Icahn Sch Med Mt Sinai, New York, NY 10029 USA
[6] Mayo Clin, Dept Quantitat Hlth Sci, Rochester, MN USA
[7] Northwestern Univ, Chicago, IL 60611 USA
[8] Univ Florida, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[9] Univ Florida, Coll Med, Dept Med, Div Hematol & Oncol, Gainesville, FL USA
[10] Columbia Univ, Vagelos Coll Phys & Surg, New York, NY USA
Keywords
social determinants of health; population health outcomes; electronic health records; natural language processing; information extraction; machine learning; problem opioid use; binge-eating disorder; automated identification; unstructured data; care; validation; abuse; risk
DOI
10.1093/jamia/ocab170
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs.
Materials and Methods: A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review.
Results: Smoking status (n=27), substance use (n=21), homelessness (n=20), and alcohol use (n=15) are the most frequently studied SDoH categories. Homelessness (n=7) and other less-studied SDoH (e.g., education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n=13), substance use (n=9), and alcohol use (n=9).
Conclusion: NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.
Pages: 2716-2727
Page count: 12