Multi-step Transfer Learning in Natural Language Processing for the Health Domain

被引:0
|
作者
Manaka, Thokozile [1 ]
Van Zyl, Terence [2 ]
Kar, Deepak [3 ]
Wade, Alisha [4 ]
机构
[1] Univ Witwatersrand, Sch Comp Sci & Appl Math, Johannesburg, Gauteng, South Africa
[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, Gauteng, South Africa
[3] Univ Witwatersrand, Sch Phys, Johannesburg, Gauteng, South Africa
[4] Univ Witwatersrand, Sch Publ Hlth, MRC Wits Rural Publ Hlth & Hlth Transit Res Unit, Johannesburg, South Africa
关键词
Transfer learning; Verbal autopsy; Natural language processing; Text classification; Feature extraction; Fine tuning; MODEL;
D O I
10.1007/s11063-024-11526-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The restricted access to data in healthcare facilities due to patient privacy and confidentiality policies has led to the application of general natural language processing (NLP) techniques advancing relatively slowly in the health domain. Additionally, because clinical data is unique to various institutions and laboratories, there are not enough standards and conventions for data annotation. In places without robust death registration systems, the cause of death (COD) is determined through a verbal autopsy (VA) report. A non-clinician field agent completes a VA report using a set of standardized questions as guide to identify the symptoms of a COD. The narrative text of the VA report is used as a case study to examine the difficulties of applying NLP techniques to the healthcare domain. This paper presents a framework that leverages knowledge across multiple domains via two domain adaptation techniques: feature extraction and fine-tuning. These techniques aim to improve VA text representations for COD classification tasks in the health domain. The framework is motivated by multi-step learning, where a final learning task is realized via a sequence of intermediate learning tasks. The framework builds upon the strengths of the Bidirectional Encoder Representations from Transformers (BERT) and Embeddings from Language Models (ELMo) models pretrained on the general English and biomedical domains. These models are employed to extract features from the VA narratives. Our results demonstrate improved performance when initializing the learning of BERT embeddings with ELMo embeddings. The benefit of incorporating character-level information for learning word embeddings in the English domain, coupled with word-level information for learning word embeddings in the biomedical domain, is also evident.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Learning Multi-Step Reasoning by Solving Arithmetic Tasks
    Wang, Tianduo
    Lu, Wei
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1229 - 1238
  • [32] Evolutionary Optimization of Multi-step Dynamic Systems Learning
    Perez, Edgar Ademir Morales
    Iba, Hitoshi
    2022 8TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND ROBOTICS ENGINEERING (ICMRE 2022), 2022, : 152 - 156
  • [33] Multi-Step Learning to Search for Dynamic Environment Navigation
    Yu, Chung-Che
    Wang, Chieh-Chih
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (03) : 637 - 652
  • [34] Multi-step learning and underlying structure in statistical models
    Fraser, Maia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [35] Multi-step commodity forecasts using deep learning
    Bora, Siddhartha S.
    Katchova, Ani L.
    AGRICULTURAL FINANCE REVIEW, 2024, 84 (4/5) : 269 - 296
  • [36] TaskFusion: An Efficient Transfer Learning Architecture with Dual Delta Sparsity for Multi-Task Natural Language Processing
    Fan, Zichen
    Zhang, Qirui
    Abillama, Pierre
    Shoouri, Sara
    Lee, Changwoo
    Blaauw, David
    Kim, Hun-Seok
    Sylvester, Dennis
    PROCEEDINGS OF THE 2023 THE 50TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, ISCA 2023, 2023, : 62 - 75
  • [37] "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
    Hundt, Andrew
    Killeen, Benjamin
    Greene, Nicholas
    Wu, Hongtao
    Kwon, Heeyeon
    Paxton, Chris
    Hager, Gregory D.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6724 - 6731
  • [38] Multi-step wrought processing of TiAl-based alloys
    Fuchs, GE
    MATERIALS SCIENCE AND ENGINEERING A-STRUCTURAL MATERIALS PROPERTIES MICROSTRUCTURE AND PROCESSING, 1997, 240 : 584 - 591
  • [39] Multi-step wrought processing of TiAl-based alloys
    Fuchs, G.E.
    Materials Science and Engineering A, 1997, 239-240 : 584 - 591
  • [40] Machine Learning and Natural Language Processing in Mental Health: Systematic Review
    Le Glaz, Aziliz
    Haralambous, Yannis
    Kim-Dufor, Deok-Hee
    Lenca, Philippe
    Billot, Romain
    Ryan, Taylor C.
    Marsh, Jonathan
    DeVylder, Jordan
    Walter, Michel
    Berrouiguet, Sofian
    Lemey, Christophe
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (05)