Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text

被引:0
|
作者
Fábio Lopes
César Teixeira
Hugo Gonçalo Oliveira
机构
[1] University of Coimbra,Center for Informatics and Systems, Department of Informatics Engineering
来源
关键词
Natural language processing; Machine learning; Named entity recognition; Portuguese clinical text;
D O I
暂无
中图分类号
学科分类号
摘要
Electronic Medical Records (EMRs) are written in an unstructured way, often using natural language. Information Extraction (IE) may be used for acquiring knowledge from such texts, including the automatic recognition of meaningful entities, through models for Named Entity Recognition (NER). However, while most work on the previous was made for English, this experience aimed at testing different methods in Portuguese text, more precisely, on the domain of Neurology, and take some conclusions. This paper comprised the comparison between Conditional Random Fields (CRF), bidirectional Long Short-term Memory - Conditional Random Fields (BiLSTM-CRF) and a BiLSTM-CRF with residual learning connections, using not only Portuguese texts from medical journals but also texts from the Coimbra Hospital and Universitary Centre (CHUC) Neurology Service. Furthermore, the performances of BiLSTM-CRF models using word embeddings (WEs) trained with clinical text and WEs trained with general language texts were compared. Deep learning models achieved F1-Scores of nearly 83% and 75%, respectively for relaxed and strict evaluation, on texts extracted from the medical journal. For texts collected from the Hospital, the same achieved F1-Scores of nearly 71% and 62%. This work concludes that deep learning models outperform the shallow learning models and that in-domain WEs get better results than general language WEs, even when the latter are trained with much more text than the former. Furthermore, the results show that it is possible to extract information from Hospital clinical texts with models trained with clinical cases extracted from medical journals, and thus openly available. Nevertheless, such results still require a healthcare technician to check if the information is well extracted.
引用
收藏
相关论文
共 50 条
  • [1] Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text
    Lopes, Fabio
    Teixeira, Cesar
    Oliveira, Hugo Goncalo
    JOURNAL OF MEDICAL SYSTEMS, 2020, 44 (04)
  • [2] Named Entity Recognition in Portuguese Neurology Text Using CRF
    Lopes, Fabio
    Teixeira, Cesar
    Oliveira, Hugo Goncalo
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2019, PT I, 2019, 11804 : 336 - 348
  • [3] A golden resource for named entity recognition in Portuguese
    Santos, Diana
    Cardoso, Nuno
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 69 - 79
  • [4] Contributions to Clinical Named Entity Recognition in Portuguese
    Lopes, Fabio
    Teixeira, Cesar
    Oliveira, Hugo Goncalo
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019), 2019, : 223 - 233
  • [5] A study of active learning methods for named entity recognition in clinical text
    Chen, Yukun
    Lasko, Thomas A.
    Mei, Qiaozhu
    Denny, Joshua C.
    Xu, Hua
    JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 : 11 - 18
  • [6] Named Entity Recognition: a Survey for the Portuguese Language
    Albuquerque, Hidelberg O.
    Souza, Ellen
    Gomes, Carlos
    Pinto, Matheus Henrique de C.
    Filho, Ricardo P. S.
    Costa, Rosimeire
    Lopes, Vinicius Teixeira de M.
    da Silva, Nadia F. F.
    de Carvalho, Andre C. P. L. F.
    Oliveira, Adriano L. I.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2023, (70): : 171 - 185
  • [7] Product named entity recognition in Chinese text
    Jun Zhao
    Feifan Liu
    Language Resources and Evaluation, 2008, 42 : 197 - 217
  • [8] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94
  • [9] Named Entity Recognition for Short Text Messages
    Ek, Tobias
    Kirkegaard, Camilla
    Jonsson, Hakan
    Nugues, Pierre
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 178 - 187
  • [10] Product named entity recognition in Chinese text
    Zhao, Jun
    Liu, Feifan
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (02) : 197 - 217