Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text

Authors
Fábio Lopes
César Teixeira
Hugo Gonçalo Oliveira
Affiliations
[1] University of Coimbra, Center for Informatics and Systems, Department of Informatics Engineering
Keywords
Natural language processing; Machine learning; Named entity recognition; Portuguese clinical text
DOI
Not available
Abstract
Electronic Medical Records (EMRs) are written in an unstructured way, often using natural language. Information Extraction (IE) can be used to acquire knowledge from such texts, including the automatic recognition of meaningful entities through Named Entity Recognition (NER) models. However, while most previous work targeted English, this study tested different methods on Portuguese text, specifically in the domain of Neurology, and drew conclusions from the results. The paper compares Conditional Random Fields (CRF), bidirectional Long Short-Term Memory with Conditional Random Fields (BiLSTM-CRF), and a BiLSTM-CRF with residual learning connections, using not only Portuguese texts from medical journals but also texts from the Coimbra Hospital and Universitary Centre (CHUC) Neurology Service. Furthermore, it compares the performance of BiLSTM-CRF models using word embeddings (WEs) trained on clinical text with that of models using WEs trained on general-language text. Deep learning models achieved F1-scores of nearly 83% and 75% for relaxed and strict evaluation, respectively, on texts extracted from the medical journal. On texts collected from the hospital, the same models achieved F1-scores of nearly 71% and 62%. This work concludes that deep learning models outperform shallow learning models and that in-domain WEs yield better results than general-language WEs, even when the latter are trained on much more text than the former. The results also show that it is possible to extract information from hospital clinical texts with models trained on clinical cases extracted from medical journals, which are openly available. Nevertheless, such results still require a healthcare technician to check whether the information was extracted correctly.