Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text

Cited by: 0
Authors
Fábio Lopes
César Teixeira
Hugo Gonçalo Oliveira
Affiliations
[1] University of Coimbra, Center for Informatics and Systems, Department of Informatics Engineering
Keywords
Natural language processing; Machine learning; Named entity recognition; Portuguese clinical text
DOI
Not available
Abstract
Electronic Medical Records (EMRs) are written in an unstructured way, often in natural language. Information Extraction (IE) can be used to acquire knowledge from such texts, including the automatic recognition of meaningful entities through Named Entity Recognition (NER) models. However, while most previous work on this task targeted English, this experiment aimed to test different methods on Portuguese text, specifically in the domain of Neurology, and to draw conclusions from the results. This paper compares Conditional Random Fields (CRF), bidirectional Long Short-Term Memory - Conditional Random Fields (BiLSTM-CRF), and a BiLSTM-CRF with residual learning connections, using not only Portuguese texts from medical journals but also texts from the Coimbra Hospital and Universitary Centre (CHUC) Neurology Service. Furthermore, it compares the performance of BiLSTM-CRF models using word embeddings (WEs) trained on clinical text with those using WEs trained on general-language text. Deep learning models achieved F1-scores of nearly 83% and 75% under relaxed and strict evaluation, respectively, on texts extracted from the medical journal. On texts collected from the hospital, the same models achieved F1-scores of nearly 71% and 62%. This work concludes that deep learning models outperform shallow learning models and that in-domain WEs yield better results than general-language WEs, even when the latter are trained on much more text than the former. Furthermore, the results show that it is possible to extract information from hospital clinical texts with models trained on clinical cases extracted from medical journals, which are openly available. Nevertheless, such results still require a healthcare technician to check whether the information was extracted correctly.
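The abstract reports F1-scores under both relaxed and strict evaluation but does not define the two modes. The sketch below illustrates one common reading, which is an assumption here and not necessarily the authors' protocol: strict counts a predicted entity only when its boundaries and type match a gold entity exactly, while relaxed also accepts any overlapping span of the same type. The `Span` layout, the function names, and the entity labels `Disease` and `Drug` are hypothetical.

```python
from typing import List, Tuple

# Hypothetical span layout: (start_token, end_token_exclusive, entity_type).
Span = Tuple[int, int, str]


def overlaps(a: Span, b: Span) -> bool:
    """True when the two token ranges share at least one token."""
    return a[0] < b[1] and b[0] < a[1]


def matches(a: Span, b: Span, relaxed: bool) -> bool:
    """Strict: identical boundaries and type. Relaxed: same type and any overlap."""
    if a[2] != b[2]:
        return False
    return overlaps(a, b) if relaxed else (a[0], a[1]) == (b[0], b[1])


def f1_score(gold: List[Span], pred: List[Span], relaxed: bool = False) -> float:
    """Entity-level F1 under the assumed strict or relaxed matching rule."""
    # Precision counts predictions that match some gold entity;
    # recall counts gold entities that are matched by some prediction.
    matched_pred = sum(any(matches(p, g, relaxed) for g in gold) for p in pred)
    matched_gold = sum(any(matches(p, g, relaxed) for p in pred) for g in gold)
    precision = matched_pred / len(pred) if pred else 0.0
    recall = matched_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data: the first prediction covers only part of the gold "Disease" span.
gold = [(0, 2, "Disease"), (5, 6, "Drug")]
pred = [(0, 1, "Disease"), (5, 6, "Drug")]
print(f1_score(gold, pred))                # strict
print(f1_score(gold, pred, relaxed=True))  # relaxed
```

Under this reading, relaxed scores are never lower than strict ones, which is consistent with the gap between the two figures reported in the abstract.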
Related papers
50 in total
  • [41] Comparing Annotated Datasets for Named Entity Recognition in English Literature
    Ivanova, Rositsa V.
    Kirrane, Sabrina
    van Erp, Marieke
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3788 - 3797
  • [42] Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese
    Freitas, Claudia
    Mota, Cristina
    Santos, Diana
    Oliveira, Hugo Goncalo
    Carvalho, Paula
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3630 - 3637
  • [43] Chinese Named Entity Recognition Methods Combined with Entity Boundary Cues
    Huang, Rong
    Chen, Yanping
    Hu, Ying
    Huang, Ruizhang
    Qin, Yongbin
    Computer Engineering and Applications, 2024, 60 (06) : 199 - 206
  • [44] Applying Deep Neural Networks to Named Entity Recognition in Portuguese Texts
    Fernandes, Ivo
    Cardoso, Henrique Lopes
    Oliveira, Eugenio
    2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 284 - 289
  • [45] Study of Named Entity Recognition methods in biomedical field
    Sniegula, Anna
    Poniszewska-Maranda, Aneta
    Chomatek, Lukasz
    10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 260 - 265
  • [46] Towards the Named Entity Recognition Methods in Biomedical Field
    Sniegula, Anna
    Poniszewska-Maranda, Aneta
    Chomatek, Lukasz
    SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 375 - 387
  • [47] Biomedical named entity recognition based on recurrent neural networks with different extended methods
    Song, Dingxin
    Li, Lishuang
    Jin, Liuke
    Huang, Degen
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 16 (01) : 17 - 31
  • [48] Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
    Hu, Xuming
    Jiang, Yong
    Liu, Aiwei
    Huang, Zhongqiang
    Xie, Pengjun
    Huang, Fei
    Wen, Lijie
    Yu, Philip S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9072 - 9087
  • [49] Analysis of Different Supervised Techniques for Named Entity Recognition
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT I, 2019, 1075 : 184 - 195
  • [50] Three different models for named entity recognition in Bengali
    Ekbal, Asif
    PROGRESS IN PATTERN RECOGNITION, 2007, : 161 - 170