Negation-based transfer learning for improving biomedical Named Entity Recognition and Relation Extraction

被引:12
|
作者
Fabregat, Hermenegildo [1 ,3 ]
Duque, Andres [1 ,2 ]
Martinez-Romo, Juan [1 ,2 ]
Araujo, Lourdes [1 ,2 ]
机构
[1] Univ Nacl Educ Distancia UNED ETS Ingn Informat, Juan del Rosal 16, Madrid 28040, Spain
[2] Escuela Nacl San IMIENS, Inst Mixto Invest, Madrid, Spain
[3] Avature Machine Learning, Madrid, Spain
关键词
Transfer learning; Named Entity Recognition; Negation detection; Relation Extraction; CORPUS;
D O I
10.1016/j.jbi.2022.104279
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objectives: Named Entity Recognition (NER) and Relation Extraction (RE) are two of the most studied tasks in biomedical Natural Language Processing (NLP). The detection of specific terms and entities and the relationships between them are key aspects for the development of more complex automatic systems in the biomedical field. In this work, we explore transfer learning techniques for incorporating information about negation into systems performing NER and RE. The main purpose of this research is to analyse to what extent the successful detection of negated entities in separate tasks helps in the detection of biomedical entities and their relationships.Methods: Three neural architectures are proposed in this work, all of them mainly based on Bidirectional Long Short-Term Memory (Bi-LSTM) networks and Conditional Random Fields (CRFs). While the first architecture is devoted to detecting triggers and scopes of negated entities in any domain, two specific models are developed for performing isolated NER tasks and joint NER and RE tasks in the biomedical domain. Then, weights related to negation detection learned by the first architecture are incorporated into those last models. Two different languages, Spanish and English, are taken into account in the experiments.Results: Performance of the biomedical models is analysed both when the weights of the neural networks are randomly initialized, and when weights from the negation detection model are incorporated into them. Improvements of around 3.5% of F-Measure in the English language and more than 7% in the Spanish language are achieved in the NER task, while the NER+RE task increases F-Measure scores by more than 13% for the NER submodel and around 2% for the RE submodel.Conclusions: The obtained results allow us to conclude that negation-based transfer learning techniques are appropriate for performing biomedical NER and RE tasks. These results highlight the importance of detecting negation for improving the identification of biomedical entities and their relationships. The explored techniques show robustness by maintaining consistent results and improvements across different tasks and languages.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A named entity relation extraction method based on bootstrapping
    He Tingting
    Xu Chao
    Li Jing
    Zhao Junzhe
    2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 758 - 763
  • [42] Faster biomedical named entity recognition based on knowledge distillation
    Hu B.
    Geng T.
    Deng G.
    Duan L.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2021, 61 (09): : 936 - 942
  • [43] Deep learning with word embeddings improves biomedical named entity recognition
    Habibi, Maryam
    Weber, Leon
    Neves, Mariana
    Wiegandt, David Luis
    Leser, Ulf
    BIOINFORMATICS, 2017, 33 (14) : I37 - I48
  • [44] Named Entity Relation Extraction Based on Multiple Features
    Li, Yeqing
    2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015, : 213 - 216
  • [45] A Kernel-Based Approach for Biomedical Named Entity Recognition
    Patra, Rakesh
    Saha, Sujan Kumar
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [46] Ensemble based Active Annotation for Biomedical Named Entity Recognition
    Verma, Mridula
    Sikdar, Utpal
    Saha, Sriparna
    Ekbal, Asif
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 973 - 978
  • [47] Towards Bootstrapping Biomedical Named Entity Recognition using Reinforcement Learning
    Wang, Dongsheng
    Fan, Hongjie
    Liu, Junfei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 778 - 784
  • [48] Two-stage learning algorithm for biomedical named entity recognition
    Che X.-J.
    Xu H.
    Pan M.-Y.
    Liu Q.-L.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (08): : 2380 - 2387
  • [49] Improving named entity recognition accuracy for gene and protein in biomedical text literature
    Tohidi, Hossein
    Ibrahim, Hamidah
    Murad, Masrah Azrifah Azmi
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 10 (03) : 239 - 268
  • [50] Research-based-named Entity Recognition Learning Text Biomedical Extraction by Adoption of Training Bidirectional Language Model (BiLM)
    Abed, Alshreef
    Jingling, Yuan
    Li, Lin
    Journal of Computers (Taiwan), 2020, 31 (04) : 157 - 173