Clustering Based Active Learning for Biomedical Named Entity Recognition

被引:0
|
作者
Han, Xu [1 ]
Kwoh, Chee Keong [1 ]
Kim, Jung-jae [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, 50 Nanyang Ave, Singapore 639798, Singapore
[2] Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recognition and extraction of biomedical names is an essential task for the biomedical information extraction. However, the preparation of large annotated corpora hinders the training of the Named Entity Recognition (NER) systems. Active learning is reducing the needed manual annotation work in supervised learning task. In this work, we propose a novel clustering based active learning method for the biomedical NER task. We show that the underlying NER system using the proposed method outperforms those with other state of the art active learning methods, including density, Gibbs error and entropy based approaches, as well as the random selection. We compare variations of our proposed method and find the optimal design of the active learning method, which is to use the vector representation of named entities, and to select documents that are representative' and informative', as well as to use the Shared Nearest Neighbor (SNN) clustering approach. In particular, the optimal variant of the proposed method achieves a deficiency gain of 36.3% over the random selection.
引用
收藏
页码:1253 / 1260
页数:8
相关论文
共 50 条
  • [41] Named entity recognition using point prediction and active learning
    Kobayashi, Koga
    Wakabayashi, Kei
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 287 - 293
  • [42] Named Entity Recognition From Biomedical Data
    Refaat, Maged
    Rafea, Ahmed
    Gaballah, Nada
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 838 - 844
  • [43] A comparative study for biomedical named entity recognition
    Xu Wang
    Chen Yang
    Renchu Guan
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 373 - 382
  • [44] Efficient methods for biomedical named entity recognition
    Chan, Shing-Kit
    Lam, Wai
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 729 - 735
  • [45] Feature Importance for Biomedical Named Entity Recognition
    Huggard, Hamish
    Zhang, Aaron
    Zhang, Edmond
    Koh, Yun Sing
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 406 - 417
  • [46] A comparative study for biomedical named entity recognition
    Wang, Xu
    Yang, Chen
    Guan, Renchu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 373 - 382
  • [47] A Systematic Review on Biomedical Named Entity Recognition
    Kanimozhi, U.
    Manjula, D.
    DATA SCIENCE ANALYTICS AND APPLICATIONS, DASAA 2017, 2018, 804 : 19 - 37
  • [48] Biomedical Named Entity Recognition with Less Supervision
    Ghiasvand, Omid
    Kate, Rohit J.
    2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, : 495 - 495
  • [49] Named Entity Recognition System for the Biomedical Domain
    Sharma, Raghav
    Chauhan, Deependra
    Sharma, Raksha
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 837 - 840
  • [50] Integrated Deep Learning with Attention Layer Based Approach for Precise Biomedical Named Entity Recognition
    Pooja, H.
    Jagadeesh, Prabhudev M. P.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (06) : 704 - 713