Clustering Based Active Learning for Biomedical Named Entity Recognition

Cited by: 0
Authors
Han, Xu [1 ]
Kwoh, Chee Keong [1 ]
Kim, Jung-jae [2 ]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, 50 Nanyang Ave, Singapore 639798, Singapore
[2] Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The recognition and extraction of biomedical names is an essential task in biomedical information extraction. However, the cost of preparing large annotated corpora hinders the training of Named Entity Recognition (NER) systems. Active learning reduces the manual annotation work needed in supervised learning tasks. In this work, we propose a novel clustering-based active learning method for the biomedical NER task. We show that the underlying NER system using the proposed method outperforms those using other state-of-the-art active learning methods, including density-based, Gibbs-error-based, and entropy-based approaches, as well as random selection. We compare variations of our proposed method and find that the optimal design is to use the vector representation of named entities, to select documents that are both 'representative' and 'informative', and to use the Shared Nearest Neighbor (SNN) clustering approach. In particular, the optimal variant of the proposed method achieves a deficiency gain of 36.3% over random selection.
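The abstract names three ingredients: vector representations, a selection criterion that balances representativeness with informativeness, and Shared Nearest Neighbor (SNN) similarity. The sketch below illustrates how such ingredients might fit together; it is not the authors' implementation. It assumes Euclidean distance, a Jarvis-Patrick style SNN similarity (two items must be in each other's neighbor lists), entropy as the informativeness measure, and a simple product as the combined score. All function names and the toy data are hypothetical.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a predicted label distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def knn(vectors, i, k):
    """Indices of the k nearest neighbours of item i (Euclidean distance)."""
    others = [j for j in range(len(vectors)) if j != i]
    others.sort(key=lambda j: math.dist(vectors[i], vectors[j]))
    return set(others[:k])

def snn_similarity(vectors, k):
    """Count shared k-nearest neighbours for mutual-neighbour pairs, else 0."""
    n = len(vectors)
    neigh = [knn(vectors, i, k) for i in range(n)]
    sim = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            # Assumption: only pairs that list each other as neighbours count.
            if i != j and j in neigh[i] and i in neigh[j]:
                sim[i][j] = len(neigh[i] & neigh[j])
    return sim

def select_document(vectors, label_probs, k=2):
    """Return the index with the highest representativeness * informativeness."""
    n = len(vectors)
    sim = snn_similarity(vectors, k)
    scores = []
    for i in range(n):
        representativeness = sum(sim[i]) / (n - 1)  # mean SNN similarity
        informativeness = entropy(label_probs[i])   # model uncertainty
        scores.append(representativeness * informativeness)
    return max(range(n), key=scores.__getitem__)

# Toy data: three nearby items and one outlier; items 1 and 3 are most uncertain.
vectors = [(0, 0), (0, 1), (1, 0), (10, 10)]
label_probs = [[0.9, 0.1], [0.5, 0.5], [0.8, 0.2], [0.5, 0.5]]
chosen = select_document(vectors, label_probs, k=2)
```

In this toy setup the outlier is as uncertain as item 1, but it shares no mutual neighbors with the other items, so its representativeness is zero and item 1 is chosen instead. A purely entropy-based criterion could not separate the two.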
Pages: 1253-1260
Page count: 8