A comparative study for biomedical named entity recognition

被引:40
|
作者
Wang, Xu [1 ]
Yang, Chen [2 ]
Guan, Renchu [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Coll Earth Sci, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Biomedical named entity recognition; Machine learning; HMM; CRF; DICTIONARY; TEXT;
D O I
10.1007/s13042-015-0426-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With high-throughput technologies applied in biomedical research, the quantity of biomedical literatures grows exponentially. It becomes more and more important to quickly as well as accurately extract knowledge from manuscripts, especially in the era of big data. Named entity recognition (NER), aiming at identifying chunks of text that refers to specific entities, is essentially the initial step for information extraction. In this paper, we will review the three models of biomedical NER and two famous machine learning methods, Hidden Markov Model and Conditional Random Fields, which have been widely applied in bioinformatics. Based on these two methods, six excellent biomedical NER tools are compared in terms of programming language, feature sets, underlying mathematical methods, post-processing techniques and flowcharts. Experimental results of these tools against two widely used corpora, GENETAG and JNLPBA, are conducted. The comparison varies from different entity types to the overall performance. Furthermore, we put forward suggestions about the selection of Bio-NER tools for different applications.
引用
收藏
页码:373 / 382
页数:10
相关论文
共 50 条
  • [41] Named Entity Recognition and Relation Detection for Biomedical Information Extraction
    Perera, Nadeesha
    Dehmer, Matthias
    Emmert-Streib, Frank
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2020, 8
  • [42] HDCNN-CRF for Biomedical Text Named Entity Recognition
    Gao, Mingyuan
    Wei, Hao
    Chen, Fei
    Qu, Wen
    Lu, Mingyu
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 191 - 194
  • [43] Biomedical Named Entity Recognition with Tri-training learning
    Cai, YueHong
    Cheng, XianYi
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2178 - +
  • [44] Improving biomedical Named Entity Recognition with additional external contexts
    Tho, Bui Duc
    Nguyen, Minh -Tien
    Le, Dung Tien
    Ying, Lin -Lung
    Inoue, Shumpei
    Nguyen, Tri-Thanh
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 156
  • [45] A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
    Chen, Yanping
    Hu, Ying
    Li, Yijing
    Huang, Ruizhang
    Qin, Yongbin
    Wu, Yuefei
    Zheng, Qinghua
    Chen, Ping
    IEEE ACCESS, 2020, 8 : 214141 - 214152
  • [46] Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents
    Francis, Sumam
    Van Landeghem, Jordy
    Moens, Marie-Francine
    INFORMATION, 2019, 10 (08)
  • [47] Faster biomedical named entity recognition based on knowledge distillation
    Hu B.
    Geng T.
    Deng G.
    Duan L.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2021, 61 (09): : 936 - 942
  • [48] Transferring From Textual Entailment to Biomedical Named Entity Recognition
    Liang, Tingting
    Xia, Congying
    Zhao, Ziqiang
    Jiang, Yixuan
    Yin, Yuyu
    Yu, Philip S.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2577 - 2586
  • [49] BANNER: An executable survey of advances in biomedical named entity recognition
    Department of Computer Science and Engineering, Arizona State University, United States
    不详
    Pac. Symp. Biocomputing, PSB, (652-663):
  • [50] Hierarchical shared transfer learning for biomedical named entity recognition
    Zhaoying Chai
    Han Jin
    Shenghui Shi
    Siyan Zhan
    Lin Zhuo
    Yu Yang
    BMC Bioinformatics, 23