A comparative study for biomedical named entity recognition

Cited by: 40
Authors
Wang, Xu [1]
Yang, Chen [2]
Guan, Renchu [1]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Coll Earth Sci, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Biomedical named entity recognition; Machine learning; HMM; CRF; DICTIONARY; TEXT;
DOI
10.1007/s13042-015-0426-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
With high-throughput technologies applied in biomedical research, the volume of biomedical literature grows exponentially. It is becoming increasingly important to extract knowledge from manuscripts quickly and accurately, especially in the era of big data. Named entity recognition (NER), which aims to identify chunks of text that refer to specific entities, is essentially the initial step of information extraction. In this paper, we review the three models of biomedical NER and two well-known machine learning methods, the Hidden Markov Model (HMM) and Conditional Random Fields (CRF), which have been widely applied in bioinformatics. Based on these two methods, six excellent biomedical NER tools are compared in terms of programming language, feature sets, underlying mathematical methods, post-processing techniques, and flowcharts. Experimental results for these tools on two widely used corpora, GENETAG and JNLPBA, are reported. The comparison ranges from individual entity types to overall performance. Furthermore, we put forward suggestions on selecting Bio-NER tools for different applications.
Pages: 373-382
Page count: 10
Related papers
50 in total
  • [21] Towards the Named Entity Recognition Methods in Biomedical Field
    Sniegula, Anna
    Poniszewska-Maranda, Aneta
    Chomatek, Lukasz
    SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 375 - 387
  • [22] Various criteria in the evaluation of biomedical named entity recognition
    Tsai, RTH
    Wu, SH
    Chou, WC
    Lin, YC
    He, D
    Hsiang, J
    Sung, TY
    Hsu, WL
    BMC BIOINFORMATICS, 2006, 7 (1) : 1 - 8
  • [23] Improving biomedical named entity recognition with syntactic information
    Yuanhe Tian
    Wang Shen
    Yan Song
    Fei Xia
    Min He
    Kenli Li
    BMC Bioinformatics, 21
  • [24] Various criteria in the evaluation of biomedical named entity recognition
    Richard Tzong-Han Tsai
    Shih-Hung Wu
    Wen-Chi Chou
    Yu-Chun Lin
    Ding He
    Jieh Hsiang
    Ting-Yi Sung
    Wen-Lian Hsu
    BMC Bioinformatics, 7
  • [25] Multiobjective Optimization for Biomedical Named Entity Recognition and Classification
    Ekbal, Asif
    Saha, Sriparna
    Sikdar, Utpal Kumar
    2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 206 - 213
  • [26] Classifier subset selection for biomedical named entity recognition
    Dimililer, Nazife
    Varoglu, Ekrem
    Altincay, Hakan
    APPLIED INTELLIGENCE, 2009, 31 (03) : 267 - 282
  • [27] MMBERT: a unified framework for biomedical named entity recognition
    Fu, Lei
    Weng, Zuquan
    Zhang, Jiheng
    Xie, Haihe
    Cao, Yiqing
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (01) : 327 - 341
  • [28] Accurate Clinical and Biomedical Named Entity Recognition at Scale
    Kocaman, Veysel
    Talby, David
    SOFTWARE IMPACTS, 2022, 13
  • [29] Comparison of named entity recognition methodologies in biomedical documents
    Hye-Jeong Song
    Byeong-Cheol Jo
    Chan-Young Park
    Jong-Dae Kim
    Yu-Seop Kim
    BioMedical Engineering OnLine, 17
  • [30] Classifier subset selection for biomedical named entity recognition
    Nazife Dimililer
    Ekrem Varoğlu
    Hakan Altınçay
    Applied Intelligence, 2009, 31 : 267 - 282