A comparative study for biomedical named entity recognition

被引：40

作者：

Wang, Xu ^{[1
]}

Yang, Chen ^{[2
]}

Guan, Renchu ^{[1
]}

机构：

[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China

[2] Jilin Univ, Coll Earth Sci, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2018年 / 9卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Biomedical named entity recognition; Machine learning; HMM; CRF; DICTIONARY; TEXT;

D O I：

10.1007/s13042-015-0426-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With high-throughput technologies applied in biomedical research, the quantity of biomedical literatures grows exponentially. It becomes more and more important to quickly as well as accurately extract knowledge from manuscripts, especially in the era of big data. Named entity recognition (NER), aiming at identifying chunks of text that refers to specific entities, is essentially the initial step for information extraction. In this paper, we will review the three models of biomedical NER and two famous machine learning methods, Hidden Markov Model and Conditional Random Fields, which have been widely applied in bioinformatics. Based on these two methods, six excellent biomedical NER tools are compared in terms of programming language, feature sets, underlying mathematical methods, post-processing techniques and flowcharts. Experimental results of these tools against two widely used corpora, GENETAG and JNLPBA, are conducted. The comparison varies from different entity types to the overall performance. Furthermore, we put forward suggestions about the selection of Bio-NER tools for different applications.

引用

页码：373 / 382

页数：10

共 50 条

[21] Towards the Named Entity Recognition Methods in Biomedical Field
Sniegula, Anna
Poniszewska-Maranda, Aneta
Chomatek, Lukasz
SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 375 - 387
[22] Various criteria in the evaluation of biomedical named entity recognition
Tsai, RTH
Wu, SH
Chou, WC
Lin, YC
He, D
Hsiang, J
Sung, TY
Hsu, WL
BMC BIOINFORMATICS, 2006, 7 (1) : 1 - 8
[23] Improving biomedical named entity recognition with syntactic information
Yuanhe Tian
Wang Shen
Yan Song
Fei Xia
Min He
Kenli Li
BMC Bioinformatics, 21
[24] Various criteria in the evaluation of biomedical named entity recognition
Richard Tzong-Han Tsai
Shih-Hung Wu
Wen-Chi Chou
Yu-Chun Lin
Ding He
Jieh Hsiang
Ting-Yi Sung
Wen-Lian Hsu
BMC Bioinformatics, 7
[25] Multiobjective Optimization for Biomedical Named Entity Recognition and Classification
Ekbal, Asif
Saha, Sriparna
Sikdar, Utpal Kumar
2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 206 - 213
[26] Classifier subset selection for biomedical named entity recognition
Dimililer, Nazife
Varoglu, Ekrem
Altincay, Hakan
APPLIED INTELLIGENCE, 2009, 31 (03) : 267 - 282
[27] MMBERT: a unified framework for biomedical named entity recognition
Fu, Lei
Weng, Zuquan
Zhang, Jiheng
Xie, Haihe
Cao, Yiqing
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (01) : 327 - 341
[28] Accurate Clinical and Biomedical Named Entity Recognition at Scale
Kocaman, Veysel
Talby, David
SOFTWARE IMPACTS, 2022, 13
[29] Comparison of named entity recognition methodologies in biomedical documents
Hye-Jeong Song
Byeong-Cheol Jo
Chan-Young Park
Jong-Dae Kim
Yu-Seop Kim
BioMedical Engineering OnLine, 17
[30] Classifier subset selection for biomedical named entity recognition
Nazife Dimililer
Ekrem Varoğlu
Hakan Altınçay
Applied Intelligence, 2009, 31 : 267 - 282

← 1 2 3 4 5 →