Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature

Cited by: 47
Authors
Yang, Zhihao [1]
Lin, Hongfei [1]
Li, Yanpeng [1]
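Affiliations
[1] Dalian University of Technology, Department of Computer Science and Engineering, Dalian 116023, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords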
text mining; entity recognition; edit distance; conditional random fields;
DOI
10.1016/j.compbiolchem.2008.03.008
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Bio-entity name recognition is a key step in information extraction from biomedical literature. This paper presents a dictionary-based approach to bio-entity name recognition. The approach expands the bio-entity name dictionary with an abbreviation-definition identification algorithm, improves recall with an improved edit distance algorithm, and applies several post-processing methods to further improve performance: pre-keyword and post-keyword expansion, part-of-speech expansion, merging of adjacent bio-entity names, and exploitation of contextual cues. Experimental results show that, with this approach, even an internal dictionary-based system can achieve fairly good performance. (C) 2008 Elsevier Ltd. All rights reserved.
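The abstract does not describe how the edit distance algorithm is improved, so the following is only a minimal sketch of the general idea of approximate dictionary lookup, using the standard Levenshtein distance rather than the paper's variant. The function names `edit_distance` and `fuzzy_lookup`, the `max_ratio` threshold, and the sample dictionary entries are all illustrative assumptions, not taken from the paper.

```python
from typing import Iterable, List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Standard Levenshtein distance with unit costs (not the paper's improved variant)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def fuzzy_lookup(candidate: str,
                 dictionary: Iterable[str],
                 max_ratio: float = 0.2) -> List[Tuple[str, int]]:
    """Return dictionary entries whose length-normalized distance to the
    candidate is at most max_ratio (a hypothetical threshold), closest first."""
    cand = candidate.lower()
    hits = []
    for entry in dictionary:
        name = entry.lower()
        dist = edit_distance(cand, name)
        if dist / max(len(cand), len(name), 1) <= max_ratio:
            hits.append((entry, dist))
    return sorted(hits, key=lambda hit: hit[1])


# Illustrative usage: a spelling variant still matches its dictionary entry.
matches = fuzzy_lookup("NF-kappa B", ["NF-kappaB", "IL-2"])
```

Allowing near matches in this way is what raises recall over exact string matching: a spelling variant found in text can still be linked to its dictionary entry, at the cost of some precision that post-processing would need to recover.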
Pages: 287-291
Page count: 5