Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature

被引:47
|
作者
Yang, Zhihao [1 ]
Lin, Hongfei [1 ]
Li, Yanpeng [1 ]
机构
[1] Dalian Univ Technol, Dept Comp Sci & Engn, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
text mining; entity recognition; edit distance; conditional random fields;
D O I
10.1016/j.compbiolchem.2008.03.008
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Bio-entity name recognition is the key step for information extraction from biomedical literature. This paper presents a dictionary-based bio-entity name recognition approach. The approach expands the bio-entity name dictionary via the Abbreviation Definitions identifying algorithm, improves the recall rate through the improved edit distance algorithm and adopts some post-processing methods including Pre-keyword and Post-keyword expansion, Part of Speech expansion, merge of adjacent bio-entity names and the exploitation of the contextual cues to further improve the performance. Experiment results show that with this approach even an internal dictionary-based system could achieve a fairly good performance. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:287 / 291
页数:5
相关论文
共 50 条
  • [31] Gene name automatic recognition in biomedical literature
    Yang, Zhihao
    Lin, Hongfei
    Zhao, Jing
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 285 - 285
  • [32] Dictionary-based syntactic pattern recognition using tries
    Oommen, BJ
    Badr, G
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 251 - 259
  • [33] Sparse Dictionary-based Representation and Recognition of Action Attributes
    Qiu, Qiang
    Jiang, Zhuolin
    Chellappa, Rama
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 707 - 714
  • [34] Dictionary-Based Face and Person Recognition From Unconstrained Video
    Chen, Yi-Chen
    Patel, Vishal M.
    Phillips, P. Jonathon
    Chellappa, Rama
    IEEE ACCESS, 2015, 3 : 1783 - 1798
  • [35] Dictionary-Based Face Recognition Under Variable Lighting and Pose
    Patel, Vishal M.
    Wu, Tao
    Biswas, Soma
    Phillips, P. Jonathon
    Chellappa, Rama
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (03) : 954 - 965
  • [36] Exploiting the concept level feature for enhanced name entity recognition in Chinese EMRs
    Qing Zhao
    Dan Wang
    Jianqiang Li
    Faheem Akhtar
    The Journal of Supercomputing, 2020, 76 : 6399 - 6420
  • [37] Exploiting the concept level feature for enhanced name entity recognition in Chinese EMRs
    Zhao, Qing
    Wang, Dan
    Li, Jianqiang
    Akhtar, Faheem
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (08): : 6399 - 6420
  • [38] Biomedical Named Entity Recognition Based on MCBERT
    Wang, Sai
    Yilahun, Hankiz
    Hamdulla, Askar
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 247 - 252
  • [39] BoostER: A Performance Boosting Module for Biomedical Entity Recognition
    Pandey, Rahul
    Shamsuzzaman, Md
    Hasan, Sadid A.
    Sorower, Mohammad S.
    Khan, Md Abdullah Al Hafiz
    Liu, Joey
    Datla, Vivek
    Milosevic, Mladen
    Mankovich, Gabe
    van Ommering, Rob
    Dimitrova, Nevenka
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2554 - 2560
  • [40] Co-decision matrix framework for name entity recognition in biomedical text
    Wang, Haochang
    Li, Yu
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (04) : 412 - 423