Gene/protein name recognition based on support vector machine using dictionary as features

被引:33
|
作者
Mitsumori, T
Fation, S
Murata, M
Doi, K
Doi, H
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300101, Japan
[2] Natl Inst Informat & Commun Technol, Kyoto 6190289, Japan
关键词
D O I
10.1186/1471-2105-6-S1-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Automated information extraction from biomedical literature is important because a vast amount of biomedical literature has been published. Recognition of the biomedical named entities is the first step in information extraction. We developed an automated recognition system based on the SVM algorithm and evaluated it in Task 1.A of BioCreAtIvE, a competition for automated gene/protein name recognition. Results: In the work presented here, our recognition system uses the feature set of the word, the part-of-speech (POS), the orthography, the prefix, the suffix, and the preceding class. We call these features "internal resource features", i.e., features that can be found in the training data. Additionally, we consider the features of matching against dictionaries to be external resource features. We investigated and evaluated the effect of these features as well as the effect of tuning the parameters of the SVM algorithm. We found that the dictionary matching features contributed slightly to the improvement in the performance of the f-score. We attribute this to the possibility that the dictionary matching features might overlap with other features in the current multiple feature setting. Conclusion: During SVM learning, each feature alone had a marginally positive effect on system performance. This supports the fact that the SVM algorithm is robust on the high dimensionality of the feature vector space and means that feature selection is not required.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Vehicle color recognition using support vector machine
    Wang, Yunqiong
    You, Zhisheng
    Liu, Zhifang
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2004, 16 (05): : 701 - 706
  • [32] Online handwriting recognition using support vector machine
    Ahmad, AR
    Khalia, M
    Viard-Gaudin, C
    Poisson, E
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A311 - A314
  • [33] Speech Emotion Recognition Using Support Vector Machine
    Al Zoubi, Rouaa
    Turky, Ayad
    Foufou, Sebti
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 7, 2024, 1003 : 519 - 532
  • [34] Improving the performance of dictionary-based approaches in protein name recognition
    Tsuruoka, Y
    Tsujii, J
    JOURNAL OF BIOMEDICAL INFORMATICS, 2004, 37 (06) : 461 - 470
  • [35] Face Recognition using Ensemble Support Vector Machine
    Dey, Aniruddha
    Chowdhury, Shiladitya
    Ghosh, Manas
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 45 - 50
  • [36] Recognition of Plant Leaves Using Support Vector Machine
    Man, Qing-Kui
    Zheng, Chun-Hou
    Wang, Xiao-Feng
    Lin, Feng-Yan
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2008, 15 : 192 - +
  • [37] Recognition and classification of histones using support vector machine
    Bhasin, M
    Reinherz, EL
    Reche, PA
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (01) : 102 - 112
  • [38] Partial fingerprint recognition using support vector machine
    Vijayaprasad P.
    Sulaiman M.N.
    Mustapha N.
    Rahmat R.W.O.K.
    Information Technology Journal, 2010, 9 (04) : 844 - 848
  • [39] Sign Language Recognition Using Support Vector Machine
    Sinith, M. S.
    Kamal, Soorej G.
    Nisha, B.
    Nayana, S.
    Surendran, Kiran
    Jith, P. S.
    2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC), 2012, : 122 - 125
  • [40] EMG vowel recognition using a support vector machine
    Precision and Intelligence Laboratory, Tokyo Institute of Technology, Japan
    不详
    Int. Symp. Meas., Anal., Model. Hum. Funct., ISHF, (290-295):