Identification of Mammalian Enzymatic Proteins Based on Sequence-Derived Features and Species-Specific Scheme

被引:7
|
作者
Chai, Haiting [1 ]
Zhang, Jian [2 ]
机构
[1] Univ Glasgow, Coll Med Vet & Life Sci, Glasgow G12 8QQ, Lanark, Scotland
[2] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Enzymatic proteins; species-specific; sequence-based; feature selection; REPLACEMENT THERAPY; ENZYMES; CLASSIFICATION; INSUFFICIENCY;
D O I
10.1109/ACCESS.2018.2798284
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enzymatic proteins (EPs) are widely distributed in organisms and cells and implicated in biochemical processes. Without these proteins, most biochemical reactions slowly occur at mild temperatures and pressures in living bodies. Given the wide application of these proteins in drug discovery and disease therapy, they should be accurately identified, but specific methods have yet to be reported to determine EPs from primary sequences. To achieve this, in this paper, we propose a novel method for predicting mammalian EPs. We collect a series of sequence-based features observed in EPs and perform detailed analyses to investigate the intrinsic properties of enzymatic and non-EPs. To remove redundant features and select an optimal feature subset, we introduce Fisher Markov selector and incremental feature selection. Based on the optimal feature subset, our method achieves the area under the curve values of 0.731, 0.820, and 0.822 on three training datasets using fivefold cross validation. Our strategy also shows a good generalization capability on independent testing datasets. We further compare the differences between our species-specific and universal models, which confirm the effectiveness of introducing the species-specific scheme. We believe that our method is useful for biomedical research on EPs. Our proposed method is implemented in a user-friendly Web server named predict EPs, which is freely available for academic use at http://www.inforstation.com/webservers/PEP/.
引用
收藏
页码:8452 / 8458
页数:7
相关论文
共 50 条
  • [31] Species-specific and mutant MWFE proteins - Their effect on the assembly of a functional mammalian mitochondrial complex I
    Yadava, N
    Potluri, P
    Smith, EN
    Bisevac, A
    Scheffler, IE
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (24) : 21221 - 21230
  • [32] Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme
    Zhang, Jian
    Chen, Wenhan
    Sun, Pingping
    Zhao, Xiaowei
    Ma, Zhiqiang
    BIODATA MINING, 2015, 8
  • [33] Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme
    Jian Zhang
    Wenhan Chen
    Pingping Sun
    Xiaowei Zhao
    Zhiqiang Ma
    BioData Mining, 8
  • [34] Identification of sex, age and species-specific proteins on the surface of the harpacticoid copepod Tigriopus japonicus
    J. H. Ting
    L. S. Kelly
    T. W. Snell
    Marine Biology, 2000, 137 : 31 - 37
  • [35] Identification of sex, age and species-specific proteins on the surface of the harpacticoid copepod Tigriopus japonicus
    Ting, JH
    Kelly, LS
    Snell, TW
    MARINE BIOLOGY, 2000, 137 (01) : 31 - 37
  • [36] IDENTIFICATION OF SPECIES-SPECIFIC, NON-CROSS-REACTIVE PROTEINS OF BORRELIA-BURGDORFERI
    SAYAHTAHERIALTAIE, S
    MEIER, FA
    DALTON, HP
    DIAGNOSTIC MICROBIOLOGY AND INFECTIOUS DISEASE, 1993, 16 (01) : 43 - 51
  • [37] MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
    Kajitani, Rei
    Noguchi, Hideki
    Gotoh, Yasuhiro
    Ogura, Yoshitoshi
    Yoshimura, Dai
    Okuno, Miki
    Toyoda, Atsushi
    Kuwahara, Tomomi
    Hayashi, Tetsuya
    Itoh, Takehiko
    NUCLEIC ACIDS RESEARCH, 2021, 49 (22)
  • [38] GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features
    Hasan, Md. Mehedi
    Kurata, Hiroyuki
    PLOS ONE, 2018, 13 (10):
  • [39] Species-specific model based on sequence and structural information for ubiquitination sites prediction
    Li, Weimin
    Chen, Nan
    Wang, Jie
    Luo, Yin
    Liu, Huazhong
    Ding, Jihong
    Jin, Qun
    JOURNAL OF MOLECULAR BIOLOGY, 2024, 436 (22)
  • [40] A systematic identification of species-specific protein succinylation sites using joint element features information
    Hasan, Md Mehedi
    Khatun, Mst Shamima
    Mollah, Md Nurul Haque
    Yong, Cao
    Guo, Dianjing
    INTERNATIONAL JOURNAL OF NANOMEDICINE, 2017, 12 : 1 - 13