Identification of Mammalian Enzymatic Proteins Based on Sequence-Derived Features and Species-Specific Scheme

被引:7
|
作者
Chai, Haiting [1 ]
Zhang, Jian [2 ]
机构
[1] Univ Glasgow, Coll Med Vet & Life Sci, Glasgow G12 8QQ, Lanark, Scotland
[2] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Enzymatic proteins; species-specific; sequence-based; feature selection; REPLACEMENT THERAPY; ENZYMES; CLASSIFICATION; INSUFFICIENCY;
D O I
10.1109/ACCESS.2018.2798284
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enzymatic proteins (EPs) are widely distributed in organisms and cells and implicated in biochemical processes. Without these proteins, most biochemical reactions slowly occur at mild temperatures and pressures in living bodies. Given the wide application of these proteins in drug discovery and disease therapy, they should be accurately identified, but specific methods have yet to be reported to determine EPs from primary sequences. To achieve this, in this paper, we propose a novel method for predicting mammalian EPs. We collect a series of sequence-based features observed in EPs and perform detailed analyses to investigate the intrinsic properties of enzymatic and non-EPs. To remove redundant features and select an optimal feature subset, we introduce Fisher Markov selector and incremental feature selection. Based on the optimal feature subset, our method achieves the area under the curve values of 0.731, 0.820, and 0.822 on three training datasets using fivefold cross validation. Our strategy also shows a good generalization capability on independent testing datasets. We further compare the differences between our species-specific and universal models, which confirm the effectiveness of introducing the species-specific scheme. We believe that our method is useful for biomedical research on EPs. Our proposed method is implemented in a user-friendly Web server named predict EPs, which is freely available for academic use at http://www.inforstation.com/webservers/PEP/.
引用
收藏
页码:8452 / 8458
页数:7
相关论文
共 50 条
  • [21] IDENTIFICATION OF SPECIES-SPECIFIC AND GENDER-SPECIFIC PROTEINS AND GLYCOPROTEINS OF 3 HUMAN SCHISTOSOMES
    ARONSTEIN, WS
    STRAND, M
    JOURNAL OF PARASITOLOGY, 1983, 69 (06) : 1006 - 1017
  • [22] A neural network learning approach for improving the prediction of residue depth based on sequence-derived features
    Yan, Renxiang
    Wang, Xiaofeng
    Xu, Weiming
    Cai, Weiwen
    Lin, Juan
    Li, Jian
    Song, Jiangning
    RSC ADVANCES, 2016, 6 (72): : 67729 - 67738
  • [23] TYLER, a fast method that accurately predicts cyclin-dependent proteins by using computation-based motifs and sequence-derived features
    Zhang, Jian
    Liang, Xingchen
    Zhou, Feng
    Li, Bo
    Li, Yanling
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (05) : 6410 - 6429
  • [24] Identification of protein functions using a machine-learning approach based on sequence-derived properties
    Lee, Bum Ju
    Shin, Moon Sun
    Oh, Young Joon
    Oh, Hae Seok
    Ryu, Keun Ho
    PROTEOME SCIENCE, 2009, 7
  • [25] Long extrachromosomal circular DNA identification by fusing sequence-derived features of physicochemical properties and nucleotide distribution patterns
    Abbasi, Ahtisham Fazeel
    Asim, Muhammad Nabeel
    Ahmed, Sheraz
    Dengel, Andreas
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [26] Sequence Characterized Amplified Region Markers for Species-specific Identification of Three Threatened Aquilaria Species
    Roslan, Hairul Azman
    Hossain, Md. Anowar
    Othman, Nur Qistina
    Tawan, Cheksum Supiah
    Ipor, Isa
    CHIANG MAI JOURNAL OF SCIENCE, 2017, 44 (04): : 1304 - 1310
  • [27] A proteomics approach for the identification of species-specific immunogenic proteins in the Mycobacterium abscessus complex
    Steindor, Mathis
    Nkwouano, Vanesa
    Stefanski, Anja
    Stuehler, Kai
    Ioerger, Thomas Richard
    Bogumil, David
    Jacobsen, Marc
    Mackenzie, Colin Rae
    Kalscheuer, Rainer
    MICROBES AND INFECTION, 2019, 21 (3-4) : 154 - 162
  • [28] A proteomics approach for the identification of species-specific immunogenic proteins in the Mycobacterium abscessus complex
    Steindor, Mathis
    Nkwouano, Vanesa
    Stefanski, Anja
    Stuehler, Kai
    Ioerger, Thomas R.
    Bogumil, David
    Jacobsen, Marc
    Mackenzie, Colin R.
    Kalscheuer, Rainer
    EUROPEAN RESPIRATORY JOURNAL, 2019, 54
  • [29] Identification of protein functions using a machine-learning approach based on sequence-derived properties
    Bum Ju Lee
    Moon Sun Shin
    Young Joon Oh
    Hae Seok Oh
    Keun Ho Ryu
    Proteome Science, 7
  • [30] Identification of Lactobacillus brevis using a species-specific AFLP-derived marker
    Fusco, Vincenzina
    Quero, Grazia Marina
    Chieffi, Daniele
    Franz, Charles M. A. P.
    INTERNATIONAL JOURNAL OF FOOD MICROBIOLOGY, 2016, 232 : 90 - 94