PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features

被引:2
|
作者
Wang, Wei [1 ,2 ]
Zhang, Yu [1 ]
Liu, Dong [1 ]
Zhang, HongJun [3 ]
Wang, XianFang [4 ]
Zhou, Yun [1 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453000, Henan, Peoples R China
[2] Key Lab Artificial Intelligence & Personalized Le, Xinxiang, Henan, Peoples R China
[3] Anyang Univ, Sch Comp Sci & Technol, Anyang 455000, Peoples R China
[4] Henan Inst Technol, Coll Comp Sci & Technol Engn, Xinxiang 453000, Henan, Peoples R China
基金
中国国家自然科学基金; 欧洲研究理事会;
关键词
Ubiquitination sites; Arabidopsis thaliana; Sequence information; Support vector machine; AMINO-ACIDS; ACCURATE PREDICTION; CD-HIT; PROTEIN;
D O I
10.1007/s11103-022-01288-3
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ubiquitination modification is an important post-translational modification of proteins, which participates in the regulation of many important life activities in cells. At present, ubiquitination proteomics research is mostly concentrated in animals and yeasts, while relatively few studies have been carried out in plants. It can be said that the calculation and prediction of Arabidopsis thaliana ubiquitination sites is still in its infancy. Based on this, we describe a calculation method, PseAraUbi (Prediction of Arabidopsis thaliana ubiquitination sites using pseudo amino acid composition), that can effectively detect ubiquitination sites on Arabidopsis thaliana using support vector machine learning classifiers. Based on protein sequence information, extract features from the Chou-Fasman parameter, amino acids hydrophobicity features, polarity information and selected for classification with the Boruta algorithm. PseAraUbi achieves promising performances with an AUC score of 0.953 with fivefold cross-validation on the training dataset, which are significantly better than that of the pioneer Arabidopsis thaliana ubiquitination sites method. We also proved the ability of our proposed method on independent test sets, thus gaining a competitive advantage. In addition, we also in-depth analyzed the physicochemical properties of amino acids in the region adjacent to the ubiquitination site. To facilitate the community, the source code, optimal feature subset, ubiquitination sites dataset in the Arbidopsis proteome are available at GitHub (https://github.com/HNUBioinformati cs/PseAraUbi.git) for interest users.
引用
收藏
页码:81 / 92
页数:12
相关论文
共 50 条
  • [31] Structural and physico-chemical properties of clathrate hydrates of tetraisoamylammonium polyacrylates
    Terekhova, Irina S.
    Manakov, Andrey Yu
    Villevald, Galina V.
    Karpova, Tamara D.
    JOURNAL OF INCLUSION PHENOMENA AND MACROCYCLIC CHEMISTRY, 2022, 102 (11-12) : 905 - 915
  • [32] Physico-chemical, rheological and structural properties of fractionated potato starches
    Kaur, Lovedeep
    Singh, Jaspreet
    McCarthy, Owen J.
    Singh, Harmit
    JOURNAL OF FOOD ENGINEERING, 2007, 82 (03) : 383 - 394
  • [33] Changes of the structural and physico-chemical properties of the remolten nefelitic basanites
    Rybar, P
    Kysel'ova, K
    Zacharov, M
    Sekula, F
    METALURGIJA, 1998, 37 (04): : 239 - 244
  • [34] Characterization of compensated mutations in terms of structural and physico-chemical properties
    Ferrer-Costa, Carles
    Orozco, Modesto
    de la Cruz, Xavier
    JOURNAL OF MOLECULAR BIOLOGY, 2007, 365 (01) : 249 - 256
  • [35] Oil palm fiber biodegradation: physico-chemical and structural relationships
    Farah Nadia Omar
    Halimatun Saadiah Hafid
    Azhari Samsu Baharuddin
    Mohd Afandi P. Mohammed
    Jaafar Abdullah
    Planta, 2017, 246 : 567 - 577
  • [36] Structural and physico-chemical properties of clathrate hydrates of tetraisoamylammonium polyacrylates
    Irina S. Terekhova
    Andrey Yu. Manakov
    Galina V. Villevald
    Tamara D. Karpova
    Journal of Inclusion Phenomena and Macrocyclic Chemistry, 2022, 102 : 905 - 915
  • [37] Innovative physico-chemical treatment of wastewater incorporating Moringa oleifera seed coagulant
    Bhuptawat, Hitendra
    Folkard, G. K.
    Chaudhari, Sanjeev
    JOURNAL OF HAZARDOUS MATERIALS, 2007, 142 (1-2) : 477 - 482
  • [38] More on the line graph model for predicting physico-chemical properties of alkanes
    Gutman, I
    Tomovic, Z
    ACH-MODELS IN CHEMISTRY, 2000, 137 (04): : 439 - 445
  • [39] The line graph model. Predicting physico-chemical properties of alkanes
    Gutman, I
    Popovic, L
    Estrada, E
    Bertz, SH
    ACH-MODELS IN CHEMISTRY, 1998, 135 (1-2): : 147 - 155
  • [40] Predicting Physico-Chemical Properties of Alkylated Naphthalenes with COSMO-RS
    Schroeder, Bernd
    Coutinho, Joao A. P.
    Santos, Luis M. N. B. F.
    POLYCYCLIC AROMATIC COMPOUNDS, 2013, 33 (01) : 1 - 19