Prediction of the Bonding State of Cysteine Residues in Proteins with Machine-Learning Methods

被引:0
|
作者
Savojardo, Castrense [1 ]
Fariselli, Piero [1 ]
Martelli, Pier Luigi [1 ]
Shukla, Priyank [1 ]
Casadio, Rita [1 ]
机构
[1] Univ Bologna, Biocomp Grp, I-40126 Bologna, Italy
关键词
Machine Learning; Conditional Random Fields; Disulfide Prediction; Disulfide Bonding State; Protein Structure Prediction; Protein Folding;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we evaluate the performance of machine learning methods in the task of predicting the bonding state of cysteines starting from protein sequences. This task is the first step for the identification of disulfide bonds in proteins. We score the performance of three different approaches: 1) Hidden Support Vector Machines (HSVMs) which integrate the SVM predictions with a Hidden Markov Model; 2) SVM-HMMs which discriminatively train models that are isomorphic to a kth-order hidden Markov model; 3) Grammatical-Restrained Hidden Conditional Random Fields (GRHCRFs) that we recently introduced. We evaluate two different encoding schemes based on sequence profile and position specific scoring matrix (PSSM) as computed with the PSI-BLAST program and we show that when the evolutionary information is encoded with PSSM all the methods perform better than with sequence profile. Among the different methods it appears that GRHCRFs perform slightly better than the others achieving a per protein accuracy of 87% with a Matthews correlation coefficient (C) of 0.73. Finally, we investigate the difference between disulfide bonding state predictions in Eukaryotes and Prokaryotes. Our analysis shows that the per-protein accuracy in Prokaryotic proteins is higher than that in Eukaryotes (0.88 vs 0.83). However; given the paucity of bonded cysteines in Prokaryotes as compared to Eukaryotes the Matthews correlation coefficient is drastically reduced (0.48 vs 0.80).
引用
收藏
页码:98 / 111
页数:14
相关论文
共 50 条
  • [31] A Survey for Predicting ATP Binding Residues of Proteins Using Machine Learning Methods
    Yang, Yu-He
    Wang, Jia-Shu
    Yuan, Shi-Shi
    Liu, Meng-Lu
    Su, Wei
    Lin, Hao
    Zhang, Zhao-Yue
    CURRENT MEDICINAL CHEMISTRY, 2022, 29 (05) : 789 - 806
  • [32] Methods for Automatic Machine-Learning Workflow Analysis
    Wendlinger, Lorenz
    Berndl, Emanuel
    Granitzer, Michael
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT V, 2021, 12979 : 52 - 67
  • [33] Machine-Learning Methods on Noisy and Sparse Data
    Poulinakis, Konstantinos
    Drikakis, Dimitris
    Kokkinakis, Ioannis W.
    Spottswood, Stephen Michael
    MATHEMATICS, 2023, 11 (01)
  • [34] Machine-Learning Methods for Computational Science and Engineering
    Frank, Michael
    Drikakis, Dimitris
    Charissis, Vassilis
    COMPUTATION, 2020, 8 (01)
  • [35] How can machine-learning methods assist in virtual screening for hyperuricemia? A healthcare machine-learning approach
    Ichikawa, Daisuke
    Saito, Toki
    Ujita, Waka
    Oyama, Hiroshi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 20 - 24
  • [36] In Silico Prediction of CYP2C8 Inhibition with Machine-Learning Methods
    Zhang, Xiaoxiao
    Zhao, Piaopiao
    Wang, Zhiyuan
    Xu, Xuan
    Liu, Guixia
    Tang, Yun
    Li, Weihua
    CHEMICAL RESEARCH IN TOXICOLOGY, 2021, 34 (08) : 1850 - 1859
  • [37] Statistical and machine-learning methods for clearance time prediction of road incidents: A methodology review
    Tang, Jinjun
    Zheng, Lanlan
    Han, Chunyang
    Yin, Weiqi
    Zhang, Yue
    Zou, Yajie
    Huang, Helai
    ANALYTIC METHODS IN ACCIDENT RESEARCH, 2020, 27
  • [38] Predictive Ability of Machine-Learning Methods for Vitamin D Deficiency Prediction by Anthropometric Parameters
    Patino-Alonso, Carmen
    Gomez-Sanchez, Marta
    Gomez-Sanchez, Leticia
    Sanchez Salgado, Benigna
    Rodriguez-Sanchez, Emiliano
    Garcia-Ortiz, Luis
    Gomez-Marcos, Manuel A.
    MATHEMATICS, 2022, 10 (04)
  • [39] e-Bitter: Bitterant Prediction by the Consensus Voting From the Machine-Learning Methods
    Zheng, Suqing
    Jiang, Mengying
    Zhao, Chengwei
    Zhu, Rui
    Hu, Zhicheng
    Xu, Yong
    Lin, Fu
    FRONTIERS IN CHEMISTRY, 2018, 6
  • [40] PEVs Idle Time Prediction at Public Charging Stations Using Machine-Learning Methods
    Almaghrebi, Ahmad
    Al Juheshi, Fares
    James, Kevin
    Aljuhaishi, Nasser
    Alahmad, Mahmoud
    2021 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE & EXPO (ITEC), 2021,