Interpretable Machine Learning of Amino Acid Patterns in Proteins: A Statistical Ensemble Approach

被引:2
|
作者
Braghetto, Anna [1 ,2 ]
Orlandini, Enzo [1 ,2 ]
Baiesi, Marco [1 ,2 ]
机构
[1] Univ Padua, Dept Phys & Astron, Via Marzolo 8, I-35131 Padua, Italy
[2] INFN, Sez Padova, Via Marzolo 8, I-35131 Padua, Italy
关键词
SECONDARY STRUCTURE; POLAR; PREDICTION; DESIGN;
D O I
10.1021/acs.jctc.3c00383
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Explainable and interpretable unsupervised machine learninghelpsone to understand the underlying structure of data. We introduce anensemble analysis of machine learning models to consolidate theirinterpretation. Its application shows that restricted Boltzmann machinescompress consistently into a few bits the information stored in asequence of five amino acids at the start or end of & alpha;-helicesor & beta;-sheets. The weights learned by the machines reveal unexpectedproperties of the amino acids and the secondary structure of proteins:(i) His and Thr have a negligible contribution to the amphiphilicpattern of & alpha;-helices; (ii) there is a class of & alpha;-helicesparticularly rich in Ala at their end; (iii) Pro occupies most oftenslots otherwise occupied by polar or charged amino acids, and itspresence at the start of helices is relevant; (iv) Glu and especiallyAsp on one side and Val, Leu, Iso, and Phe on the other display thestrongest tendency to mark amphiphilic patterns, i.e., extreme valuesof an effective hydrophobicity, though they are notthe most powerful (non)hydrophobic amino acids.
引用
收藏
页码:6011 / 6022
页数:12
相关论文
共 50 条
  • [21] Interpretable ensemble machine learning models for predicting the shear capacity of UHPC joints
    Ye, Meng
    Li, Lifeng
    Jin, Weimeng
    Tang, Jiahao
    Yoo, Doo-Yeol
    Zhou, Cong
    ENGINEERING STRUCTURES, 2024, 315
  • [22] Interpretable Ensemble-Machine-Learning models for predicting creep behavior of concrete
    Liang, Minfei
    Chang, Ze
    Wan, Zhi
    Gan, Yidong
    Schlangen, Erik
    Savija, Branko
    CEMENT & CONCRETE COMPOSITES, 2022, 125
  • [23] Machine Learning: A Practical Approach on the Statistical Learning
    Liu, Shin Ta
    TECHNOMETRICS, 2020, 62 (04) : 560 - 561
  • [24] Unveiling Switching Function of Amino Acids in Proteins Using a Machine Learning Approach
    Mollaei, Parisa
    Farimani, Amir Barati
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (22) : 8472 - 8480
  • [25] Employing the Interpretable Ensemble Learning Approach to Predict the Bandgaps of the Halide Perovskites
    Ren, Chao
    Wu, Yiyuan
    Zou, Jijun
    Cai, Bowen
    MATERIALS, 2024, 17 (11)
  • [26] Machine learning prediction of amino acid patterns in protein N-myristoylation
    Okada, Ryo
    Sugii, Manabu
    Matsuno, Hiroshi
    Miyano, Satoru
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2006, 4146 : 4 - +
  • [27] A Machine Learning Approach for the Automated Interpretation of Plasma Amino Acid Profiles
    Wilkes, Edmund H.
    Emmett, Erin
    Beltran, Luisa
    Woodward, Gary M.
    Carling, Rachel S.
    CLINICAL CHEMISTRY, 2020, 66 (09) : 1210 - 1218
  • [28] Combinatorial discovery and investigation of the synergism of green amino acid corrosion inhibitors: Integrating high-throughput experiments and interpretable machine learning approach
    Yang, Jingzhi
    Zhao, Junsen
    Guo, Xin
    Ran, Yami
    Fu, Zhongheng
    Qian, Hongchang
    Ma, Lingwei
    Keil, Patrick
    Mol, Arjan
    Zhang, Dawei
    CORROSION SCIENCE, 2025, 245
  • [29] An interpretable machine learning approach to identify mechanism of action of antibiotics
    Mihir Mongia
    Mustafa Guler
    Hosein Mohimani
    Scientific Reports, 12
  • [30] OnML: an ontology-based approach for interpretable machine learning
    Pelin Ayranci
    Phung Lai
    Nhathai Phan
    Han Hu
    Alexander Kolinowski
    David Newman
    Deijing Dou
    Journal of Combinatorial Optimization, 2022, 44 : 770 - 793