Interpretable Machine Learning of Amino Acid Patterns in Proteins: A Statistical Ensemble Approach

被引:2
|
作者
Braghetto, Anna [1 ,2 ]
Orlandini, Enzo [1 ,2 ]
Baiesi, Marco [1 ,2 ]
机构
[1] Univ Padua, Dept Phys & Astron, Via Marzolo 8, I-35131 Padua, Italy
[2] INFN, Sez Padova, Via Marzolo 8, I-35131 Padua, Italy
关键词
SECONDARY STRUCTURE; POLAR; PREDICTION; DESIGN;
D O I
10.1021/acs.jctc.3c00383
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Explainable and interpretable unsupervised machine learninghelpsone to understand the underlying structure of data. We introduce anensemble analysis of machine learning models to consolidate theirinterpretation. Its application shows that restricted Boltzmann machinescompress consistently into a few bits the information stored in asequence of five amino acids at the start or end of & alpha;-helicesor & beta;-sheets. The weights learned by the machines reveal unexpectedproperties of the amino acids and the secondary structure of proteins:(i) His and Thr have a negligible contribution to the amphiphilicpattern of & alpha;-helices; (ii) there is a class of & alpha;-helicesparticularly rich in Ala at their end; (iii) Pro occupies most oftenslots otherwise occupied by polar or charged amino acids, and itspresence at the start of helices is relevant; (iv) Glu and especiallyAsp on one side and Val, Leu, Iso, and Phe on the other display thestrongest tendency to mark amphiphilic patterns, i.e., extreme valuesof an effective hydrophobicity, though they are notthe most powerful (non)hydrophobic amino acids.
引用
收藏
页码:6011 / 6022
页数:12
相关论文
共 50 条
  • [31] Predictors of Sustainable Investment Motivation: An Interpretable Machine Learning Approach
    Sosnovskikh, Sergey
    Valko, Danila
    Meyer-Alten, Raphael
    SUSTAINABLE DEVELOPMENT, 2025,
  • [32] "What is relevant in a text document?": An interpretable machine learning approach
    Arras, Leila
    Horn, Franziska
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Samek, Wojciech
    PLOS ONE, 2017, 12 (08):
  • [33] OnML: an ontology-based approach for interpretable machine learning
    Ayranci, Pelin
    Lai, Phung
    Phan, Nhathai
    Hu, Han
    Kolinowski, Alexander
    Newman, David
    Dou, Deijing
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2022, 44 (01) : 770 - 793
  • [34] Using an Interpretable Amino Acid-Based Machine Learning Method to Enhance the Diagnosis of Major Depressive Disorder
    Ho, Cyrus Su Hui
    Tan, Trevor Wei Kiat
    Khoe, Howard Cai Hao
    Chan, Yee Ling
    Tay, Gabrielle Wann Nii
    Tang, Tong Boon
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (05)
  • [35] An interpretable machine learning approach to identify mechanism of action of antibiotics
    Mongia, Mihir
    Guler, Mustafa
    Mohimani, Hosein
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [36] Amino acid coupling patterns in thermophilic proteins
    Liang, HK
    Huang, CM
    Ko, MT
    Hwang, JK
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 59 (01) : 58 - 63
  • [37] Predicting ozone formation in petrochemical industrialized Lanzhou city by interpretable ensemble machine learning
    Wang, Li
    Zhao, Yuan
    Shi, Jinsen
    Ma, Jianmin
    Liu, Xiaoyue
    Han, Dongliang
    Gao, Hong
    Huang, Tao
    ENVIRONMENTAL POLLUTION, 2023, 318
  • [38] An Interpretable Deep Learning Approach for Detecting Marine Heatwaves Patterns
    He, Qi
    Zhu, Zihang
    Zhao, Danfeng
    Song, Wei
    Huang, Dongmei
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [39] CHARACTERIZATION OF AMINO ACID SEQUENCES IN PROTEINS BY STATISTICAL METHODS
    ZIMMERMAN, JM
    ELIEZER, N
    SIMHA, R
    JOURNAL OF THEORETICAL BIOLOGY, 1968, 21 (02) : 170 - +
  • [40] Accurate prediction of essential proteins using ensemble machine learning
    鲁德志
    吴淏
    侯俞彤
    吴云成
    刘媛媛
    王金武
    Chinese Physics B, 2025, 34 (01) : 112 - 119