Identifying functionally informative evolutionary sequence profiles

被引:8
|
作者
Gil, Nelson [1 ]
Fiser, Andras [1 ]
机构
[1] Albert Einstein Coll Med, Dept Syst & Computat Biol, Bronx, NY 10461 USA
关键词
DIRECT-COUPLING ANALYSIS; CORRELATED MUTATIONS; CONTACT PREDICTION; ALIGNMENT QUALITY; STRUCTURAL INFORMATION; MUTUAL INFORMATION; THEORETIC ANALYSIS; PROTEIN-STRUCTURE; RESIDUE CONTACTS; IMPROVES;
D O I
10.1093/bioinformatics/btx779
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Multiple sequence alignments (MSAs) can provide essential input to many bioinformatics applications, including protein structure prediction and functional annotation. However, the optimal selection of sequences to obtain biologically informative MSAs for such purposes is poorly explored, and has traditionally been performed manually. Results: We present Selection of Alignment by Maximal Mutual Information (SAMMI), an automated, sequence-based approach to objectively select an optimal MSA from a large set of alternatives sampled from a general sequence database search. The hypothesis of this approach is that the mutual information among MSA columns will be maximal for those MSAs that contain the most diverse set possible of the most structurally and functionally homogeneous protein sequences. SAMMI was tested to select MSAs for functional site residue prediction by analysis of conservation patterns on a set of 435 proteins obtained from protein-ligand (peptides, nucleic acids and small substrates) and protein-protein interaction databases.
引用
收藏
页码:1278 / 1286
页数:9
相关论文
共 50 条
  • [1] Predicting Functionally Informative Mutations in Escherichia coli BamA Using Evolutionary Covariance Analysis
    Dwyer, Robert S.
    Ricci, Dante P.
    Colwell, Lucy J.
    Silhavy, Thomas J.
    Wingreen, Ned S.
    GENETICS, 2013, 195 (02): : 443 - +
  • [2] An improved method for identifying functionally linked proteins using phylogenetic profiles
    Shawn Cokus
    Sayaka Mizutani
    Matteo Pellegrini
    BMC Bioinformatics, 8
  • [3] An improved method for identifying functionally linked proteins using phylogenetic profiles
    Cokus, Shawn
    Mizutani, Sayaka
    Pellegrini, Matteo
    BMC BIOINFORMATICS, 2007, 8 (Suppl 4)
  • [4] Deep learning with evolutionary and genomic profiles for identifying cancer subtypes
    Lin, Chun-Yu
    Ruan, Peiying
    Li, Ruiming
    Yang, Jinn-Moon
    See, Simon
    Song, Jiangning
    Akutsu, Tatsuya
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (03)
  • [5] Deep Learning with Evolutionary and Genomic Profiles for Identifying Cancer Subtypes
    Lin, Chun-Yu
    Li, Ruiming
    Akutsu, Tatsuya
    Ruan, Peiying
    See, Simon
    Yang, Jinn-Moon
    PROCEEDINGS 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2018, : 147 - 150
  • [6] Identifying functionally important mutations from phenotypically diverse sequence data
    Jensen, Kyle
    Alper, Hal
    Fischer, Curt
    Stephanopoulos, Gregory
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2006, 72 (05) : 3696 - 3701
  • [7] Reading Profiles May Not Be an Informative Approach to Identifying At-Risk Readers in Middle School
    Erbeli, Florina
    CROATIAN JOURNAL OF EDUCATION-HRVATSKI CASOPIS ZA ODGOJ I OBRAZOVANJE, 2022, 24 (01): : 69 - 95
  • [8] Evolutionary profiles from the QR factorization of multiple sequence alignments
    Sethi, A
    O'Donoghue, P
    Luthey-Schulten, Z
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (11) : 4045 - 4050
  • [9] Identifying informative cases for pooling
    Hinrichs, AL
    Reich, T
    AMERICAN JOURNAL OF MEDICAL GENETICS, 2001, 105 (07): : 566 - 566
  • [10] Identifying Structures with Informative Dimensions in Streams
    Bhatnagar, Vasudha
    Kaur, Sharanjit
    Gupta, Neelima
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 375 - 382