RNA global alignment in the joint sequence-structure space using elastic shape analysis

被引:13
|
作者
Laborde, Jose [1 ]
Robinson, Daniel [2 ]
Srivastava, Anuj [1 ]
Klassen, Eric [2 ]
Zhang, Jinfeng [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
基金
美国国家卫生研究院;
关键词
PAIRWISE ALIGNMENT; GENOMICS; MOTIFS; IDENTIFICATION; REPRESENTATION; DISCOVERY; SERVER;
D O I
10.1093/nar/gkt187
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The functions of RNAs, like proteins, are determined by their structures, which, in turn, are determined by their sequences. Comparison/alignment of RNA molecules provides an effective means to predict their functions and understand their evolutionary relationships. For RNA sequence alignment, most methods developed for protein and DNA sequence alignment can be directly applied. RNA 3-dimensional structure alignment, on the other hand, tends to be more difficult than protein structure alignment due to the lack of regular secondary structures as observed in proteins. Most of the existing RNA 3D structure alignment methods use only the backbone geometry and ignore the sequence information. Using both the sequence and backbone geometry information in RNA alignment may not only produce more accurate classification, but also deepen our understanding of the sequence-structure-function relationship of RNA molecules. In this study, we developed a new RNA alignment method based on elastic shape analysis (ESA). ESA treats RNA structures as three dimensional curves with sequence information encoded on additional dimensions so that the alignment can be performed in the joint sequence-structure space. The similarity between two RNA molecules is quantified by a formal distance, geodesic distance. Based on ESA, a rigorous mathematical framework can be built for RNA structure comparison. Means and covariances of full structures can be defined and computed, and probability distributions on spaces of such structures can be constructed for a group of RNAs. Our method was further applied to predict functions of RNA molecules and showed superior performance compared with previous methods when tested on benchmark datasets. The programs are available at ext-link-type="uri" xlink:href="http://stat.fsu.edu/" xmlns:xlink="http://www.w3.org/1999/xlink">http://stat.fsu.edu/ similar to jinfeng/ESA.html.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Modeling protein loops with knowledge-based prediction of sequence-structure alignment
    Peng, Hung-Pin
    Yang, An-Suei
    BIOINFORMATICS, 2007, 23 (21) : 2836 - 2842
  • [32] Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment
    Venclovas, C
    Margelevicius, M
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 : 99 - 105
  • [33] Fold recognition and accurate sequence-structure alignment of sequences directing β-sheet proteins
    McDonnell, Andrew V.
    Menke, Matthew
    Palmer, Nathan
    King, Jonathan
    Cowen, Lenore
    Berger, Bonnie
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 63 (04) : 976 - 985
  • [34] SSMART: sequence-structure motif identification for RNA-binding proteins
    Munteanu, Aline
    Mukherjee, Neelanjan
    Ohler, Uwe
    BIOINFORMATICS, 2018, 34 (23) : 3990 - 3998
  • [35] Capturing protein sequence-structure specificity using computational sequence design
    Mach, Paul
    Koehl, Patrice
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (09) : 1556 - 1570
  • [36] PSSARD: Protein sequence-structure analysis relational database
    Guruprasad, K
    Srikanth, K
    Babu, AVN
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2005, 36 (04) : 259 - 262
  • [37] Tools for integrated sequence-structure analysis with UCSF Chimera
    Elaine C Meng
    Eric F Pettersen
    Gregory S Couch
    Conrad C Huang
    Thomas E Ferrin
    BMC Bioinformatics, 7
  • [38] Sequence-structure analysis of FAD-containing proteins
    Dym, O
    Eisenberg, D
    PROTEIN SCIENCE, 2001, 10 (09) : 1712 - 1728
  • [39] Tools for integrated sequence-structure analysis with UCSF Chimera
    Meng, Elaine C.
    Pettersen, Eric F.
    Couch, Gregory S.
    Huang, Conrad C.
    Ferrin, Thomas E.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [40] Predicting RNA sequence-structure likelihood via structure-aware deep learning
    Zhou, You
    Pedrielli, Giulia
    Zhang, Fei
    Wu, Teresa
    BMC BIOINFORMATICS, 2024, 25 (01):