RNA global alignment in the joint sequence-structure space using elastic shape analysis

被引:13
|
作者
Laborde, Jose [1 ]
Robinson, Daniel [2 ]
Srivastava, Anuj [1 ]
Klassen, Eric [2 ]
Zhang, Jinfeng [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
基金
美国国家卫生研究院;
关键词
PAIRWISE ALIGNMENT; GENOMICS; MOTIFS; IDENTIFICATION; REPRESENTATION; DISCOVERY; SERVER;
D O I
10.1093/nar/gkt187
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The functions of RNAs, like proteins, are determined by their structures, which, in turn, are determined by their sequences. Comparison/alignment of RNA molecules provides an effective means to predict their functions and understand their evolutionary relationships. For RNA sequence alignment, most methods developed for protein and DNA sequence alignment can be directly applied. RNA 3-dimensional structure alignment, on the other hand, tends to be more difficult than protein structure alignment due to the lack of regular secondary structures as observed in proteins. Most of the existing RNA 3D structure alignment methods use only the backbone geometry and ignore the sequence information. Using both the sequence and backbone geometry information in RNA alignment may not only produce more accurate classification, but also deepen our understanding of the sequence-structure-function relationship of RNA molecules. In this study, we developed a new RNA alignment method based on elastic shape analysis (ESA). ESA treats RNA structures as three dimensional curves with sequence information encoded on additional dimensions so that the alignment can be performed in the joint sequence-structure space. The similarity between two RNA molecules is quantified by a formal distance, geodesic distance. Based on ESA, a rigorous mathematical framework can be built for RNA structure comparison. Means and covariances of full structures can be defined and computed, and probability distributions on spaces of such structures can be constructed for a group of RNAs. Our method was further applied to predict functions of RNA molecules and showed superior performance compared with previous methods when tested on benchmark datasets. The programs are available at ext-link-type="uri" xlink:href="http://stat.fsu.edu/" xmlns:xlink="http://www.w3.org/1999/xlink">http://stat.fsu.edu/ similar to jinfeng/ESA.html.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] RASS: a web server for RNA alignment in the joint sequence-structure space
    He, Gewen
    Steppi, Albert
    Laborde, Jose
    Srivastava, Anuj
    Zhao, Peixiang
    Zhang, Jinfeng
    NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) : W377 - W381
  • [2] Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization
    Bauer, Markus
    Klau, Gunnar W.
    Reinert, Knut
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [3] Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization
    Markus Bauer
    Gunnar W Klau
    Knut Reinert
    BMC Bioinformatics, 8
  • [4] Learning Parameters for Non-coding RNA Sequence-Structure Alignment
    Song, Yinglei
    Liu, Chunmei
    Qu, Junfeng
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 72 - +
  • [5] LaRA 2: parallel and vectorized program for sequence-structure alignment of RNA sequences
    Winkler, Jorg
    Urgese, Gianvito
    Ficarra, Elisa
    Reinert, Knut
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] A Bayes-optimal sequence-structure theory that unifies protein sequence-structure recognition and alignment
    Richard H. Lathrop
    Robert G. Rogers
    Temple F. Smith
    James V. White
    Bulletin of Mathematical Biology, 1998, 60 (6) : 1039 - 1071
  • [7] A Bayes-optimal sequence-structure theory that unifies protein sequence-structure recognition and alignment
    Lathrop, RH
    Rogers, RG
    Smith, TF
    White, JV
    BULLETIN OF MATHEMATICAL BIOLOGY, 1998, 60 (06) : 1039 - 1071
  • [8] Sequence-structure relations of pseudoknot RNA
    Huang, Fenix W. D.
    Li, Linda Y. M.
    Reidys, Christian M.
    BMC BIOINFORMATICS, 2009, 10
  • [9] Paramecium: RNA sequence-structure phylogenetics
    Weimer, Marlyn
    Vd'acny, Peter
    Wolf, Matthias
    INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2023, 73 (04)
  • [10] Sequence-structure relations of pseudoknot RNA
    Fenix WD Huang
    Linda YM Li
    Christian M Reidys
    BMC Bioinformatics, 10