RNA global alignment in the joint sequence-structure space using elastic shape analysis

被引:13
|
作者
Laborde, Jose [1 ]
Robinson, Daniel [2 ]
Srivastava, Anuj [1 ]
Klassen, Eric [2 ]
Zhang, Jinfeng [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
基金
美国国家卫生研究院;
关键词
PAIRWISE ALIGNMENT; GENOMICS; MOTIFS; IDENTIFICATION; REPRESENTATION; DISCOVERY; SERVER;
D O I
10.1093/nar/gkt187
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The functions of RNAs, like proteins, are determined by their structures, which, in turn, are determined by their sequences. Comparison/alignment of RNA molecules provides an effective means to predict their functions and understand their evolutionary relationships. For RNA sequence alignment, most methods developed for protein and DNA sequence alignment can be directly applied. RNA 3-dimensional structure alignment, on the other hand, tends to be more difficult than protein structure alignment due to the lack of regular secondary structures as observed in proteins. Most of the existing RNA 3D structure alignment methods use only the backbone geometry and ignore the sequence information. Using both the sequence and backbone geometry information in RNA alignment may not only produce more accurate classification, but also deepen our understanding of the sequence-structure-function relationship of RNA molecules. In this study, we developed a new RNA alignment method based on elastic shape analysis (ESA). ESA treats RNA structures as three dimensional curves with sequence information encoded on additional dimensions so that the alignment can be performed in the joint sequence-structure space. The similarity between two RNA molecules is quantified by a formal distance, geodesic distance. Based on ESA, a rigorous mathematical framework can be built for RNA structure comparison. Means and covariances of full structures can be defined and computed, and probability distributions on spaces of such structures can be constructed for a group of RNAs. Our method was further applied to predict functions of RNA molecules and showed superior performance compared with previous methods when tested on benchmark datasets. The programs are available at ext-link-type="uri" xlink:href="http://stat.fsu.edu/" xmlns:xlink="http://www.w3.org/1999/xlink">http://stat.fsu.edu/ similar to jinfeng/ESA.html.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Integrated architectural solutions for protein sequence-structure alignment - (Preliminary version)
    Eshaghian-Wilner, MM
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVI, PROCEEDINGS: COMPUTER SCIENCE III, 2002, : 337 - 342
  • [22] SAD - a normalized structural alignment database: improving sequence-structure alignments
    Marsden, B
    Abagyan, R
    BIOINFORMATICS, 2004, 20 (15) : 2333 - 2344
  • [23] Structure-based RNA Function Prediction using Elastic Shape Analysis
    Laborde, Jose
    Srivastava, Anuj
    Zhang, Jinfeng
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 16 - 21
  • [24] Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints
    Robin D Dowell
    Sean R Eddy
    BMC Bioinformatics, 7
  • [25] Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints
    D Dowell, Robin
    Eddy, Sean R.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [26] Comparative sequence-structure analysis of Aves insulin
    Islam, Md Mirazul
    Aktaruzzaman, M.
    Mohamed, Zahurin
    BIOINFORMATION, 2015, 11 (02) : 67 - 72
  • [27] Matching and significance evaluation of combined sequence-structure motifs in RNA
    Meyer, C
    Giegerich, R
    ZEITSCHRIFT FUR PHYSIKALISCHE CHEMIE-INTERNATIONAL JOURNAL OF RESEARCH IN PHYSICAL CHEMISTRY & CHEMICAL PHYSICS, 2002, 216 : 193 - 216
  • [28] Servers for sequence-structure relationship analysis and prediction
    Dosztányi, Z
    Magyar, C
    Tusnády, GE
    Cserzo, M
    Fiser, A
    Simon, I
    NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3359 - 3363
  • [29] JOY: protein sequence-structure representation and analysis
    Mizuguchi, K
    Deane, CM
    Blundell, TL
    Johnson, MS
    Overingon, JP
    BIOINFORMATICS, 1998, 14 (07) : 617 - 623
  • [30] Using Structure to Explore the Sequence Alignment Space of Remote Homologs
    Kuziemko, Andrew
    Honig, Barry
    Petrey, Donald
    PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)