Genome rearrangements and sorting by reversals

被引:234
|
作者
Bafna, V
Pevzner, PA
机构
[1] Computer Science Department, Pennsylvania State University, University Park
关键词
computational molecular biology; sorting by reversals; genome rearrangements;
D O I
10.1137/S0097539793250627
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sequence comparison in molecular biology is in the beginning of a major paradigm shift-a shift from gene comparison based on local mutations (i.e., insertions, deletions, and substitutions of nucleotides) to chromosome comparison based on global rearrangements (i.e., inversions and transpositions of fragments). The classical methods of sequence comparison do not work for global rearrangements, and little is known in computer science about the edit distance between sequences if global rearrangements are allowed. In the simplest form, the problem of gene rearrangements corresponds to sorting by reversals, i.e., sorting of an array using reversals of arbitrary fragments. Recently, Kececioglu and Sankoff gave the first approximation algorithm for sorting by reversals with guaranteed error bound 2 and identified open problems related to chromosome rearrangements. One of these problems is Gollan's conjecture on the reversal diameter of the symmetric group. This paper proves the conjecture. Further, the problem of expected reversal distance between two random permutations is investigated. The reversal distance between two random permutations is shown to be very close to the reversal diameter, thereby indicating that reversal distance provides a good separation between related and nonrelated sequences in molecular evolution studies. The gene rearrangement problem forces us to consider reversals of signed permutations, as the genes in DNA could be positively or negatively oriented. An approximation algorithm for signed permutation is presented, which provides a performance guarantee of 3/2 Finally, using the signed permutations approach, an approximation algorithm for sorting by reversals is described which achieves a performance guarantee of 7/4.
引用
收藏
页码:272 / 289
页数:18
相关论文
共 50 条
  • [1] SORTING BY REVERSALS - GENOME REARRANGEMENTS IN PLANT ORGANELLES AND EVOLUTIONARY HISTORY OF X-CHROMOSOME
    BAFNA, V
    PEVZNER, PA
    MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (02) : 239 - 246
  • [2] Estimate the distance of genome rearrangements by reversals
    Hao, Jianxiu
    JOURNAL OF MATHEMATICAL CHEMISTRY, 2007, 42 (01) : 81 - 92
  • [3] Estimate the Distance of Genome Rearrangements by Reversals
    Jianxiu Hao
    Journal of Mathematical Chemistry, 2007, 42 : 81 - 92
  • [4] Working on the problem of sorting by transpositions on genome rearrangements
    Walter, MEMT
    Curado, LRAF
    Oliveira, AG
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 372 - 383
  • [5] A 2-approximation algorithm for genome rearrangements by reversals and transpositions
    Gu, QP
    Peng, ST
    Sudborough, H
    THEORETICAL COMPUTER SCIENCE, 1999, 210 (02) : 327 - 339
  • [6] An approximation algorithm for genome sorting by reversals to recover all adjacencies
    Shanshan Zhai
    Peng Zhang
    Daming Zhu
    Weitian Tong
    Yao Xu
    Guohui Lin
    Journal of Combinatorial Optimization, 2019, 37 : 1170 - 1190
  • [7] EXACT AND APPROXIMATION ALGORITHMS FOR SORTING BY REVERSALS, WITH APPLICATION TO GENOME REARRANGEMENT
    KECECIOGLU, J
    SANKOFF, D
    ALGORITHMICA, 1995, 13 (1-2) : 180 - 210
  • [8] An approximation algorithm for genome sorting by reversals to recover all adjacencies
    Zhai, Shanshan
    Zhang, Peng
    Zhu, Daming
    Tong, Weitian
    Xu, Yao
    Lin, Guohui
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2019, 37 (04) : 1170 - 1190
  • [9] Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement
    Univ of Georgia, Athens, United States
    Algorithmica (New York), 1-2 (180-210):
  • [10] An Exact Algorithm for Sorting by Weighted Preserving Genome Rearrangements
    Hartmann, Tom
    Bernt, Matthias
    Middendorf, Martin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (01) : 52 - 62