Genome rearrangements and sorting by reversals

被引:234
|
作者
Bafna, V
Pevzner, PA
机构
[1] Computer Science Department, Pennsylvania State University, University Park
关键词
computational molecular biology; sorting by reversals; genome rearrangements;
D O I
10.1137/S0097539793250627
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sequence comparison in molecular biology is in the beginning of a major paradigm shift-a shift from gene comparison based on local mutations (i.e., insertions, deletions, and substitutions of nucleotides) to chromosome comparison based on global rearrangements (i.e., inversions and transpositions of fragments). The classical methods of sequence comparison do not work for global rearrangements, and little is known in computer science about the edit distance between sequences if global rearrangements are allowed. In the simplest form, the problem of gene rearrangements corresponds to sorting by reversals, i.e., sorting of an array using reversals of arbitrary fragments. Recently, Kececioglu and Sankoff gave the first approximation algorithm for sorting by reversals with guaranteed error bound 2 and identified open problems related to chromosome rearrangements. One of these problems is Gollan's conjecture on the reversal diameter of the symmetric group. This paper proves the conjecture. Further, the problem of expected reversal distance between two random permutations is investigated. The reversal distance between two random permutations is shown to be very close to the reversal diameter, thereby indicating that reversal distance provides a good separation between related and nonrelated sequences in molecular evolution studies. The gene rearrangement problem forces us to consider reversals of signed permutations, as the genes in DNA could be positively or negatively oriented. An approximation algorithm for signed permutation is presented, which provides a performance guarantee of 3/2 Finally, using the signed permutations approach, an approximation algorithm for sorting by reversals is described which achieves a performance guarantee of 7/4.
引用
收藏
页码:272 / 289
页数:18
相关论文
共 50 条
  • [31] Sorting with fixed-length reversals
    Department of Computer Science, State University of New York, Stony Brook, NY 11794-4400, United States
    Discrete Appl Math, 1-3 (269-295):
  • [32] baobabLUNA: the solution space of sorting by reversals
    Braga, Marilia D. V.
    BIOINFORMATICS, 2009, 25 (14) : 1833 - 1835
  • [33] An approximation algorithm for sorting by reversals and transpositions
    Rahman, Atif
    Shatabda, Swakkhar
    Hasan, Masud
    JOURNAL OF DISCRETE ALGORITHMS, 2008, 6 (03) : 449 - 457
  • [34] Sorting by Reversals and Transpositions with Proportion Restriction
    Brito, Klairton Lima
    Alexandrino, Alexsandro Oliveira
    Oliveira, Andre Rodrigues
    Dias, Ulisses
    Dias, Zanoni
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2020, 2020, 12558 : 117 - 128
  • [35] On the Complexity of Sorting by Reversals and Transpositions Problems
    Oliveira, Andre Rodrigues
    Brito, Klairton Lima
    Dias, Ulisses
    Dias, Zanoni
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (11) : 1223 - 1229
  • [36] Fast practical solution of sorting by reversals
    Caprara, A
    Lancia, G
    Ng, SK
    PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000, : 12 - 21
  • [37] An algorithm for sorting pancake by restricted reversals
    Seo, Jung-Hyun
    Kim, Jong-Seok
    Lee, Hyeong-Ok
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (10): : 3832 - 3850
  • [38] Sorting signed permutations by reversals, revisited
    Kaplan, H
    Verbin, E
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2005, 70 (03) : 321 - 341
  • [39] Perfect Sorting by Reversals and Deletions/Insertions
    Chen, Hong-Yu
    Tan, Xiang
    Li, Guo-Jun
    OPERATIONS RESEARCH AND ITS APPLICATIONS, 2010, 12 : 512 - 518
  • [40] Perfect sorting by reversals is not always difficult
    Berard, Severine
    Bergeron, Anne
    Chauve, Cedric
    Paul, Christophe
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (01) : 4 - 16