Assembling contigs in draft genomes using reversals and block-interchanges

被引:6
|
作者
Li, Chi-Long [1 ]
Chen, Kun-Tze [1 ]
Lu, Chin Lung [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30013, Taiwan
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
ALGORITHM;
D O I
10.1186/1471-2105-14-S5-S9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The techniques of next generation sequencing allow an increasing number of draft genomes to be produced rapidly in a decreasing cost. However, these draft genomes usually are just partially sequenced as collections of unassembled contigs, which cannot be used directly by currently existing algorithms for studying their genome rearrangements and phylogeny reconstruction. In this work, we study the one-sided block (or contig) ordering problem with weighted reversal and block-interchange distance. Given a partially assembled genome pi and a completely assembled genome sigma, the problem is to find an optimal ordering to assemble (i.e., order and orient) the contigs of pi such that the rearrangement distance measured by reversals and block-interchanges (also called generalized transpositions) with the weight ratio 1: 2 between the assembled contigs of pi and sigma is minimized. In addition to genome rearrangements and phylogeny reconstruction, the one-sided block ordering problem particularly has a useful application in genome resequencing, because its algorithms can be used to assemble the contigs of a draft genome pi based on a reference genome sigma. By using permutation groups, we design an efficient algorithm to solve this one-sided block ordering problem in O(delta n) time, where n is the number of genes or markers and delta is the number of used reversals and block-interchanges. We also show that the assembly of the partially assembled genome can be done in O(n) time and its weighted rearrangement distance from the completely assembled genome can be calculated in advance in O(n) time. Finally, we have implemented our algorithm into a program and used some simulated datasets to compare its accuracy performance to a currently existing similar tool, called SIS that was implemented by a heuristic algorithm that considers only reversals, on assembling the contigs in draft genomes based on their reference genomes. Our experimental results have shown that the accuracy performance of our program is better than that of SIS, when the number of reversals and transpositions involved in the rearrangement events between the complete genomes of pi and sigma is increased. In particular, if there are more transpositions involved in the rearrangement events, then the gap of accuracy performance between our program and SIS is increasing.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Assembling contigs in draft genomes using reversals and block-interchanges
    Chi-Long Li
    Kun-Tze Chen
    Chin Lung Lu
    BMC Bioinformatics, 14
  • [2] Sorting by block-interchanges and signed reversals
    Mira, Cleber
    Meidanis, Joao
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 670 - +
  • [3] SPRING: a tool for the analysis of genome rearrangement using reversals and block-interchanges
    Lin, Ying Chih
    Lu, Chin Lung
    Liu, Ying-Chuan
    Tang, Chuan Yi
    NUCLEIC ACIDS RESEARCH, 2006, 34 : W696 - W699
  • [4] Sorting by reversals and block-interchanges with various weight assignments
    Lin, Ying Chih
    Lin, Chun-Yuan
    Lin, Chunhung Richard
    BMC BIOINFORMATICS, 2009, 10
  • [5] Sorting by reversals and block-interchanges with various weight assignments
    Ying Chih Lin
    Chun-Yuan Lin
    Chunhung Richard Lin
    BMC Bioinformatics, 10
  • [6] An O(n3) algorithm for sorting signed genomes by reversals, transpositions, transreversals and block-interchanges
    Yu, Shuzhi
    Hao, Fanchang
    Leong, Hon Wai
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (01)
  • [8] Sorting by prefix block-interchanges
    Labarre, Anthony
    THEORETICAL COMPUTER SCIENCE, 2023, 958
  • [9] Sorting permutations by block-interchanges
    Christie, DA
    INFORMATION PROCESSING LETTERS, 1996, 60 (04) : 165 - 169
  • [10] Sorting permutations by block-interchanges
    Univ of Glasgow, Glasgow, United Kingdom
    Inf Process Lett, 4 (165-169):