Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss

被引:144
|
作者
Bansal, Mukul S. [1 ]
Alm, Eric J. [2 ]
Kellis, Manolis [1 ,3 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] MIT, Dept Biol Engn, Cambridge, MA 02139 USA
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
PHYLOGENETIC TREES; RECONSTRUCTION; HISTORY; ASSOCIATIONS; EVOLUTION; ORTHOLOGS; MODEL;
D O I
10.1093/bioinformatics/bts225
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, where the input consists of a gene family phylogeny and the corresponding species phylogeny, and the goal is to reconcile the two by postulating speciation, gene duplication, horizontal gene transfer and gene loss events. This reconciliation problem is referred to as duplication-transfer-loss (DTL) reconciliation and has been extensively studied in the literature. Yet, even the fastest existing algorithms for DTL reconciliation are too slow for reconciling large gene families and for use in more sophisticated applications such as gene tree or species tree reconstruction. Results: We present two new algorithms for the DTL reconciliation problem that are dramatically faster than existing algorithms, both asymptotically and in practice. We also extend the standard DTL reconciliation model by considering distance-dependent transfer costs, which allow for more accurate reconciliation and give an efficient algorithm for DTL reconciliation under this extended model. We implemented our new algorithms and demonstrated up to 100 000-fold speed-up over existing methods, using both simulated and biological datasets. This dramatic improvement makes it possible to use DTL reconciliation for performing rigorous evolutionary analyses of large gene families and enables its use in advanced reconciliation-based gene and species tree reconstruction methods.
引用
收藏
页码:I283 / I291
页数:9
相关论文
共 50 条
  • [1] Efficient error correction algorithms for gene tree reconciliation based on duplication, duplication and loss, and deep coalescence
    Chaudhary, Ruchi
    Burleigh, J. Gordon
    Eulenstein, Oliver
    BMC BIOINFORMATICS, 2012, 13
  • [2] Efficient error correction algorithms for gene tree reconciliation based on duplication, duplication and loss, and deep coalescence
    Ruchi Chaudhary
    J Gordon Burleigh
    Oliver Eulenstein
    BMC Bioinformatics, 13
  • [3] Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees
    Kordi, Misagh
    Bansal, Mukul S.
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 297 - 306
  • [4] Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees
    Kordi, Misagh
    Bansal, Mukul S.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (04) : 1077 - 1090
  • [5] Reconciliation with Segmental Duplication, Transfer, Loss and Gain
    Anselmetti, Yoann
    Delabre, Matteo
    El-Mabrouk, Nadia
    COMPARATIVE GENOMICS (RECOMB-CG 2022), 2022, 13234 : 124 - 145
  • [6] Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
    Pawel Górecki
    Oliver Eulenstein
    BMC Bioinformatics, 13
  • [7] Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
    Gorecki, Pawel
    Eulenstein, Oliver
    BMC BIOINFORMATICS, 2012, 13
  • [8] On the impact of uncertain gene tree rooting on duplication-transfer-loss reconciliation
    Soumya Kundu
    Mukul S. Bansal
    BMC Bioinformatics, 19
  • [9] On the impact of uncertain gene tree rooting on duplication-transfer-loss reconciliation
    Kundu, Soumya
    Bansal, Mukul S.
    BMC BIOINFORMATICS, 2018, 19
  • [10] A probabilistic model for gene content evolution with duplication, loss, and horizontal transfer
    Cs rös, Miklós
    Miklós, István
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2006, 3909 : 206 - 220