Summarizing the solution space in tumor phylogeny inference by multiple consensus trees

被引:17
|
作者
Aguse, Nuraini [1 ]
Qi, Yuanyuan [1 ]
El-Kebir, Mohammed [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
EVOLUTION; TRACKING; HISTORY;
D O I
10.1093/bioinformatics/btz312
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Cancer phylogenies are key to studying tumorigenesis and have clinical implications. Due to the heterogeneous nature of cancer and limitations in current sequencing technology, current cancer phylogeny inference methods identify a large solution space of plausible phylogenies. To facilitate further downstream analyses, methods that accurately summarize such a set T of cancer phylogenies are imperative. However, current summary methods are limited to a single consensus tree or graph and may miss important topological features that are present in different subsets of candidate trees. Results We introduce the Multiple Consensus Tree (MCT) problem to simultaneously cluster T and infer a consensus tree for each cluster. We show that MCT is NP-hard, and present an exact algorithm based on mixed integer linear programming (MILP). In addition, we introduce a heuristic algorithm that efficiently identifies high-quality consensus trees, recovering all optimal solutions identified by the MILP in simulated data at a fraction of the time. We demonstrate the applicability of our methods on both simulated and real data, showing that our approach selects the number of clusters depending on the complexity of the solution space T. Availability and implementation https://github.com/elkebir-group/MCT. Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:I408 / I416
页数:9
相关论文
共 50 条
  • [21] Large scale multiple sequence alignment with simultaneous phylogeny inference
    Parmentier, Gilles
    Trystram, Denis
    Zola, Jaroslaw
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, 66 (12) : 1534 - 1545
  • [22] Multiple consensus trees: a method to separate divergent genes
    Alain Guénoche
    BMC Bioinformatics, 14
  • [23] Multiple consensus trees: a method to separate divergent genes
    Guenoche, Alain
    BMC BIOINFORMATICS, 2013, 14
  • [24] A new theory of phylogeny inference through construction of multidimensional vector space
    Kitazoe, Y
    Kurihara, Y
    Narita, Y
    Okuhara, Y
    Tominaga, A
    Suzuki, T
    MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (05) : 812 - 828
  • [25] Tumor phylogeny inference using tree-constrained importance sampling
    Satas, Gryte
    Raphael, Benjamin J.
    BIOINFORMATICS, 2017, 33 (14) : I152 - I160
  • [26] The Efficacy of Consensus Tree Methods for Summarizing Phylogenetic Relationships from a Posterior Sample of Trees Estimated from Morphological Data
    O'Reilly, Joseph E.
    Donoghue, Philip C. J.
    SYSTEMATIC BIOLOGY, 2018, 67 (02) : 354 - 362
  • [27] Combining multiple decision trees using fuzzy-neural inference
    Crockett, K
    Bandar, Z
    Mclean, D
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 1523 - 1527
  • [28] Leaping through Tree Space: Continuous Phylogenetic Inference for Rooted and Unrooted Trees
    Penn, Matthew J.
    Scheidwasser, Neil
    Penn, Joseph
    Donnelly, Christl A.
    Duchene, David A.
    Bhatt, Samir
    GENOME BIOLOGY AND EVOLUTION, 2023, 15 (12):
  • [29] Practical Speedup of Bayesian Inference of Species Phylogenies by Restricting the Space of Gene Trees
    Wang, Yaxuan
    Ogilvie, Huw A.
    Nakhleh, Luay
    MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (06) : 1809 - 1818
  • [30] ConDoR: tumor phylogeny inference with a copy-number constrained mutation loss model
    Sashittal, Palash
    Zhang, Haochen
    Iacobuzio-Donahue, Christine A.
    Raphael, Benjamin J.
    GENOME BIOLOGY, 2023, 24 (01)