Characterizing the Phylogenetic Tree-Search Problem

被引:17
|
作者
Money, Daniel [1 ]
Whelan, Simon [1 ]
机构
[1] Univ Manchester, Fac Life Sci, Manchester M13 9PT, Lancs, England
基金
英国生物技术与生命科学研究理事会;
关键词
Algorithms; heuristics; maximum likelihood; NNI; phylogenetics; SPR; tree-search; MAXIMUM-LIKELIHOOD; DNA-SEQUENCES; EVOLUTION; ALGORITHM; SPACE; MODEL;
D O I
10.1093/sysbio/syr097
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phylogenetic trees are important in many areas of biological research, ranging from systematic studies to the methods used for genome annotation. Finding the best scoring tree under any optimality criterion is an NP-hard problem, which necessitates the use of heuristics for tree-search. Although tree-search plays a major role in obtaining a tree estimate, there remains a limited understanding of its characteristics and how the elements of the statistical inferential procedure interact with the algorithms used. This study begins to answer some of these questions through a detailed examination of maximum likelihood tree-search on a wide range of real genome-scale data sets. We examine all 10,395 trees for each of the 106 genes of an eight-taxa yeast phylogenomic data set, then apply different tree-search algorithms to investigate their performance. We extend our findings by examining two larger genome-scale data sets and a large disparate data set that has been previously used to benchmark the performance of tree-search programs. We identify several broad trends occurring during tree-search that provide an insight into the performance of heuristics and may, in the future, aid their development. These trends include a tendency for the true maximum likelihood (best) tree to also be the shortest tree in terms of branch lengths, a weak tendency for tree-search to recover the best tree, and a tendency for tree-search to encounter fewer local optima in genes that have a high information content. When examining current heuristics for tree-search, we find that nearest-neighbor-interchange performs poorly, and frequently finds trees that are significantly different from the best tree. In contrast, subtree-pruning-and-regrafting tends to perform well, nearly always finding trees that are not significantly different to the best tree. Finally, we demonstrate that the precise implementation of a tree-search strategy, including when and where parameters are optimized, can change the character of tree-search, and that good strategies for tree-search may combine existing tree-search programs.
引用
收藏
页码:228 / 239
页数:12
相关论文
共 50 条
  • [41] Efficient Tree-Search MIMO Detection with Probabilistic Node Ordering
    Chang, Ronald Y.
    Chung, Wei-Ho
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [42] Neural Network Guided Tree-Search Policies for Synthesis Planning
    Thakkar, Amol
    Bjerrum, Esben Jannik
    Engkvist, Ola
    Reymond, Jean-Louis
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 721 - 724
  • [43] HEURISTIC TREE-SEARCH WITH NONPARAMETRIC STATISTICAL-INFERENCE METHODS
    ZHANG, WX
    RAO, NSV
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1991, 38 (3-4) : 133 - 152
  • [44] Flexible tree-search based orthogonal matching pursuit algorithm
    Karabulut, GZ
    Moura, L
    Panario, D
    Yongaçoglu, A
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 673 - 676
  • [45] A TREE-SEARCH ALGORITHM FOR MIXED INTEGER PROGRAMMING-PROBLEMS
    DAKIN, RJ
    COMPUTER JOURNAL, 1965, 8 (03): : 250 - 253
  • [46] Fully Pipelined Implementation of Tree-Search Algorithms for Vector Precoding
    Barrenechea, Maitane
    Mendicute, Mikel
    Arruti, Egoitz
    INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2013, 2013
  • [47] THE B-STAR TREE-SEARCH ALGORITHM - NEW RESULTS
    PALAY, AJ
    ARTIFICIAL INTELLIGENCE, 1982, 19 (02) : 145 - 163
  • [48] Tree-Search Decoding with Path Constraints for Linear Block Codes
    Chang, Chia-Fu
    Lin, Tien-Yu
    Lin, Mao-Chao
    2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 753 - 757
  • [49] A TREE-SEARCH APPROACH TO THE M-PARTITION AND KNAPSACK-PROBLEMS
    ALBANO, A
    ORSINI, R
    COMPUTER JOURNAL, 1980, 23 (03): : 256 - 261
  • [50] GPU-Accelerated Tree-Search in Chapel versus CUDA and HIP
    Helbecque, Guillaume
    Krishnasamy, Ezhilmathi
    Melab, Nouredine
    Bouvry, Pascal
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 872 - 879