Fast and accurate methods for phylogenomic analyses

被引:27
|
作者
Yang, Jimmy [1 ]
Warnow, Tandy [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
来源
BMC BIOINFORMATICS | 2011年 / 12卷
基金
美国国家科学基金会;
关键词
SPECIES TREE ESTIMATION; MAXIMUM-LIKELIHOOD; BAYESIAN-ESTIMATION; GENE TREES; INFERENCE; CONCORDANCE; MODEL;
D O I
10.1186/1471-2105-12-S9-S4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another) due to biological processes such as horizontal gene transfer, incomplete lineage sorting, and gene duplication and loss, so that no single gene tree is a reliable estimate of the species tree. Several methods have been developed to estimate species trees from estimated gene trees, differing according to the specific algorithmic technique used and the biological model used to explain differences between species and gene trees. Relatively little is known about the relative performance of these methods. Results: We report on a study evaluating several different methods for estimating species trees from sequence datasets, simulating sequence evolution under a complex model including indels (insertions and deletions), substitutions, and incomplete lineage sorting. The most important finding of our study is that some fast and simple methods are nearly as accurate as the most accurate methods, which employ sophisticated statistical methods and are computationally quite intensive. We also observe that methods that explicitly consider errors in the estimated gene trees produce more accurate trees than methods that assume the estimated gene trees are correct. Conclusions: Our study shows that highly accurate estimations of species trees are achievable, even when gene trees differ from each other and from the species tree, and that these estimations can be obtained using fairly simple and computationally tractable methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Comparative genomic and phylogenomic analyses of the Bifidobacteriaceae family
    Lugli, Gabriele Andrea
    Milani, Christian
    Turroni, Francesca
    Duranti, Sabrina
    Mancabelli, Leonardo
    Mangifesta, Marta
    Ferrario, Chiara
    Modesto, Monica
    Mattarelli, Paola
    Killer, Jiri
    van Sinderen, Douwe
    Ventura, Marco
    BMC GENOMICS, 2017, 18
  • [32] Phylogenomic analyses data of the avian phylogenomics project
    Jarvis, Erich D.
    Mirarab, Siavash
    Aberer, Andre J.
    Li, Bo
    Houde, Peter
    Li, Cai
    Ho, Simon Y. W.
    Faircloth, Brant C.
    Nabholz, Benoit
    Howard, Jason T.
    Suh, Alexander
    Weber, Claudia C.
    da Fonseca, Rute R.
    Alfaro-Nunez, Alonzo
    Narula, Nitish
    Liu, Liang
    Burt, Dave
    Ellegren, Hans
    Edwards, Scott V.
    Stamatakis, Alexandros
    Mindell, David P.
    Cracraft, Joel
    Braun, Edward L.
    Warnow, Tandy
    Jun, Wang
    Gilbert, M. Thomas Pius
    Zhang, Guojie
    GIGASCIENCE, 2015, 4
  • [33] Phylogenomic Analyses Elucidate the Evolutionary Relationships of Bats
    Tsagkogeorga, Georgia
    Parker, Joe
    Stupka, Elia
    Cotton, James A.
    Rossiter, Stephen J.
    CURRENT BIOLOGY, 2013, 23 (22) : 2262 - 2267
  • [34] Why phylogenomic uncertainty enhances introgression analyses
    Pease, James B.
    MOLECULAR ECOLOGY, 2018, 27 (22) : 4347 - 4349
  • [35] Resources for phylogenomic analyses of Australian terrestrial vertebrates
    Bragg, Jason G.
    Potter, Sally
    Bi, Ke
    Catullo, Renee
    Donnellan, Stephen C.
    Eldridge, Mark D. B.
    Joseph, Leo
    Keogh, J. Scott
    Oliver, Paul
    Rowe, Kevin C.
    Moritz, Craig
    MOLECULAR ECOLOGY RESOURCES, 2017, 17 (05) : 869 - 876
  • [36] Fast accurate eigenvalue methods for graded positive definite matrices
    Mathias, R
    NUMERISCHE MATHEMATIK, 1996, 74 (01) : 85 - 103
  • [37] Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods
    Wang, Suhang
    Aggarwal, Charu
    Liu, Huan
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 485 - 494
  • [38] Fast and accurate quantum Monte Carlo methods for material science
    Alfe, Dario
    Zen, Andrea
    Gillan, Mike
    Sorella, Sandro
    Michaelides, Angelos
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [39] Evaluating methods for phylogenomic analyses, and a new phylogeny for a major frog clade (Hyloidea) based on 2214 loci
    Streicher, Jeffrey W.
    Miller, Elizabeth C.
    Guerrero, Pablo C.
    Correa, Claudio
    Ortiz, Juan C.
    Crawford, Andrew J.
    Pie, Marcio R.
    Wiens, John J.
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2018, 119 : 128 - 143
  • [40] Genome assembly and phylogenomic data analyses using plastid data: Contrasting species tree estimation methods
    Goncalves, D. J. P.
    Simpson, B. B.
    Shimizu, G. H.
    Jansen, R. K.
    Ortiz, E. M.
    DATA IN BRIEF, 2019, 25