Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses

被引:56
|
作者
Lanfear, Robert [1 ,2 ]
Hua, Xia [2 ]
Warren, Dan L. [1 ,2 ]
机构
[1] Macquarie Univ, Dept Biol Sci, Sydney, NSW, Australia
[2] Australian Natl Univ, Ecol Evolut & Genet, Canberra, ACT, Australia
来源
GENOME BIOLOGY AND EVOLUTION | 2016年 / 8卷 / 08期
基金
澳大利亚研究理事会;
关键词
tree distance; phylogenetics; topology; MCMC; ESS; phylogenomics; INFERENCE; EXPLORATION;
D O I
10.1093/gbe/evw171
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses.
引用
收藏
页码:2319 / 2332
页数:14
相关论文
共 50 条
  • [1] How Trustworthy Is Your Tree? Bayesian Phylogenetic Effective Sample Size Through the Lens of Monte Carlo Error
    Magee, Andrew
    Karcher, Michael
    Matsen, Frederick A.
    Minin, Volodymyr M.
    BAYESIAN ANALYSIS, 2024, 19 (02): : 565 - 593
  • [2] Phylogenetic effective sample size
    Bartoszek, Krzysztof
    JOURNAL OF THEORETICAL BIOLOGY, 2016, 407 : 371 - 386
  • [3] Bayesian analyses in phylogenetic palaeontology: interpreting the posterior sample
    Wright, April M.
    Lloyd, Graeme T.
    PALAEONTOLOGY, 2020, 63 (06) : 997 - 1006
  • [4] Bayesian sample size determination for estimating binomial parameters from data subject to misclassification
    Rahme, E
    Joseph, L
    Gyorkos, TW
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2000, 49 : 119 - 128
  • [5] Evaluating the Impact of Anatomical Partitioning on Summary Topologies Obtained with Bayesian Phylogenetic Analyses of Morphological Data
    Casali, Daniel M.
    Freitas, Felipe, V
    Perini, Fernando A.
    SYSTEMATIC BIOLOGY, 2023, 72 (01) : 62 - 77
  • [6] COMPARATIVE PHYLOGENETIC ANALYSES OF CRYPTOPHYTE NUCLEAR, NUCLEOMORPH AND PLASTID GENES: TREE TOPOLOGIES AND BRANCH LENGTHS
    Hoef-Emden, K.
    Tran, H-D
    Melkonian, M.
    PHYCOLOGIA, 2005, 44 (04) : 45 - 46
  • [7] Estimating the sample variance from the sample size and range
    Rychtar, Jan
    T. Taylor, Dewey
    STATISTICS IN MEDICINE, 2020, 39 (30) : 4667 - 4686
  • [8] Bayesian sample size determination for estimating a Poisson rate with underreported data
    Stamey, JD
    Seaman, JW
    Young, DM
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2004, 33 (02) : 341 - 354
  • [9] Sample size for estimating organism concentration in ballast water: A Bayesian approach
    Costa, Eliardo G.
    Paulino, Carlos Daniel
    Singer, Julio M.
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2021, 35 (01) : 158 - 171
  • [10] Estimating the effective sample size in association studies of quantitative traits
    Ziyatdinov, Andrey
    Kim, Jihye
    Prokopenko, Dmitry
    Prive, Florian
    Laporte, Fabien
    Loh, Po-Ru
    Kraft, Peter
    Aschard, Hugues
    G3-GENES GENOMES GENETICS, 2021, 11 (06):