Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses

Cited by: 56
Authors
Lanfear, Robert [1, 2]
Hua, Xia [2]
Warren, Dan L. [1, 2]
Affiliations
[1] Macquarie Univ, Dept Biol Sci, Sydney, NSW, Australia
[2] Australian Natl Univ, Ecol Evolut & Genet, Canberra, ACT, Australia
Source
GENOME BIOLOGY AND EVOLUTION | 2016 / Volume 8 / Issue 08
Funding
Australian Research Council;
Keywords
tree distance; phylogenetics; topology; MCMC; ESS; phylogenomics; INFERENCE; EXPLORATION;
DOI
10.1093/gbe/evw171
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses.
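For a scalar parameter, the ESS described in the abstract is commonly estimated from the autocorrelation of the chain, as ESS = N / (1 + 2 Σ ρ_k), where ρ_k is the lag-k autocorrelation. The following is a minimal sketch of that generic scalar estimator only; it is not the tree-topology method developed in the paper. The function names and the rule of truncating the sum at the first non-positive autocorrelation are assumptions made for illustration.

```python
def autocorrelation(x, lag):
    """Sample autocorrelation of the sequence x at the given lag."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    if var == 0:
        return 0.0  # a constant chain has no defined autocorrelation
    cov = sum((x[i] - mean) * (x[i + lag] - mean) for i in range(n - lag)) / n
    return cov / var


def effective_sample_size(x):
    """Estimate ESS as N / (1 + 2 * sum of positive leading autocorrelations).

    Assumption: the sum is truncated at the first lag whose estimated
    autocorrelation is non-positive, a simple and common heuristic.
    """
    n = len(x)
    rho_sum = 0.0
    for lag in range(1, n):
        rho = autocorrelation(x, lag)
        if rho <= 0:
            break
        rho_sum += rho
    return n / (1 + 2 * rho_sum)
```

Under this sketch, a strongly autocorrelated chain yields an ESS far below the number of samples, which is the situation the rule of thumb of ESS greater than 200 is meant to detect.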
Pages: 2319 - 2332
Page count: 14