Efficient Bayesian inference under the structured coalescent

被引:90
|
作者
Vaughan, Timothy G. [1 ]
Kuehnert, Denise [1 ,2 ,3 ]
Popinga, Alex [1 ,3 ]
Welch, David [1 ,3 ]
Drummond, Alexei J. [1 ,3 ]
机构
[1] Massey Univ, Allan Wilson Ctr Mol Ecol & Evolut, Palmerston North 4442, New Zealand
[2] ETH, Swiss Fed Inst Technol, Inst Integrat Biol, CH-8092 Zurich, Switzerland
[3] Univ Auckland, Dept Comp Sci, Auckland 1142, New Zealand
关键词
MAXIMUM-LIKELIHOOD-ESTIMATION; POPULATION SIZES; MIGRATION RATES; EVOLUTION; DYNAMICS; SEQUENCES; MATRIX;
D O I
10.1093/bioinformatics/btu201
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Population structure significantly affects evolutionary dynamics. Such structure may be due to spatial segregation, but may also reflect any other gene-flow-limiting aspect of a model. In combination with the structured coalescent, this fact can be used to inform phylogenetic tree reconstruction, as well as to infer parameters such as migration rates and subpopulation sizes from annotated sequence data. However, conducting Bayesian inference under the structured coalescent is impeded by the difficulty of constructing Markov Chain Monte Carlo (MCMC) sampling algorithms (samplers) capable of efficiently exploring the state space. Results: In this article, we present a new MCMC sampler capable of sampling from posterior distributions over structured trees: timed phylogenetic trees in which lineages are associated with the distinct subpopulation in which they lie. The sampler includes a set of MCMC proposal functions that offer significant mixing improvements over a previously published method. Furthermore, its implementation as a BEAST 2 package ensures maximum flexibility with respect to model and prior specification. We demonstrate the usefulness of this new sampler by using it to infer migration rates and effective population sizes of H3N2 influenza between New Zealand, New York and Hong Kong from publicly available hemagglutinin (HA) gene sequences under the structured coalescent.
引用
收藏
页码:2272 / 2279
页数:8
相关论文
共 50 条
  • [1] Efficient Bayesian inference under the multispecies coalescent with migration
    Flouri, Tomas
    Jiao, Xiyun
    Huang, Jun
    Rannala, Bruce
    Yang, Ziheng
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (44)
  • [2] Efficient Bayesian Species Tree Inference under the Multispecies Coalescent
    Rannala, Bruce
    Yang, Ziheng
    SYSTEMATIC BIOLOGY, 2017, 66 (05) : 823 - 842
  • [3] An Efficient Coalescent Epoch Model for Bayesian Phylogenetic Inference
    Bouckaert, Remco R.
    SYSTEMATIC BIOLOGY, 2022, : 1549 - 1560
  • [4] Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent
    Wen, Dingqiao
    Yu, Yun
    Nakhleh, Luay
    PLOS GENETICS, 2016, 12 (05):
  • [5] Bayesian Inference Under the Multispecies Coalescent with Ancient DNA Sequences
    Nagel, Anna A.
    Flouri, Tomas
    Yang, Ziheng
    Rannala, Bruce
    SYSTEMATIC BIOLOGY, 2024, 73 (06) : 964 - 978
  • [6] An efficient Bayesian inference framework for coalescent-based nonparametric phylodynamics
    Lan, Shiwei
    Palacios, Julia A.
    Karcher, Michael
    Minin, Vladimir N.
    Shahbaba, Babak
    BIOINFORMATICS, 2015, 31 (20) : 3282 - 3289
  • [7] MASCOT: parameter and state inference under the marginal structured coalescent approximation
    Mueller, Nicola F.
    Rasmussen, David
    Stadler, Tanja
    BIOINFORMATICS, 2018, 34 (22) : 3843 - 3848
  • [8] StarBeast3: Adaptive Parallelized Bayesian Inference under the Multispecies Coalescent
    Douglas, Jordan
    Jimenez-Silva, Cinthy L.
    Bouckaert, Remco
    SYSTEMATIC BIOLOGY, 2022, 71 (04) : 901 - 916
  • [9] New Routes to Phylogeography: A Bayesian Structured Coalescent Approximation
    De Maio, Nicola
    Wu, Chieh-Hsi
    O'Reilly, Kathleen M.
    Wilson, Daniel
    PLOS GENETICS, 2015, 11 (08):
  • [10] Estimating Population Parameters using the Structured Serial Coalescent with Bayesian MCMC Inference when some Demes are Hidden
    Ewing, Greg
    Rodrigo, Allen
    EVOLUTIONARY BIOINFORMATICS, 2006, 2 : 227 - 235