Efficient Bayesian inference under the multispecies coalescent with migration

被引:7
|
作者
Flouri, Tomas [1 ]
Jiao, Xiyun [2 ]
Huang, Jun [3 ]
Rannala, Bruce [4 ]
Yang, Ziheng [1 ]
机构
[1] UCL, Dept Genet Evolut & Environm, London WC1E 6BT, England
[2] China Southern Univ Sci & Technol, Dept Stat & Data Sci, Shenzhen 518055, Peoples R China
[3] Capital Med Univ, Sch Biomed Engn, Dept Intelligent Med Engn, Beijing 100069, Peoples R China
[4] Univ Calif Davis, Dept Evolut & Ecol, Davis, CA 95616 USA
基金
英国生物技术与生命科学研究理事会;
关键词
BPP; gene flow; genomics; migration; multispecies coalescent; MAXIMUM-LIKELIHOOD IMPLEMENTATION; ANCESTRAL POPULATION SIZES; DNA-SEQUENCES; GENE FLOW; MODEL; SPECIATION; DIVERGENCE; INTROGRESSION; NUMBER; TREES;
D O I
10.1073/pnas.2310708120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analyses of genome sequence data have revealed pervasive interspecific gene flow and enriched our understanding of the role of gene flow in speciation and adaptation. Inference of gene flow using genomic data requires powerful statistical methods. Yet current likelihood-based methods involve heavy computation and are feasible for small datasets only. Here, we implement the multispecies-coalescent-with-migration model in the Bayesian program bpp, which can be used to test for gene flow and estimate migration rates, as well as species divergence times and population sizes. We develop Markov chain Monte Carlo algorithms for efficient sampling from the posterior, enabling the analysis of genome-scale datasets with thousands of loci. Implementation of both introgression and migration models in the same program allows us to test whether gene flow occurred continuously over time or in pulses. Analyses of genomic data from Anopheles mosquitoes demonstrate rich information in typical genomic datasets about the mode and rate of gene flow.
引用
收藏
页数:11
相关论文
共 50 条