Anomalous networks under the multispecies coalescent: theory and prevalence

被引:0
|
作者
Cécile Ané
John Fogg
Elizabeth S. Allman
Hector Baños
John A. Rhodes
机构
[1] University of Wisconsin - Madison,Department of Statistics
[2] University of Wisconsin - Madison,Department of Botany
[3] University of Alaska Fairbanks,Department of Mathematics and Statistics
[4] Dalhousie University,Department of Biochemistry & Molecular Biology
[5] Dalhousie University,Department of Mathematics and Statistics
来源
关键词
Gene tree; Species network; Species tree; Admixture graph; Introgression; Gene flow; Anomalous quartets; Concordance factors; 05C90; 60J90; 92D15; 92-10;
D O I
暂无
中图分类号
学科分类号
摘要
Reticulations in a phylogenetic network represent processes such as gene flow, admixture, recombination and hybrid speciation. Extending definitions from the tree setting, an anomalous network is one in which some unrooted tree topology displayed in the network appears in gene trees with a lower frequency than a tree not displayed in the network. We investigate anomalous networks under the Network Multispecies Coalescent Model with possible correlated inheritance at reticulations. Focusing on subsets of 4 taxa, we describe a new algorithm to calculate quartet concordance factors on networks of any level, faster than previous algorithms because of its focus on 4 taxa. We then study topological properties required for a 4-taxon network to be anomalous, uncovering the key role of 32\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3_2$$\end{document}-cycles: cycles of 3 edges parent to a sister group of 2 taxa. Under the model of common inheritance, that is, when each gene tree coalesces within a species tree displayed in the network, we prove that 4-taxon networks are never anomalous. Under independent and various levels of correlated inheritance, we use simulations under realistic parameters to quantify the prevalence of anomalous 4-taxon networks, finding that truly anomalous networks are rare. At the same time, however, we find a significant fraction of networks close enough to the anomaly zone to appear anomalous, when considering the quartet concordance factors observed from a few hundred genes. These apparent anomalies may challenge network inference methods.
引用
收藏
相关论文
共 50 条
  • [31] A stochastic Farris transform for genetic data under the multispecies coalescent with applications to data requirements
    Gautam Dasarathy
    Elchanan Mossel
    Robert Nowak
    Sebastien Roch
    Journal of Mathematical Biology, 2022, 84
  • [32] A simulation study to examine the impact of recombination on phylogenomic inferences under the multispecies coalescent model
    Zhu, Tianqi
    Flouri, Tomas
    Yang, Ziheng
    MOLECULAR ECOLOGY, 2022, 31 (10) : 2814 - 2829
  • [33] Bayesian species identification under the multispecies coalescent provides significant improvements to DNA barcoding analyses
    Yang, Ziheng
    Rannala, Bruce
    MOLECULAR ECOLOGY, 2017, 26 (11) : 3028 - 3036
  • [34] A Preliminary Framework for DNA Barcoding, Incorporating the Multispecies Coalescent
    Dowton, Mark
    Meiklejohn, Kelly
    Cameron, Stephen L.
    Wallman, James
    SYSTEMATIC BIOLOGY, 2014, 63 (04) : 639 - 644
  • [35] Testing Multispecies Coalescent Simulators Using Summary Statistics
    Allman, Elizabeth S.
    Banos, Hector
    Rhodes, John A.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1613 - 1618
  • [36] A Simulation Study to Examine the Information Content in Phylogenomic Data Sets under the Multispecies Coalescent Model
    Huang, Jun
    Flouri, Tomas
    Yang, Ziheng
    MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (11) : 3211 - 3224
  • [37] DISSECT: an assignment-free Bayesian discovery method for species delimitation under the multispecies coalescent
    Jones, Graham
    Aydin, Zeynep
    Oxelman, Bengt
    BIOINFORMATICS, 2015, 31 (07) : 991 - 998
  • [38] Gene tree discordance, phylogenetic inference and the multispecies coalescent
    Degnan, James H.
    Rosenberg, Noah A.
    TRENDS IN ECOLOGY & EVOLUTION, 2009, 24 (06) : 332 - 340
  • [39] Origin of land plants using the multispecies coalescent model
    Zhong, Bojian
    Liu, Liang
    Yan, Zhen
    Penny, David
    TRENDS IN PLANT SCIENCE, 2013, 18 (09) : 492 - 495
  • [40] An algorithm for computing the gene tree probability under the multispecies coalescent and its application in the inference of population tree
    Wu, Yufeng
    BIOINFORMATICS, 2016, 32 (12) : 225 - 233