The effects of sampling on delimiting species from multi-locus sequence data

被引:40
|
作者
Rittmeyer, Eric N. [1 ]
Austin, Christopher C. [1 ]
机构
[1] Louisiana State Univ, Museum Nat Sci, Dept Biol Sci, Baton Rouge, LA 70803 USA
基金
美国国家科学基金会;
关键词
Species delimitation; Sampling strategy; Structurama; Nonparametric delimitation; Gaussian clustering; POPULATION-STRUCTURE; MAXIMUM-LIKELIHOOD; BAYESIAN-INFERENCE; TREE ESTIMATION; GENE TREES; DELIMITATION; CONSEQUENCES; TAXONOMY; DIVERGENCE; SIMULATION;
D O I
10.1016/j.ympev.2012.06.031
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As a fundamental unit in biology, species are used in a wide variety of studies, and their delimitation impacts every subfield of the life sciences. Thus, it is of utmost importance that species are delimited in an accurate and biologically meaningful way. However, due to morphologically similar, cryptic species, and processes such as incomplete lineage sorting, this is far from a trivial task. Here, we examine the accuracy and sensitivity to sampling strategy of three recently developed methods that aim to delimit species from multi-locus DNA sequence data without a priori assignments of samples to putative species. Specifically, we simulate data at two species tree depths and a variety of sampling strategies ranging from five alleles per species and five loci to 20 alleles per species and 100 loci to test (1) Structurama, (2) Gaussian clustering, and (3) nonparametric delimitation. We find that Structurama accurately delimits even relatively recently diverged (greater than 1.5 N generations) species when sampling 10 or more loci. We also find that Gaussian clustering delimits more deeply divergent species (greater than 2.5 N generations) relatively well, but is not sufficiently sensitive to delimit more recently diverged species. Finally, we find that nonparametric delimitation performs well with 25 or more loci if gene trees are known without error, but performs poorly with estimated gene genealogies, frequently over-splitting species and mis-assigning samples. We thus suggest that Structurama represents a powerful tool for use in species delimitation. It should be noted, however, that intraspecific population structure may be delimited using this or any of the methods tested herein. We argue that other methods, such as other species delimitation methods requiring a priori putative species assignments (e.g. SpeDeSTEM, Bayesian species delimitation), and other types of data (e.g. morphological, ecological, behavioral) be incorporated in conjunction with these methods in studies attempting to delimit species. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:451 / 463
页数:13
相关论文
共 50 条
  • [1] Delimiting species boundaries within the Babyloniidae (Mollusca: Gastropoda: Neogastropoda) using multi-locus phylogenetic analysis
    Yen, Yu-Hsiu
    Joseph, Julian
    Liu, Shang-Yin Vanson
    ZOOLOGICA SCRIPTA, 2025, 54 (01) : 17 - 32
  • [2] Multi-locus sequence typing of different clinically related Fusarium species
    al-Hatmi, A.
    Menken, S. B. J.
    Capella-Gutierrez, S.
    de Hoog, G. S.
    Stielow, J. B.
    MYCOSES, 2015, 58 : 25 - 25
  • [3] Identification of new Rathayibacter species using multi-locus sequence analysis
    Mauzey, S. J.
    Schneider, W. L.
    Luster, D. G.
    McClure, M. A.
    Murray, T. D.
    Schroeder, B. K.
    PHYTOPATHOLOGY, 2014, 104 (11) : 76 - 76
  • [4] Assessing the fit of the multi-species network coalescent to multi-locus data
    Cai, Ruoyi
    Ane, Cecile
    BIOINFORMATICS, 2021, 37 (05) : 634 - 641
  • [5] Using multi-locus sequence data for addressing species boundaries in commonly accepted lichen-forming fungal species
    Zhao, Xin
    Fernandez-Brime, Samantha
    Wedin, Mats
    Locke, Marissa
    Leavitt, Steven D.
    Lumbsch, H. Thorsten
    ORGANISMS DIVERSITY & EVOLUTION, 2017, 17 (02) : 351 - 363
  • [6] Using multi-locus sequence data for addressing species boundaries in commonly accepted lichen-forming fungal species
    Xin Zhao
    Samantha Fernández-Brime
    Mats Wedin
    Marissa Locke
    Steven D. Leavitt
    H. Thorsten Lumbsch
    Organisms Diversity & Evolution, 2017, 17 : 351 - 363
  • [7] Analytical Framework for Identifying and Differentiating Recent Hitchhiking and Severe Bottleneck Effects from Multi-Locus DNA Sequence Data
    Sargsyan, Ori
    PLOS ONE, 2012, 7 (05):
  • [8] Molecular phylogeny of the genus Arum (Araceae) inferred from multi-locus sequence data and AFLPs
    Linz, Jeanine
    Stoekl, Johannes
    Urru, Isabella
    Kruegel, Tamara
    Stensmyr, Marcus C.
    Hansson, Bill S.
    TAXON, 2010, 59 (02) : 405 - 415
  • [9] STRAIN: an R package for multi-locus sequence typing from whole genome sequencing data
    Dalsass, Mattia
    Bodini, Margherita
    Lambert, Christophe
    Mortier, Marie-Cecile
    Romanelli, Marco
    Medini, Duccio
    Muzzi, Alessandro
    Brozzi, Alessandro
    BMC BIOINFORMATICS, 2019, 20 (Suppl 9)
  • [10] Multi-locus sequence typing for surveillance of Campylobacter
    Sopwith, W.
    Fox, A.
    Regan, M.
    Syed, Q.
    Bolton, E.
    Birtles, A.
    Matthews, M.
    Gee, S.
    Osborn, K.
    INTERNATIONAL JOURNAL OF ANTIMICROBIAL AGENTS, 2007, 29 : S31 - S31