Vestige: Maximum likelihood phylogenetic footprinting

被引:8
|
作者
Wakefield, MJ [1 ]
Maxwell, P
Huttley, GA
机构
[1] Australian Natl Univ, John Curtin Sch Med Res, Predict Med Grp, Canberra, ACT 0200, Australia
[2] Australian Natl Univ, John Curtin Sch Med Res, ARC Ctr Kangaroo Genom, Canberra, ACT 0200, Australia
[3] Australian Natl Univ, John Curtin Sch Med Res, Computat Genom Lab, Canberra, ACT 0200, Australia
[4] Australian Natl Univ, John Curtin Sch Med Res, Ctr Bioinformat Sci, Canberra, ACT 0200, Australia
关键词
D O I
10.1186/1471-2105-6-130
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Phylogenetic footprinting is the identification of functional regions of DNA by their evolutionary conservation. This is achieved by comparing orthologous regions from multiple species and identifying the DNA regions that have diverged less than neutral DNA. Vestige is a phylogenetic footprinting package built on the PyEvolve toolkit that uses probabilistic molecular evolutionary modelling to represent aspects of sequence evolution, including the conventional divergence measure employed by other footprinting approaches. In addition to measuring the divergence, Vestige allows the expansion of the definition of a phylogenetic footprint to include variation in the distribution of any molecular evolutionary processes. This is achieved by displaying the distribution of model parameters that represent partitions of molecular evolutionary substitutions. Examination of the spatial incidence of these effects across regions of the genome can identify DNA segments that differ in the nature of the evolutionary process. Results: Vestige was applied to a reference dataset of the SCL locus from four species and provided clear identification of the known conserved regions in this dataset. To demonstrate the flexibility to use diverse models of molecular evolution and dissect the nature of the evolutionary process Vestige was used to footprint the Ka/Ks ratio in primate BRCA1 with a codon model of evolution. Two regions of putative adaptive evolution were identified illustrating the ability of Vestige to represent the spatial distribution of distinct molecular evolutionary processes. Conclusion: Vestige provides a flexible, open platform for phylogenetic footprinting. Underpinned by the PyEvolve toolkit, Vestige provides a framework for visualising the signatures of evolutionary processes across the genome of numerous organisms simultaneously. By exploiting the maximum-likelihood statistical framework, the complex interplay between mutational processes, DNA repair and selection can be evaluated both spatially (along a sequence alignment) and temporally (for each branch of the tree) providing visual indicators to the attributes and functions of DNA sequences.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Vestige: Maximum likelihood phylogenetic footprinting
    Matthew J Wakefield
    Peter Maxwell
    Gavin A Huttley
    BMC Bioinformatics, 6
  • [2] Maximum likelihood of phylogenetic networks
    Jin, Guohua
    Nakhleh, Luay
    Snir, Sagi
    Tuller, Tamir
    BIOINFORMATICS, 2006, 22 (21) : 2604 - 2611
  • [3] PAML 4: Phylogenetic analysis by maximum likelihood
    Yang, Ziheng
    MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (08) : 1586 - 1591
  • [4] Phylogenetic footprinting
    Cliften, Paul F.
    YEAST GENE ANALYSIS, SECOND EDITION, 2007, 36 : 551 - +
  • [5] Upper bounds on maximum likelihood for phylogenetic trees
    Hendy, Michael D.
    Holland, Barbara R.
    BIOINFORMATICS, 2003, 19 : II66 - II72
  • [6] An investigation of irreproducibility in maximum likelihood phylogenetic inference
    Xing-Xing Shen
    Yuanning Li
    Chris Todd Hittinger
    Xue-xin Chen
    Antonis Rokas
    Nature Communications, 11
  • [7] An investigation of irreproducibility in maximum likelihood phylogenetic inference
    Shen, Xing-Xing
    Li, Yuanning
    Hittinger, Chris Todd
    Chen, Xue-xin
    Rokas, Antonis
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [8] Consistency of a phylogenetic tree maximum likelihood estimator
    RoyChoudhury, Arindam
    Willis, Amy
    Bunge, John
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2015, 161 : 73 - 80
  • [9] On the quirks of maximum parsimony and likelihood on phylogenetic networks
    Bryant, Christopher
    Fischer, Mareike
    Linz, Simone
    Semple, Charles
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 417 : 100 - 108
  • [10] GPU Accelerated Maximum Likelihood Analysis for Phylogenetic Inference
    Rajapaksa, Sandun
    Rasanjana, Wageesha
    Perera, Indika
    Meedeniya, Dulani
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 6 - 10