DISTANCE-BASED PHYLOGENETIC ALGORITHMS: NEW INSIGHTS AND APPLICATIONS

被引:5
|
作者
Pompei, S. [1 ,2 ]
Caglioti, E. [3 ]
Loreto, V. [1 ,2 ]
Tria, F. [2 ]
机构
[1] Univ Roma La Sapienza, Dipartimento Fis, I-00185 Rome, Italy
[2] ISI Fdn, I-10133 Turin, Italy
[3] Univ Roma La Sapienza, Dipartimento Matemat, I-00185 Rome, Italy
关键词
Phylogeny; distance-based methods; noise and horizontal transfer; trees; TREE; DYNAMICS;
D O I
10.1142/S0218202510004672
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Phylogenetic methods have recently been rediscovered in several interesting areas among which immunodynamics, epidemiology and many branches of evolutionary dynamics. In many interesting cases the reconstruction of a correct phylogeny is blurred by high mutation rates and/or horizontal transfer events. As a consequence, a divergence arises between the true evolutionary distances and the distances between pairs of taxa as inferred from the available data, making the phylogenetic reconstruction a challenging problem. Mathematically this divergence translates in the non-additivity of the actual distances between taxa and the quest for new algorithms able to efficiently cope with these effects is wide open. In distance-based reconstruction methods, two properties of additive distances were extensively exploited as antagonist criteria to drive phylogeny reconstruction: on the one hand a local property of quartets, i.e. sets of four taxa in a tree, the four-point condition; on the other hand, a recently proposed formula that allows to write the tree length as a function of the distances between taxa, the Pauplin's formula. A deeper comprehension of the effects of the non-additivity on the inspiring principles of the existing reconstruction algorithms is thus of paramount importance. In this paper we present a comparative analysis of the performances of the most important distance-based phylogenetic algorithms. We focus in particular on the dependence of their performances on two main sources of non-additivity: back-mutation processes and horizontal transfer processes. The comparison is carried out in the framework of a set of generative algorithms for phylogenies that incorporate non-additivity in a tunable way.
引用
收藏
页码:1511 / 1532
页数:22
相关论文
共 50 条
  • [31] Minimum reference network for temperature modeling through distance-based algorithms
    Mendoza, Helver Novoa
    Camero, Edwin Martinez
    Granell, Emilio
    Giraldo, Faber Danilo
    2022 XVLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2022), 2022,
  • [32] Converting non-parametric distance-based classification to anytime algorithms
    Xiaopeng Xi
    Ken Ueno
    Eamonn Keogh
    Dah-Jye Lee
    Pattern Analysis and Applications, 2008, 11 : 321 - 336
  • [33] On approximate algorithms for distance-based queries using R-trees
    Corral, A. (acorral@ual.es), 1600, Oxford University Press (48):
  • [34] How to Account for Reticulation Events in Phylogenetic Analysis: A Comparison of Distance-Based Methods
    F.-J Lapointe
    Journal of Classification, 2000, 17 : 175 - 184
  • [35] Relaxed neighbor joining: A fast distance-based phylogenetic tree construction method
    Evans, J
    Sheneman, L
    Foster, J
    JOURNAL OF MOLECULAR EVOLUTION, 2006, 62 (06) : 785 - 792
  • [36] On approximate algorithms for distance-based queries using R-trees
    Corral, A
    Vassilakopoulos, M
    COMPUTER JOURNAL, 2005, 48 (02): : 220 - 238
  • [37] Distance-based outlier queries in data streams: the novel task and algorithms
    Fabrizio Angiulli
    Fabio Fassetti
    Data Mining and Knowledge Discovery, 2010, 20 : 290 - 324
  • [38] Converting non-parametric distance-based classification to anytime algorithms
    Xi, Xiaopeng
    Ueno, Ken
    Keogh, Eamonn
    Lee, Dah-Jye
    PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) : 321 - 336
  • [39] Data transformation techniques for preserving privacy in distance-based mining algorithms
    Kadampur, Mohammad Ali
    Somayajulu, D. V. L. N.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (03) : 285 - 311
  • [40] Distance-based outlier queries in data streams: the novel task and algorithms
    Angiulli, Fabrizio
    Fassetti, Fabio
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (02) : 290 - 324