A weighted least-squares approach for inferring phylogenies from incomplete distance matrices

被引:44
|
作者
Makarenkov, V
Lapointe, FJ
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
[2] Inst Control Sci, Moscow 117806, Russia
[3] Univ Montreal, Dept Sci Biol, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1093/bioinformatics/bth211
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The problem of phylogenetic inference from datasets including incomplete or uncertain entries is among the most relevant issues in systematic biology. In this paper, we propose a new method for reconstructing phylogenetic trees from partial distance matrices. The new method combines the usage of the four-point condition and the ultrametric inequality with a weighted least-squares approximation to solve the problem of missing entries. It can be applied to infer phylogenies from evolutionary data including some missing or uncertain information, for instance, when observed nucleotide or protein sequences contain gaps or missing entries. Results: In a number of simulations involving incomplete datasets, the proposed method outperformed the well-known Ultrametric and Additive procedures. Generally, the new method also outperformed all the other competing approaches including Triangle and Fitch which is the most popular least-squares method for reconstructing phylogenies. We illustrate the usefulness of the introduced method by analyzing two well-known phylogenies derived from complete mammalian mtDNA sequences. Some interesting theoretical results concerning the NP-hardness of the ordinary and weighted least-squares fitting of a phylogenetic tree to a partial distance matrix are also established.
引用
收藏
页码:2113 / 2121
页数:9
相关论文
共 50 条
  • [21] WEIGHTED LEAST-SQUARES RANK ESTIMATES
    CHENG, KS
    HETTMANSPERGER, TP
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1983, 12 (09) : 1069 - 1086
  • [22] Weighted least-squares blind deconvolution
    Fiori, S
    Piazza, F
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2507 - 2510
  • [23] BOOSTED OPTIMAL WEIGHTED LEAST-SQUARES
    Haberstich, Cecile
    Nouy, Anthony
    Perrin, Guillaume
    MATHEMATICS OF COMPUTATION, 2022, 91 (335) : 1281 - 1315
  • [24] USE OF THE METHOD OF GENERALIZED LEAST-SQUARES IN RECONSTRUCTING PHYLOGENIES FROM SEQUENCE DATA
    BULMER, M
    MOLECULAR BIOLOGY AND EVOLUTION, 1991, 8 (06) : 868 - 883
  • [25] Least-squares migration of incomplete reflection data
    Nemeth, T
    Wu, CJ
    Schuster, GT
    GEOPHYSICS, 1999, 64 (01) : 208 - 221
  • [26] On Weighted Sum Decompositions of Weighted Least-squares Estimators
    Li, Xiaobin
    Liu, Yonghui
    PROCEEDINGS OF THE THIRD INTERNATIONAL WORKSHOP ON APPLIED MATRIX THEORY, 2009, : 116 - 118
  • [27] Least-squares migration of incomplete reflection data
    Chevron Petroleum Technology Co., 1300 Beach Boulevard, La Habra
    CA
    90631, United States
    不详
    BC
    V6V 2R9, Canada
    不详
    UT
    84112, United States
    Geophysics, 1 (208-221):
  • [28] Update of parameter matrices of the dynamic system using the least-squares approach
    Lee, Eun-Taik
    Eun, Hee-Chang
    ADVANCES IN MECHANICAL ENGINEERING, 2019, 11 (11)
  • [29] A weighted least-squares approach for B-spline shape representation
    Hwang, Yu-Cheng
    Shieh, Tzon-Liang
    Lan, Leu-Shing
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 894 - 898
  • [30] JACKKNIFING OF WEIGHTED TREES - VALIDATION OF PHYLOGENIES RECONSTRUCTED FROM DISTANCE MATRICES
    LAPOINTE, FJ
    KIRSCH, JAW
    BLEIWEISS, R
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 1994, 3 (03) : 256 - 267