Constructing a tree from homeomorphic subtrees, with applications to computational evolutionary biology

被引:68
|
作者
Henzinger, MR
King, V
Warnow, T
机构
[1] Digital Equipment Corp, Syst Res Ctr, Palo Alto, CA 94301 USA
[2] Univ Victoria, Dept Comp Sci, Victoria, BC, Canada
[3] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
关键词
algorithms; data structures; evolutionary biology; theory of databases;
D O I
10.1007/PL00009268
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We are given a set T = {T-1, T-2,..., T-k} of rooted binary trees, each T-i leaf-labeled by a subset L(T-i) subset of {1,2,..., n). If T is a tree on {1,2,..., n}, we let T\L denote the minimal subtree of T induced by the nodes of L: and all their ancestors. The consensus tree problem asks whether there exists a tree T* such that, for every i, T*\L(T-i) is homeomorphic to T-i. We present algorithms which test if a given set of trees has a consensus tree and if so, construct one. The deterministic algorithm takes time mini O(Nn (1/2)), O(N + n(2) log n)}, where N = Sigma(i)\Ti\, and uses linear space. The randomized algorithm takes time O(N log(3) n) and uses linear space. The previous best for this problem was a 1981 O(Nn) algorithm by Aho et al. Our faster deterministic algorithm uses a new efficient algorithm for the following interesting dynamic graph problem: Given a graph G with n nodes and m edges and a sequence of b batches of one or more edge deletions, then, after each batch, either find a new component that has just been created or determine that there is no such component. For this problem, we have a simple algorithm with running time O(n(2) log n + b(o) min{n(2), m log n}), where b(o) is the number of batches which do not result in a new component. For our particular application, b(o)less than or equal to 1. If all edges are deleted, then the best previously known deterministic algorithm requires time O(m*root n) to solve this problem. We also present two applications of these consensus tree algorithms which solve other problems in computational evolutionary biology.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [31] From darwinism to evolutionary biology
    Kutschera, U.
    SCIENCE, 2008, 321 (5893) : 1157 - 1158
  • [32] Recovering a tree from the lengths of subtrees spanned by a randomly chosen sequence of leaves
    Evans, Steven N.
    Lanoue, Daniel
    ADVANCES IN APPLIED MATHEMATICS, 2018, 96 : 39 - 75
  • [33] A Tree of Trees: Using Campus Tree Diversity to Integrate Molecular, Organismal, and Evolutionary Biology
    Huang, Sophia
    Whittall, Justen B.
    AMERICAN BIOLOGY TEACHER, 2018, 80 (02): : 144 - 151
  • [34] Advances in systems biology: computational algorithms and applications
    Huang, Yufei
    Zhao, Zhongming
    Xu, Hua
    Shyr, Yu
    Zhang, Bing
    BMC SYSTEMS BIOLOGY, 2012, 6
  • [35] Workload characterization of modem computational biology applications
    Albayraktaroglu, K
    Franklin, M
    2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 435 - 440
  • [36] Computational methods in molecular biology: Theory and applications
    Tsakalidis, A.
    Makris, Ch.
    RECENT PROGRESS IN COMPUTATIONAL SCIENCES AND ENGINEERING, VOLS 7A AND 7B, 2006, 7A-B : 1055 - 1056
  • [37] Towards practical applications in quantum computational biology
    A. K. Fedorov
    M. S. Gelfand
    Nature Computational Science, 2021, 1 : 114 - 119
  • [38] Computational network biology: Data, models, and applications
    Liu, Chuang
    Ma, Yifang
    Zhao, Jing
    Nussinov, Ruth
    Zhang, Yi-Cheng
    Cheng, Feixiong
    Zhang, Zi-Ke
    PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2020, 846 : 1 - 66
  • [39] Towards practical applications in quantum computational biology
    Fedorov, A. K.
    Gelfand, M. S.
    NATURE COMPUTATIONAL SCIENCE, 2021, 1 (02): : 114 - 119
  • [40] Highlight: New Solutions and Open Questions in Computational Evolutionary Biology
    McGrath, Casey
    GENOME BIOLOGY AND EVOLUTION, 2019, 11 (11): : 3179 - 3180