THE LONGEST COMMON SUBSEQUENCE PROBLEM FOR SMALL ALPHABET SIZE BETWEEN MANY STRINGS

被引:0
|
作者
HAKATA, K
IMAI, H
机构
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given two or more strings (for example, DNA and amino acid sequences), the longest common subsequence (LCS) problem is to determine the longest common subsequence obtained by deleting zero or more symbols from each string. The algorithms for computing an LCS between two strings were given by many papers, but there is no efficient algorithm for computing an LCS between more than two strings. This paper proposes a method for computing efficiently the LCS between three or more strings of small alphabet size. Specifically, our algorithm computes the LCS of d(greater-than-or-equal-to 3) strings of length n on alphabet of size s in O(nsd + Dsd(log(d-3) n + log(d-2) s)) time, where D is the number of dominant matches and is much smaller than n(d). Through computational experiments, we demonstrate the effectiveness of our algorithm.
引用
收藏
页码:469 / 478
页数:10
相关论文
共 50 条
  • [1] The constrained longest common subsequence problem for degenerate strings
    Iliopoulos, Costas
    Rahman, M. Sohel
    Voracek, Michal
    Vagner, Ladislav
    IMPLEMENTATION AND APPLICATION OF AUTOMATA, 2007, 4783 : 309 - +
  • [2] Longest common subsequence problem for unoriented and cyclic strings
    Nicolas, Francois
    Rivals, Eric
    THEORETICAL COMPUTER SCIENCE, 2007, 370 (1-3) : 1 - 18
  • [3] Longest Common Subsequence Problem for Run-Length-Encoded Strings
    Ahsan, Shegufta Bakht
    Aziz, Syeda Persia
    Rahmana, M. Sohel
    JOURNAL OF COMPUTERS, 2014, 9 (08) : 1769 - 1775
  • [4] Longest Common Subsequence Problem for Run-Length-Encoded Strings
    Ahsan, Shegufta Bakht
    Aziz, Syeda Persia
    Rahman, M. Sohel
    2012 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2012, : 36 - 41
  • [5] A Fast On-Line Algorithm for the Longest Common Subsequence Problem with Constant Alphabet
    Sakai, Yoshifumi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (01): : 354 - 361
  • [6] COMPUTING A LONGEST COMMON SUBSEQUENCE FOR A SET OF STRINGS
    HSU, WJ
    DU, MW
    BIT, 1984, 24 (01): : 45 - 59
  • [7] Algorithms for the longest common subsequence problem for multiple strings based on geometric maxima
    Hakata, K
    Imai, H
    OPTIMIZATION METHODS & SOFTWARE, 1998, 10 (02): : 233 - 260
  • [8] Algorithms for the longest common subsequence problem for multiple strings based on geometric maxima
    Hakata, Koji
    Imai, Hiroshi
    Optimization Methods and Software, 10 (02): : 233 - 260
  • [9] A Fast Longest Common Subsequence Algorithm for Similar Strings
    Arslan, Abdullah N.
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, 2010, 6031 : 82 - 93
  • [10] ALGORITHMS FOR LONGEST COMMON SUBSEQUENCE PROBLEM
    HIRSCHBERG, DS
    JOURNAL OF THE ACM, 1977, 24 (04) : 664 - 675