Towards real-time suffix tree construction

被引:0
|
作者
Amir, Amihood [1 ]
Kopelowitz, Tsvi [1 ]
Lewenstein, Moshe [1 ]
Lewenstein, Noa [1 ]
机构
[1] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The quest for a real-time suffix tree construction algorithm is over three decades old. To date there is no convincing understandable solution to this problem. This paper makes a step in this direction by constructing a suffix tree online in time O(log n) per every single input symbol. Clearly, it is impossible to achieve better than O(log n) time per symbol in the compaxison model, therefore no true real time algorithm can exist for infinite alphabets. Nevertheless, the best that can be hoped for is that the construction time for every symbol does not exceed O(log n) (as opposed to an amortized O(log n) time per symbol, achieved by current known algorithms). To our knowledge, our algorithm is the first that spends in the worst case O(log n) per every single input symbol. We also provide a simple algorithm that constructs online an indexing structure (the BIS) in time O(log n) per input symbol, where n is the number of text symbols input thus far. This structure and fast LCP (Longest Common Prefix) queries on it, provide the backbone for the suffix tree construction. Together, our two data structures provide a searching algorithm for a pattern of length m whose time is O(min(m log vertical bar Sigma vertical bar, m + log n) + tocc), where tocc is the number of occurrences of the pattern.
引用
收藏
页码:67 / 78
页数:12
相关论文
共 50 条
  • [41] Towards Hard Real-Time Erlang
    Nicosia, Vincenzo
    ERLANG'07: PROCEEDINGS OF THE 2007 SIGPLAN ERLANG WORKSHOP, 2007, : 29 - 36
  • [42] Towards a robust real-time decoder
    Davenport, J
    Schwartz, R
    Nguyen, L
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 645 - 648
  • [43] TOWARDS REAL-TIME MOLECULAR DEMOLITION
    MADDOX, J
    NATURE, 1992, 360 (6400) : 103 - 103
  • [44] Towards distributed real-time intelligence
    Macleod, I.M.
    Lun, V.
    Annual Review of Automatic Programming, 1991, 16 (pt 1):
  • [45] Engineering a fast online persistent suffix tree construction
    Bedathur, SJ
    Haritsa, JR
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 720 - 731
  • [46] Parallel Construction of Succinct Representations of Suffix Tree Topologies
    Baier, Uwe
    Beller, Timo
    Ohlebusch, Enno
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2015), 2015, 9309 : 234 - 245
  • [47] Parallel Generalized Suffix Tree Construction for Genomic Data
    Al Aziz, Md Momin
    Thulasiraman, Parimala
    Mohammed, Noman
    ALGORITHMS FOR COMPUTATIONAL BIOLOGY (ALCOB 2020), 2020, 12099 : 3 - 15
  • [48] SPACE-ECONOMICAL SUFFIX TREE CONSTRUCTION ALGORITHM
    MCCREIGHT, EM
    JOURNAL OF THE ACM, 1976, 23 (02) : 262 - 272
  • [49] Real-time multicast tree visualization and monitoring
    Makofske, DB
    Almeroth, KC
    SOFTWARE-PRACTICE & EXPERIENCE, 2000, 30 (09): : 1047 - 1065