The Adaptive Suffix Tree: A Space Efficient Sequence Learning Algorithm

Cited by: 0
Authors
Gunasinghe, Upuli [1 ]
Alahakoon, Damminda [2 ]
Affiliations
[1] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[2] Deakin Univ, Sch Informat Syst, Geelong, Vic 3217, Australia
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Adaptive Suffix Trie algorithm was previously proposed by the authors as a sequence learning algorithm for capturing frequent subsequences of variable length. This algorithm builds up a suffix trie data structure, capturing repetitive patterns in a given set of sequences. Its application has been demonstrated in bioinformatics and text clustering. Suffix trees are the space-efficient variant of suffix tries and are thus more widely used in the current literature. In this paper we propose the Adaptive Suffix Tree algorithm, which is based on the same learning principles as the Adaptive Suffix Trie but is more space efficient. We discuss the new algorithm in detail and demonstrate that the proposed algorithm can learn the same set of subsequences while using less than 50% of the space used by its predecessor.
Pages: 8
Related papers
50 in total
  • [21] A space efficient algorithm for sequence alignment with inversions
    Gao, Y
    Wu, JF
    Niewiadomski, R
    Wang, Y
    Chen, ZZ
    Lin, GH
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2003, 2697 : 57 - 67
  • [22] An Efficient Algorithm for Approximating Geodesic Distances in Tree Space
    Battagliero, Simone
    Puglia, Giuseppe
    Vicario, Saverio
    Rubino, Francesco
    Scioscia, Gaetano
    Leo, Pietro
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (05) : 1196 - 1207
  • [23] Sparse Suffix Tree Construction in Optimal Time and Space
    Gawrychowski, Pawel
    Kociumaka, Tomasz
    PROCEEDINGS OF THE TWENTY-EIGHTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2017, : 425 - 439
  • [24] Search Results Clustering Algorithm based on the Suffix Tree
    Wang, Dengwei
    Liu, Libo
    Dong, Jing
    Zheng, Jiao
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 456 - 460
  • [25] WordNet-Based Suffix Tree Clustering Algorithm
    Dang, Qiuyue
    Zhang, Jiwei
    Lu, Yueming
    Zhang, Kuo
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMPUTER APPLICATIONS (ICSA 2013), 2013, 92 : 66 - 74
  • [26] Improving Suffix Tree Clustering Algorithm for Web Documents
    Zhuang, Yan
    Chen, Youguang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 1557 - 1561
  • [27] A new cluster merging algorithm of Suffix Tree Clustering
    Wang, Jianhua
    Li, Ruixu
    INTELLIGENT INFORMATION PROCESSING III, 2006, 228 : 197 - +
  • [28] Fast sequence clustering using a suffix array algorithm
    Malde, K
    Coward, E
    Jonassen, I
    BIOINFORMATICS, 2003, 19 (10) : 1221 - 1226
  • [29] Fast and space efficient linear suffix array construction
    Zhang, Sen
    Nong, Ge
    DCC: 2008 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, : 553 - 553
  • [30] Space efficient linear time construction of suffix arrays
    Ko, P
    Aluru, S
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 200 - 210