A compression method of double-array structures using linear functions

Cited by: 7
Authors
Kanda, Shunsuke [1]
Fuketa, Masao [1]
Morita, Kazuhiro [1]
Aoe, Jun-ichi [1]
Affiliations
[1] Univ Tokushima, Dept Informat Sci & Intelligent Syst, Minamijosanjima 2-1, Tokushima 7708506, Japan
Keywords
Trie; Double-array; Compression method; Information retrieval; ALGORITHM;
DOI
10.1007/s10115-015-0873-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A trie is a data structure for keyword search that is used in natural language processing, reserved-word lookup in compilers, and similar applications. The double-array and LOUDS are efficient representations of the trie. The double-array provides fast traversal in O(1) time, but it uses more space than LOUDS. LOUDS is a succinct data structure based on a bit string, and its space usage is extremely compact; however, its traversal is comparatively slow. This paper presents a new method that compresses the double-array while preserving its retrieval speed. The method divides the double-array into blocks and uses linear functions. Experimental results for varied keyword sets show that the new method reduced space usage of the double-array up to about 44%, and that its retrieval was 9-14 times faster than that of LOUDS. Moreover, the results show that, for a large keyword set, the new method was faster to construct than the conventional method.
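To make the two ideas in the abstract concrete, the following is a minimal sketch and not the authors' implementation. The first part builds a toy double-array (the usual BASE and CHECK arrays, where a transition from node `s` on character code `c` goes to `t = BASE[s] + c` and is valid only if `CHECK[t] == s`) and looks keys up with one array probe per character. The second part illustrates one plausible reading of "dividing into blocks and using linear functions": each block of values is stored as a straight line plus small integer residuals. The abstract does not specify the actual encoding, so the function names (`build`, `contains`, `encode_block`, `decode_block`, `compress`, `decompress`), the `#` terminator, the first-fit placement, and the endpoint-based line fit are all assumptions made for illustration.

```python
TERMINATOR = "#"  # hypothetical end-of-key marker, assumed absent from keys


def build(keys):
    """Naive first-fit construction of BASE/CHECK arrays (illustrative only)."""
    symbols = sorted({ch for k in keys for ch in k} | {TERMINATOR})
    code = {ch: i + 1 for i, ch in enumerate(symbols)}  # character codes start at 1
    base, check = [0], [-1]  # index 0 is the root; CHECK == -1 marks a free cell

    def ensure(n):
        # Grow both arrays so that index n exists.
        while len(base) <= n:
            base.append(0)
            check.append(-1)

    # Breadth-first over trie nodes: (node index, remaining suffixes below it).
    queue = [(0, sorted(k + TERMINATOR for k in set(keys)))]
    while queue:
        s, suffixes = queue.pop(0)
        groups = {}
        for suf in suffixes:
            if suf:
                groups.setdefault(suf[0], []).append(suf[1:])
        if not groups:
            continue  # leaf reached after the terminator
        codes = [code[ch] for ch in groups]
        b = 1
        while True:  # first BASE value whose child cells are all free
            ensure(b + max(codes))
            if all(check[b + c] == -1 for c in codes):
                break
            b += 1
        base[s] = b
        for ch, rest in groups.items():
            t = b + code[ch]
            check[t] = s
            queue.append((t, rest))
    return base, check, code


def contains(key, base, check, code):
    """Follow one BASE/CHECK probe per character, so each step is O(1)."""
    s = 0
    for ch in key + TERMINATOR:
        c = code.get(ch)
        if c is None:
            return False
        t = base[s] + c
        if t >= len(check) or check[t] != s:
            return False
        s = t
    return True


def encode_block(values):
    """Store a block as a line through its endpoints plus integer residuals."""
    n = len(values)
    first = values[0]
    slope = (values[-1] - first) / (n - 1) if n > 1 else 0.0
    residuals = [v - round(first + slope * i) for i, v in enumerate(values)]
    return first, slope, residuals


def decode_block(first, slope, residuals):
    """Invert encode_block exactly by re-evaluating the same line."""
    return [round(first + slope * i) + r for i, r in enumerate(residuals)]


def compress(values, block_size):
    """Split values into fixed-size blocks and encode each one separately."""
    return [encode_block(values[i:i + block_size])
            for i in range(0, len(values), block_size)]


def decompress(blocks):
    """Concatenate the decoded blocks back into the original list."""
    return [v for blk in blocks for v in decode_block(*blk)]
```

In this sketch, `build(["trie", "tree", "try"])` returns the arrays that `contains` walks, and `decompress(compress(base, 4))` reproduces `base` exactly. Any space saving depends on how closely the values in each block follow the fitted line, since only then do the residuals need fewer bits than the raw values.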
Pages: 55-80
Page count: 26
Source
Knowledge and Information Systems, 2016, 48: 55-80