A compression method of double-array structures using linear functions

被引:7
|
作者
Kanda, Shunsuke [1 ]
Fuketa, Masao [1 ]
Morita, Kazuhiro [1 ]
Aoe, Jun-ichi [1 ]
机构
[1] Univ Tokushima, Dept Informat Sci & Intelligent Syst, Minamijosanjima 2-1, Tokushima 7708506, Japan
关键词
Trie; Double-array; Compression method; Information retrieval; ALGORITHM;
D O I
10.1007/s10115-015-0873-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A trie is one of the data structures for keyword search algorithms and is utilized in natural language processing, reserved words search for compilers and so on. The double-array and LOUDS are efficient representation methods for the trie. The double-array provides fast traversal at time complexity of O(1), but the space usage of the double-array is larger than that of LOUDS. LOUDS is a succinct data structure with bit-string, and its space usage is extremely compact. However, its traversal speed is not so fast. This paper presents a new compression method of the double-array with keeping the retrieval speed. Our new method compresses the double-array by dividing the double-array into blocks and by using linear functions. Experimental results for varied keywords show that our new method reduced space usage of the double-array up to about 44 %, and the retrieval speed of the new method was 9-14 times faster than that of LOUDS. Moreover, the results show that the construction speed of the new method was faster than that of the conventional method for a large keyword set.
引用
收藏
页码:55 / 80
页数:26
相关论文
共 50 条
  • [41] Analytical solution for design parameters of model box to simulate seismic spatial variability effect using double-array shaking tables
    Yu H.-T.
    Chen X.-Z.
    Li P.
    Yantu Gongcheng Xuebao/Chinese Journal of Geotechnical Engineering, 2020, 42 (08): : 1428 - 1434
  • [42] On Distributed Compression of Linear Functions
    Wagner, Aaron B.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (01) : 79 - 94
  • [43] On Distributed Compression of Linear Functions
    Wagner, Aaron B.
    2008 46TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1-3, 2008, : 1546 - 1553
  • [44] ECG compression method using Lorentzian functions model
    Ouamri, Abdelaziz
    Nait-Ali, Amine
    DIGITAL SIGNAL PROCESSING, 2007, 17 (01) : 319 - 326
  • [45] Identification method using the combination of linear shape functions for the monitoring of dynamic ice load on ship structures
    Kong S.
    Tian Y.
    Cui H.
    Ji S.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2022, 41 (14): : 226 - 232
  • [46] BEARING ESTIMATION USING DOUBLE FREQUENCY REASSIGNMENT FOR A LINEAR PASSIVE ARRAY
    Czarnecki, Krzysztof
    Lesniak, Wojciech
    POLISH MARITIME RESEARCH, 2017, 24 (03) : 26 - 35
  • [47] Vibration-Enhanced Heat Transfer of Double-Array Helical Elastic Tube Bundle Heat Exchanger
    Ji, Jiadong
    Li, Feiyang
    Shi, Baojun
    Gao, Runmiao
    Zhang, Jingwei
    Deng, Xu
    JOURNAL OF THERMOPHYSICS AND HEAT TRANSFER, 2022, 36 (02) : 351 - 357
  • [48] Linear structures of Boolean functions
    Dubuc, S
    1998 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY - PROCEEDINGS, 1998, : 440 - 440
  • [49] Ultra-Low Velocity Zones Beneath the Southern Hemisphere Imaged With Double-Array Stacking of PcP Waveforms
    Agboola, Kayode J.
    Hansen, Samantha E.
    Garnero, Edward J.
    Rost, Sebastian
    Li, Mingming
    Shim, Sang-Heon
    JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2024, 129 (04)
  • [50] PHASE FIELD CRYSTAL SIMULATION OF STRESS-INDUCED ANNIHILATION OF SUB-GRAIN BOUNDARY WITH DOUBLE-ARRAY DISLOCATION
    Yang Tao
    Chen Zheng
    Dong Weiping
    ACTA METALLURGICA SINICA, 2011, 47 (10) : 1301 - 1306