A Table Compression Method for Extended Aho-Corasick Automaton

Cited by: 0
Authors
Liu, Yanbing [1]
Yang, Yifu [1]
Liu, Ping [1]
Tan, Jianlong [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
Source
IMPLEMENTATION AND APPLICATION OF AUTOMATA, PROCEEDINGS | 2009 / Vol. 5642
Keywords
EFFICIENT IMPLEMENTATION; SPARSE TABLE
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The Aho-Corasick algorithm is a classic method for matching a set of strings. However, the huge memory usage of the Aho-Corasick automaton prevents it from being applied to large-scale pattern sets. Here we present a simple but efficient table compression method to reduce the automaton's space. The basic idea of our method is equivalent-row elimination, which groups state rows into equivalence classes and eliminates the duplicates. Experiments demonstrate that the proposed method significantly reduces memory usage while still running in linear search time, comparable to that of the extended Aho-Corasick algorithm. Our method provides a good trade-off between memory usage and search time.
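The row-elimination idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how rows are grouped, so this version only merges rows of the full transition table that are exactly identical, and all function names (`build_dfa`, `compress_rows`, `search`) are hypothetical. Each state keeps an index into a list of unique rows, so a lookup costs one extra array access.

```python
from collections import deque


def build_dfa(patterns, alphabet):
    """Build a full Aho-Corasick transition table: one complete row per state."""
    sym = {c: i for i, c in enumerate(alphabet)}
    table = [[-1] * len(alphabet)]  # -1 marks a missing trie edge
    out = [set()]                   # patterns recognised on entering each state
    for p in patterns:
        s = 0
        for c in p:
            j = sym[c]
            if table[s][j] == -1:
                table.append([-1] * len(alphabet))
                out.append(set())
                table[s][j] = len(table) - 1
            s = table[s][j]
        out[s].add(p)

    fail = [0] * len(table)
    queue = deque()
    for j in range(len(alphabet)):
        t = table[0][j]
        if t == -1:
            table[0][j] = 0         # the root loops on unmatched symbols
        else:
            queue.append(t)
    while queue:                    # breadth-first, so shallower rows are complete
        s = queue.popleft()
        for j in range(len(alphabet)):
            t = table[s][j]
            if t == -1:
                table[s][j] = table[fail[s]][j]  # inherit from the failure state
            else:
                fail[t] = table[fail[s]][j]
                out[t] |= out[fail[t]]
                queue.append(t)
    return table, out, sym


def compress_rows(table):
    """Store each distinct row once and map every state to its row index."""
    unique_rows, row_of_state, seen = [], [], {}
    for row in table:
        key = tuple(row)
        if key not in seen:
            seen[key] = len(unique_rows)
            unique_rows.append(key)
        row_of_state.append(seen[key])
    return unique_rows, row_of_state


def search(text, unique_rows, row_of_state, out, sym):
    """Scan text once; each step is one indirection plus one table lookup."""
    s, hits = 0, []
    for i, c in enumerate(text):
        s = unique_rows[row_of_state[s]][sym[c]]
        for p in out[s]:
            hits.append((i - len(p) + 1, p))
    return hits
```

Calling `build_dfa(["he", "she", "his", "hers"], "ehirsu")` and then `compress_rows` on the resulting table yields fewer stored rows than states, because states without outgoing trie edges inherit their entire row from their failure state. The search loop still performs a constant amount of work per input character.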
Pages: 84-93
Page count: 10