A Table Compression Method for Extended Aho-Corasick Automaton

被引:0
|
作者
Liu, Yanbing [1 ]
Yang, Yifu [1 ]
Liu, Ping [1 ]
Tan, Jianlong [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
来源
IMPLEMENTATION AND APPLICATION OF AUTOMATA, PROCEEDINGS | 2009年 / 5642卷
关键词
EFFICIENT IMPLEMENTATION; SPARSE TABLE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Aho-Corasick algorithm is a classic method for matching a set of strings. However, the huge memory usage of Aho-Corasick antomaton prevents it from being applied to large-scale pattern sets. Here we present a simple but efficient table compression method to reduce the automaton's space. The basic idea of our method is based on equivalent rows elimination, which groups state rows into equivalent classes and eliminates the duplicates. Experiments demonstrate that the proposed method significantly reduces the memory usage and still runs at linear searching time comparable to that of extended Aho-Corasick algorithm. Our method provides good trade-off between memory usage and searching time.
引用
收藏
页码:84 / 93
页数:10
相关论文
共 50 条
  • [21] Efficient dictionary matching by Aho-Corasick automata of truncated patterns
    Zhang, Meng
    Fan, Jiashu
    Chen, Dequan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2016, 7 (04) : 323 - 329
  • [22] Auto implementation of parallel hardware architecture for Aho-Corasick algorithm
    M. Najam-ul-Islam
    Fatima Tu Zahra
    Atif Raza Jafri
    Roman Shah
    Masood ul Hassan
    Muhammad Rashid
    Design Automation for Embedded Systems, 2022, 26 : 29 - 53
  • [23] Space-Time Tradeoff in the Aho-Corasick String Matching Algorithm
    Xu, Yisi
    Pao, Derek
    2015 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2015, : 713 - 714
  • [24] String Matching with Multicore CPUs: Performing Better with the Aho-Corasick Algorithm
    Arudchutha, S.
    Nishanthy, T.
    Ragel, R. G.
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 231 - 236
  • [25] Dictionary Matching: Review of the Aho-Corasick Algorithm and Vision for Large Dictionaries
    Qiao ZhanPeng
    Goto, Kento
    Ohshima, Takuya
    Tajima, Masahiro
    Motomichi, Toyama
    ICIST '18: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES, 2018,
  • [26] Efficient implementation of Aho-Corasick pattern matching automata using Unicode
    Nieminen, Janne
    Kilpelainen, Pekka
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (06): : 669 - 690
  • [27] 基于Aho-Corasick算法的多模式匹配算法研究
    王培凤
    李莉
    计算机应用研究, 2011, 28 (04) : 1251 - 1253+1259
  • [28] Memory-Based Architecture for Multicharacter Aho-Corasick String Matching
    Wang, Xing
    Pao, Derek
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (01) : 143 - 154
  • [29] Generalized Aho-Corasick algorithm for signature based anti-virus applications
    Lee, Tsern-Huei
    PROCEEDINGS - 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, VOLS 1-3, 2007, : 792 - 797
  • [30] Construction of Aho Corasick automaton in linear time for integer alphabets
    Dori, S
    Landau, GM
    INFORMATION PROCESSING LETTERS, 2006, 98 (02) : 66 - 72