A Table Compression Method for Extended Aho-Corasick Automaton

Cited by: 0
Authors
Liu, Yanbing [1]
Yang, Yifu [1]
Liu, Ping [1]
Tan, Jianlong [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
Source
IMPLEMENTATION AND APPLICATION OF AUTOMATA, PROCEEDINGS | 2009 / Vol. 5642
Keywords
EFFICIENT IMPLEMENTATION; SPARSE TABLE
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
The Aho-Corasick algorithm is a classic method for matching a set of strings. However, the huge memory usage of the Aho-Corasick automaton prevents it from being applied to large-scale pattern sets. Here we present a simple but efficient table compression method to reduce the automaton's space. The basic idea of our method is equivalent-row elimination, which groups state rows into equivalence classes and eliminates the duplicates. Experiments demonstrate that the proposed method significantly reduces memory usage while still running in linear search time comparable to that of the extended Aho-Corasick algorithm. Our method provides a good trade-off between memory usage and search time.
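The row-elimination idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how rows are compared or stored, so the sketch assumes that two state rows are equivalent only when they are exactly equal, and that the automaton is given as a dense table where `table[state][symbol]` is the next state. The names `compress_rows`, `next_state`, `row_index`, and `unique_rows` are hypothetical.

```python
from typing import Dict, List, Tuple


def compress_rows(table: List[List[int]]) -> Tuple[List[int], List[List[int]]]:
    """Group identical rows of a dense transition table (assumed criterion).

    Returns (row_index, unique_rows) such that, for every state s,
    table[s] == unique_rows[row_index[s]].
    """
    seen: Dict[Tuple[int, ...], int] = {}   # row contents -> class id
    row_index: List[int] = []               # state -> class id
    unique_rows: List[List[int]] = []       # one stored row per class
    for row in table:
        key = tuple(row)
        if key not in seen:
            seen[key] = len(unique_rows)
            unique_rows.append(list(row))
        row_index.append(seen[key])
    return row_index, unique_rows


def next_state(row_index: List[int], unique_rows: List[List[int]],
               state: int, symbol: int) -> int:
    """One extra indirection per transition; lookup stays constant time."""
    return unique_rows[row_index[state]][symbol]


# Toy table: 4 states over a 3-symbol alphabet; states 0/2 and 1/3 share rows.
toy_table = [
    [1, 0, 0],
    [1, 2, 0],
    [1, 0, 0],
    [1, 2, 0],
]
toy_index, toy_rows = compress_rows(toy_table)
```

Each transition costs one additional array lookup, so scanning a text remains linear in its length, while only one copy of each distinct row is stored. How much memory this saves depends entirely on how many rows coincide in a given automaton.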
Pages: 84-93
Page count: 10
Related papers
50 in total
  • [41] An Optimized Parallel Failure-less Aho-Corasick Algorithm for DNA Sequence Matching
    Thambawita, V. L. B.
    Ragel, Roshan G.
    Elkaduwe, Dhammike
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS): INTEROPERABLE SUSTAINABLE SMART SYSTEMS FOR NEXT GENERATION, 2016
  • [42] Hardware Software Co-design of the Aho-Corasick Algorithm: Scalable for Protein Identification?
    Vidanagamachchi, S. M.
    Dewasurendra, S. D.
    Ragel, R. G.
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 321 - 325
  • [43] A Probability Model Chinese Word Segmentation Algorithm Based on Aho-Corasick Automata Algorithm
    Xu Y.-B.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2017, 46 (02): : 426 - 433
  • [44] A Memory-Efficient Pipelined Implementation of the Aho-Corasick String-Matching Algorithm
    Pao, Derek
    Lin, Wei
    Liu, Bin
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2010, 7 (02)
  • [45] A memory-efficient reconfigurable Aho-Corasick FSM implementation for intrusion detection systems
    Dimopoulos, Vassilis
    Papaefstathiou, Ioannis
    Pnevmatikatos, Dionisios
    IC-SAMOS: 2007 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION, PROCEEDINGS, 2007, : 186 - +
  • [46] An Efficient Multicharacter Transition String-Matching Engine Based on the Aho-Corasick Algorithm
    Chen, Chien-Chi
    Wang, Sheng-De
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2013, 10 (04)
  • [47] All-pairs suffix/prefix in optimal time using Aho-Corasick space
    Loukides, Grigorios
    Pissis, Solon P.
    INFORMATION PROCESSING LETTERS, 2022, 178
  • [48] A MULTI-CHARACTER TRANSITION STRING MATCHING ARCHITECTURE BASED ON AHO-CORASICK ALGORITHM
    Chen, Chien-Chi
    Wang, Sheng-De
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (12): : 8367 - 8386
  • [49] Multiple-pattern matching in LZW compressed files using Aho-Corasick algorithm
    Tao, T
    Mukherjee, A
    DCC 2005: Data Compression Conference, Proceedings, 2005, : 482 - 482
  • [50] A failureless pipelined Aho-Corasick algorithm for FPGA-based parallel string matching engine
    Kim, HyunJin
    Lecture Notes in Electrical Engineering, 2015, 339 : 157 - 164