Efficient dictionary matching by Aho-Corasick automata of truncated patterns

被引:0
|
作者
Zhang, Meng [1 ]
Fan, Jiashu [1 ]
Chen, Dequan [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
algorithm; dictionary matching; Aho-Corasick automaton;
D O I
10.1504/IJCSM.2016.078738
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We present a space-efficient data structure for dictionary matching. We truncate patterns to truncated patterns where symbols are l-length substrings of the pattern. By employing the AC automaton of truncated patterns and that of l-length substrings, we simulate the AC automaton of the original pattern set. The new structure is space economical as we apply the prefix merging to substrings of patterns. Using this structure, the dictionary matching runs in O(n log k + tocc log k + occ) time where n is the length of the text, k the number of patterns, occ the number of occurrences of patterns in the text, and tocc the number of occurrences of strings that are longest prefix of each pattern with length of a multiple of l.
引用
收藏
页码:323 / 329
页数:7
相关论文
共 50 条
  • [1] Efficient implementation of Aho-Corasick pattern matching automata using Unicode
    Nieminen, Janne
    Kilpelainen, Pekka
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (06): : 669 - 690
  • [2] Highly Compressed Aho-Corasick Automata For Efficient Intrusion Detection
    Zha, Xinyan
    Sahni, Sartaj
    2008 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1-3, 2008, : 1 - 6
  • [3] Dictionary Matching: Review of the Aho-Corasick Algorithm and Vision for Large Dictionaries
    Qiao ZhanPeng
    Goto, Kento
    Ohshima, Takuya
    Tajima, Masahiro
    Motomichi, Toyama
    ICIST '18: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES, 2018,
  • [4] AUGMENTING THE AHO-CORASICK PATTERN-MATCHING MACHINE
    SRIDHAR, MA
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1990, 32 (3-4) : 149 - 153
  • [5] Pattern Matching in YARA: Improved Aho-Corasick Algorithm
    Regeciova, Dominika
    Kolar, Dusan
    Milkovic, Marek
    IEEE ACCESS, 2021, 9 (09): : 62857 - 62866
  • [7] Engineering faster double-array Aho-Corasick automata
    Kanda, Shunsuke
    Akabe, Koichi
    Oda, Yusuke
    SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (06): : 1332 - 1361
  • [8] On the bit-parallel simulation of the nondeterministic Aho-Corasick and suffix automata for a set of patterns
    Cantone, Domenico
    Faro, Simone
    Giaquinta, Emanuele
    JOURNAL OF DISCRETE ALGORITHMS, 2012, 11 (01) : 25 - 36
  • [9] Optimized Aho-Corasick String Matching Algorithm for Smart Phones
    Lu, Rui
    Pao, Derek
    2016 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2016, : 342 - 343
  • [10] Memory Efficient Parallelization for Aho-Corasick Algorithm on a GPU
    Nhat-Phuong Tran
    Lee, Myungho
    Hong, Sugwon
    Shin, Minho
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 432 - 438