Efficient dictionary matching by Aho-Corasick automata of truncated patterns

被引:0
|
作者
Zhang, Meng [1 ]
Fan, Jiashu [1 ]
Chen, Dequan [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
algorithm; dictionary matching; Aho-Corasick automaton;
D O I
10.1504/IJCSM.2016.078738
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We present a space-efficient data structure for dictionary matching. We truncate patterns to truncated patterns where symbols are l-length substrings of the pattern. By employing the AC automaton of truncated patterns and that of l-length substrings, we simulate the AC automaton of the original pattern set. The new structure is space economical as we apply the prefix merging to substrings of patterns. Using this structure, the dictionary matching runs in O(n log k + tocc log k + occ) time where n is the length of the text, k the number of patterns, occ the number of occurrences of patterns in the text, and tocc the number of occurrences of strings that are longest prefix of each pattern with length of a multiple of l.
引用
收藏
页码:323 / 329
页数:7
相关论文
共 50 条
  • [31] A File Undelete with Aho-Corasick Algorithm In File Recovery
    Sitompul, Opim Salim
    Handoko, Andrew
    Rahmat, Romi Fadillah
    2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 427 - 431
  • [32] Fast Document Indexing Using Aho-Corasick State Machine
    Singh, Ramanpreet
    Mohaar, Gurpreet Singh
    PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 469 - 475
  • [33] A failureless pipelined Aho-Corasick algorithm for FPGA-based parallel string matching engine
    Kim, HyunJin
    Lecture Notes in Electrical Engineering, 2015, 339 : 157 - 164
  • [34] Auto implementation of parallel hardware architecture for Aho-Corasick algorithm
    Najam-ul-Islam, M.
    Zahra, Fatima Tu
    Jafri, Atif Raza
    Shah, Roman
    ul Hassan, Masood
    Rashid, Muhammad
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2022, 26 (01) : 29 - 53
  • [35] Node Configuration for the Aho-Corasick Algorithm in Intrusion Detection Systems
    Lacroix, Alexsandre B.
    Langlois, J. M. Pierre
    Boyer, Francois-Raymond
    Gosselin, Antoine
    Bois, Guy
    PROCEEDINGS OF THE 2016 SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'16), 2016, : 121 - 122
  • [36] A Memory Accessing Method for the Parallel Aho-Corasick Algorithm on GPU
    Yoon, JinMyung
    Choi, Kang-Il
    Kim, HyunJin
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SECURITY (ICISS), 2014, : 116 - 118
  • [37] A memory-efficient reconfigurable Aho-Corasick FSM implementation for intrusion detection systems
    Dimopoulos, Vassilis
    Papaefstathiou, Ioannis
    Pnevmatikatos, Dionisios
    IC-SAMOS: 2007 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION, PROCEEDINGS, 2007, : 186 - +
  • [38] Auto implementation of parallel hardware architecture for Aho-Corasick algorithm
    M. Najam-ul-Islam
    Fatima Tu Zahra
    Atif Raza Jafri
    Roman Shah
    Masood ul Hassan
    Muhammad Rashid
    Design Automation for Embedded Systems, 2022, 26 : 29 - 53
  • [39] 基于Aho-Corasick算法的多模式匹配算法研究
    王培凤
    李莉
    计算机应用研究, 2011, 28 (04) : 1251 - 1253+1259
  • [40] Generalized Aho-Corasick algorithm for signature based anti-virus applications
    Lee, Tsern-Huei
    PROCEEDINGS - 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, VOLS 1-3, 2007, : 792 - 797