Efficient dictionary matching by Aho-Corasick automata of truncated patterns

被引:0
|
作者
Zhang, Meng [1 ]
Fan, Jiashu [1 ]
Chen, Dequan [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
algorithm; dictionary matching; Aho-Corasick automaton;
D O I
10.1504/IJCSM.2016.078738
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We present a space-efficient data structure for dictionary matching. We truncate patterns to truncated patterns where symbols are l-length substrings of the pattern. By employing the AC automaton of truncated patterns and that of l-length substrings, we simulate the AC automaton of the original pattern set. The new structure is space economical as we apply the prefix merging to substrings of patterns. Using this structure, the dictionary matching runs in O(n log k + tocc log k + occ) time where n is the length of the text, k the number of patterns, occ the number of occurrences of patterns in the text, and tocc the number of occurrences of strings that are longest prefix of each pattern with length of a multiple of l.
引用
收藏
页码:323 / 329
页数:7
相关论文
共 50 条
  • [41] SIMD IMPLEMENTATION OF THE AHO-CORASICK ALGORITHM USING INTEL AVX2
    Lazhar, Ourlis
    Djamel, Bellala
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2019, 20 (03): : 563 - 576
  • [42] A Secure Live Signature Verification with Aho-Corasick Histogram Algorithm for Mobile Smart Pad
    Tseng, Kuo-Kun
    Chen, He
    Chen, Charles
    Bansong, Charinrat
    ELECTRONICS, 2021, 10 (11)
  • [43] 面向入侵检测的Aho-Corasick算法内存消耗研究
    张雪松
    田宏
    辽宁石油化工大学学报, 2008, 28 (01) : 66 - 69
  • [44] Hybrid Compression of the Aho-Corasick Automaton for Static Analysis in Intrusion Detection Systems
    Pungila, Ciprian
    INTERNATIONAL JOINT CONFERENCE CISIS'12 - ICEUTE'12 - SOCO'12 SPECIAL SESSIONS, 2013, 189 : 77 - 86
  • [45] Enhanced Cybercrime Detection on Twitter Using Aho-Corasick Algorithm and Machine Learning Techniques
    Rawat, Romil
    Raj, A. Samson Arun
    Chakrawarti, Rajesh Kumar
    Sankaran, Krishnan Sakthidasan
    Sarangi, Sanjaya Kumar
    Rawat, Hitesh
    Rawat, Anjali
    Informatica (Slovenia), 2024, 48 (18): : 97 - 108
  • [46] A Parallel Aho-Corasick Algorithm with Non-deterministic Finite Automaton Based on OpenMP
    Qu, Jiaxing
    Zhang, Guoyin
    Fang, Zhou
    Liu, Jiahui
    Liu, Xinyu
    Li, Fangzhou
    2015 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION AND NETWORKING (ACN), 2015, : 52 - 55
  • [47] Hardware Software Co-design of the Aho-Corasick Algorithm: Scalable for Protein Identification?
    Vidanagamachchi, S. M.
    Dewasurendra, S. D.
    Ragel, R. G.
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 321 - 325
  • [48] All-pairs suffix/prefix in optimal time using Aho-Corasick space
    Loukides, Grigorios
    Pissis, Solon P.
    INFORMATION PROCESSING LETTERS, 2022, 178
  • [49] PAC-k: A Parallel Aho-Corasick String Matching Approach on Graphic Processing Units Using Non-Overlapped Threads
    Ho, ThienLuan
    Oh, Seung-Rohk
    Kim, HyunJin
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2016, E99B (07) : 1523 - 1531
  • [50] 一种基于分类存储的空间高效Aho-Corasick算法
    汪泓才
    李训根
    计算机应用与软件, 2017, 34 (05) : 279 - 282+316