A multiple string matching algorithm based on memory optimization

Authors
Liu, Yanbing [1, 2]
Liu, Ping [1]
Tan, Jianlong [1]
Guo, Li [1]
Affiliations
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
[2] Graduate University of Chinese Academy of Sciences, Beijing 100049, China
Keywords
Intrusion detection - Trees (mathematics) - Automata theory - Cache memory - Computer viruses - Computer system firewalls - Network security - Pattern matching - Speed - Forestry
DOI
Not available
Abstract
Multiple string matching algorithms play a fundamental role in many network security systems, such as intrusion detection and prevention systems, anti-virus systems, anti-spam systems, and firewalls. The memory usage and cache locality of the matching automaton have been observed to be critical factors in the search speed of these algorithms. As the pattern set grows, classical multiple string matching algorithms suffer severe performance degradation because their automata require massive storage. The authors propose optimization strategies for the classical string matching algorithm SBOM that reduce its automaton size and improve its cache locality, which substantially increases search speed. Specifically, the Factor Oracle of SBOM is first replaced with a suffix tree; rarely accessed automaton nodes are then pruned, reducing the suffix tree to nearly linear space complexity; finally, the pruned suffix tree is represented as a double-array trie to compress its memory footprint. Compared with SBOM, the proposed algorithm greatly reduces memory usage and improves search speed. Experiments on random data sets show that it uses less than 5% of the memory of SBOM and achieves a 100% performance improvement over SBOM. The proposed algorithm is especially suitable for high-speed online pattern matching.
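The abstract names the double-array trie as the final, compressed representation. The following is a minimal sketch of that general data structure only, not the authors' implementation: it stores a plain keyword trie rather than a pruned suffix tree, and the function names (`build_trie`, `to_double_array`, `contains`) and the byte-plus-one character coding are assumptions made for illustration. A transition from state `s` on code `c` goes to `t = base[s] + c` and is valid only if `check[t] == s`, so every transition is two array reads.

```python
from collections import deque


def build_trie(words):
    """Build a plain dict-based trie; node 0 is the root."""
    children = [{}]      # children[node] maps a byte value to a child node
    terminal = [False]   # terminal[node] is True if a word ends at node
    for word in words:
        node = 0
        for byte in word.encode():
            nxt = children[node].get(byte)
            if nxt is None:
                nxt = len(children)
                children.append({})
                terminal.append(False)
                children[node][byte] = nxt
            node = nxt
        terminal[node] = True
    return children, terminal


def to_double_array(children, terminal):
    """Pack the trie into base/check arrays (illustrative first-fit layout)."""
    base, check, term = [0], [0], [terminal[0]]  # slot 0 is the root

    def ensure(size):
        # Grow all three arrays so that index `size` is addressable.
        while len(check) <= size:
            base.append(0)
            check.append(-1)   # -1 marks a free slot
            term.append(False)

    queue = deque([(0, 0)])    # pairs of (trie node, double-array state)
    while queue:
        node, state = queue.popleft()
        kids = children[node]
        if not kids:
            continue
        codes = [byte + 1 for byte in kids]  # codes start at 1 (assumption)
        b = 1
        while True:            # first-fit search for a collision-free base
            ensure(b + max(codes))
            if all(check[b + c] == -1 for c in codes):
                break
            b += 1
        base[state] = b
        for byte, child in kids.items():
            t = b + byte + 1
            check[t] = state
            term[t] = terminal[child]
            queue.append((child, t))
    return base, check, term


def contains(base, check, term, word):
    """Return True if `word` was one of the stored keywords."""
    state = 0
    for byte in word.encode():
        t = base[state] + byte + 1
        if t >= len(check) or check[t] != state:
            return False
        state = t
    return term[state]


patterns = ["he", "she", "his", "hers"]
base, check, term = to_double_array(*build_trie(patterns))
print(all(contains(base, check, term, p) for p in patterns))
print(contains(base, check, term, "her"))
```

The first-fit base search used here is the simplest placement strategy and is slow for large pattern sets; it is chosen only to keep the sketch short.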
Pages: 1768-1776