A partition-based efficient algorithm for large scale mul tiple-strings matching

被引:0
|
作者
Liu, Ping [1 ]
Liu, Yan-Bing [1 ]
Tan, Jian-Long [1 ]
机构
[1] Chinese Acad Sci, Software Div, Inst Comp Technol, Beijing 100080, Peoples R China
来源
String Processing and Information Retrieval, Proceedings | 2005年 / 3772卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Filtering plays an important role in the Internet security and information retrieval fields, and usually employs multiple-strings matching algorithm as its key part. All the classical matching algorithms, however, perform badly when the number of the keywords exceeds a critical point, which made large scale multiple-strings matching problem a great challenge. Based on the observation that the speed of the classical algorithms depends mainly on the length of the shortest keyword, a partition strategy was proposed to decompose the keywords set into a series of subsets on which the classical algorithms was performed. For the optimal partition, it was proved that the keywords with same length locate in one subset, and length of keywords in different subsets would not interlace each other. In this paper, we proposed a shortest-path model for the optimal partition finding problem. Experiments on both random and real data demonstrate that our algorithms generally has about a 100-300% speed-up compared with the classical ones.
引用
收藏
页码:399 / 404
页数:6
相关论文
共 50 条
  • [41] An energy-efficient partition-based XYZ-planar routing algorithm for a wireless network-on-chip
    Fahimeh Yazdanpanah
    Raheel AfsharMazayejani
    Mohammad Alaei
    Amin Rezaei
    Masoud Daneshtalab
    The Journal of Supercomputing, 2019, 75 : 837 - 861
  • [42] Improved pattern matching algorithm based on partition
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    Dalian Haishi Daxue Xuebao, 2008, 1 (41-44):
  • [43] An automatic partition-based parallel algorithm for grid-based distributed hydrological models
    Xu, Zhenwu
    Tang, Guoping
    Jiang, Tao
    Chen, Xiaohua
    Chen, Tao
    Niu, Xiangyu
    ENVIRONMENTAL MODELLING & SOFTWARE, 2021, 144
  • [44] Square partition-based node scheduling algorithm for wireless passive sensor networks
    Lu, Xu
    Chen, Rongjun
    Liu, Jun
    Cheng, Lianglun
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2018, 31 (08)
  • [45] A λ-level partition-based linear back projection algorithm to electrical resistance tomography
    Liu, Xuezhen
    Yue, Shihong
    Ren, Honghao
    2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
  • [46] Partition-based algorithm for estimating transportation network reliability with dependent link failures
    Sumalee, Agachai
    Watling, David P.
    JOURNAL OF ADVANCED TRANSPORTATION, 2008, 42 (03) : 213 - 238
  • [47] Partition-based algorithm for estimating transportation network reliability with dependent link failures
    Sumalee, Agachai
    Watling, David P.
    Journal of Advanced Transportation, 2008, 42 (03): : 213 - 238
  • [48] Efficient Large-Scale Stereo Matching
    Geiger, Andreas
    Roser, Martin
    Urtasun, Raquel
    COMPUTER VISION-ACCV 2010, PT I, 2011, 6492 : 25 - +
  • [49] Efficient profile matching for large scale Webcasting
    Lu, Q
    Eichstaedt, M
    Ford, D
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 443 - 455
  • [50] Partition-based distributed extended Kalman filter for large-scale nonlinear processes with application to chemical and wastewater treatment processes
    Li, Xiaojie
    Law, Adrian Wing-Keung
    Yin, Xunyuan
    AICHE JOURNAL, 2023, 69 (12)