A partition-based efficient algorithm for large scale mul tiple-strings matching

被引:0
|
作者
Liu, Ping [1 ]
Liu, Yan-Bing [1 ]
Tan, Jian-Long [1 ]
机构
[1] Chinese Acad Sci, Software Div, Inst Comp Technol, Beijing 100080, Peoples R China
来源
String Processing and Information Retrieval, Proceedings | 2005年 / 3772卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Filtering plays an important role in the Internet security and information retrieval fields, and usually employs multiple-strings matching algorithm as its key part. All the classical matching algorithms, however, perform badly when the number of the keywords exceeds a critical point, which made large scale multiple-strings matching problem a great challenge. Based on the observation that the speed of the classical algorithms depends mainly on the length of the shortest keyword, a partition strategy was proposed to decompose the keywords set into a series of subsets on which the classical algorithms was performed. For the optimal partition, it was proved that the keywords with same length locate in one subset, and length of keywords in different subsets would not interlace each other. In this paper, we proposed a shortest-path model for the optimal partition finding problem. Experiments on both random and real data demonstrate that our algorithms generally has about a 100-300% speed-up compared with the classical ones.
引用
收藏
页码:399 / 404
页数:6
相关论文
共 50 条
  • [21] An Iterative Partition-Based Moving Horizon Estimator for Large-Scale Linear Systems
    Schneider, Rene
    Scheu, Holger
    Marquardt, Wolfgang
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 2621 - 2626
  • [22] Partition-Based Caching in Large-Scale SIC-Enabled Wireless Networks
    Jiang, Dongdong
    Cui, Ying
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (03) : 1660 - 1675
  • [23] Efficient structure similarity searches: a partition-based approach
    Xiang Zhao
    Chuan Xiao
    Xuemin Lin
    Wenjie Zhang
    Yang Wang
    The VLDB Journal, 2018, 27 : 53 - 78
  • [24] Efficient structure similarity searches: a partition-based approach
    Zhao, Xiang
    Xiao, Chuan
    Lin, Xuemin
    Zhang, Wenjie
    Wang, Yang
    VLDB JOURNAL, 2018, 27 (01): : 53 - 78
  • [25] PESE: An efficient partition-based electrical simulation environment
    Kim, YG
    Dharchoudhury, A
    Kang, SM
    Kim, KH
    38TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 57 - 60
  • [26] A Partition-Based Match Making Algorithm for Dynamic Ridesharing
    Pelzer, Dominik
    Xiao, Jiajian
    Zehe, Daniel
    Lees, Michael H.
    Knoll, Alois C.
    Aydt, Heiko
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (05) : 2587 - 2598
  • [27] A Partition-Based Five-Point Coordinate Conversion Method for Large Scale Interactive Whiteboard
    Wang, Yiwen
    Wang, Xiaoting
    Li, Hui
    ELECTRICAL AND CONTROL ENGINEERING & MATERIALS SCIENCE AND MANUFACTURING, 2016, : 317 - 324
  • [28] Partition-based algorithm for power grid design using locality
    Singh, J
    Sapatnekar, SS
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2006, 25 (04) : 664 - 677
  • [29] A partition-based serial algorithm for generating viewshed on massive DEMs
    Wu, Huanping
    Pan, Mao
    Yao, Lingqing
    Luo, Bing
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2007, 21 (09) : 955 - 964
  • [30] A partition-based constrained multi-objective evolutionary algorithm
    Yang, Yongkuan
    Liu, Jianchang
    Tan, Shubin
    SWARM AND EVOLUTIONARY COMPUTATION, 2021, 66