On the Complexity of String Matching for Graphs

被引:8
|
作者
Equi, Massimo [1 ]
Makinen, Veli [1 ]
Tomescu, Alexandru I. [1 ]
Grossi, Roberto [2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, POB 68,Pietari Kalmin Katu 5, Helsinki 00014, Finland
[2] Univ Pisa, Dipartimento Informat, Largo B Pontecorvo 3, I-56127 Pisa, Italy
基金
欧洲研究理事会; 芬兰科学院;
关键词
Exact pattern matching; graph query; graph search; labeled graphs; string matching; string search; Strong Exponential Time Hypothesis; heterogeneous networks; variation graphs; EDIT DISTANCE CANNOT; TIME UNLESS SETH; PATTERN;
D O I
10.1145/3588334
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Exact string matching in labeled graphs is the problem of searching paths of a graph G = (V, E) such that the concatenation of their node labels is equal to a given pattern string P[1..m]. This basic problem can be found at the heart of more complex operations on variation graphs in computational biology, of query operations in graph databases, and of analysis operations in heterogeneous networks. We prove a conditional lower bound stating that, for any constant epsilon > 0, an O(|E|(1-epsilon) m) time, or an O(|E| m(1-epsilon)) time algorithm for exact string matching in graphs, with node labels and pattern drawn from a binary alphabet, cannot be achieved unless the Strong Exponential Time Hypothesis (SETH) is false. This holds even if restricted to undirected graphs with maximum node degree 2-that is, to zig-zag matching in bidirectional strings, or to deterministic directed acyclic graphs whose nodes have maximum sum of indegree and outdegree 3. These restricted cases make the lower bound stricter than what can be directly derived from related bounds on regular expression matching (Backurs and Indyk, FOCS'16). In fact, our bounds are tight in the sense that lowering the degree or the alphabet size yields linear time solvable problems. An interesting corollary is that exact and approximate matching are equally hard (i.e., quadratic time) in graphs under SETH. In comparison, the same problems restricted to strings have linear time vs quadratic time solutions, respectively (approximate pattern matching having also a matching SETH lower bound (Backurs and Indyk, STOC'15)).
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Space complexity of perfect matching in bounded genus bipartite graphs
    Datta, Samir
    Kulkarni, Raghav
    Tewari, Raghunath
    Vinodchandran, N. V.
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (03) : 765 - 779
  • [22] A complexity dichotomy for matching cut in (bipartite) graphs of fixed diameter
    Le, Hoang-Oanh
    Le, Van Bang
    THEORETICAL COMPUTER SCIENCE, 2019, 770 : 69 - 78
  • [23] Quantum Time Complexity and Algorithms for Pattern Matching on Labeled Graphs
    Darbaril, Parisa
    Gibney, Daniel
    Thankachan, Sharma, V
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2022, 2022, 13617 : 303 - 314
  • [24] On the complexity of the dominating induced matching problem in hereditary classes of graphs
    Cardoso, Domingos M.
    Korpelainen, Nicholas
    Lozin, Vadim V.
    DISCRETE APPLIED MATHEMATICS, 2011, 159 (07) : 521 - 531
  • [25] TIGHT BOUNDS ON THE COMPLEXITY OF THE BOYER-MOORE STRING-MATCHING ALGORITHM
    COLE, R
    SIAM JOURNAL ON COMPUTING, 1994, 23 (05) : 1075 - 1091
  • [26] Parameterized Complexity of Induced Graph Matching on Claw-Free Graphs
    Danny Hermelin
    Matthias Mnich
    Erik Jan van Leeuwen
    Algorithmica, 2014, 70 : 513 - 560
  • [27] Parameterized Complexity of Induced Graph Matching on Claw-Free Graphs
    Hermelin, Danny
    Mnich, Matthias
    van Leeuwen, Erik Jan
    ALGORITHMICA, 2014, 70 (03) : 513 - 560
  • [28] String graphs and incomparability graphs
    Fox, Jacob
    Pach, Janos
    ADVANCES IN MATHEMATICS, 2012, 230 (03) : 1381 - 1401
  • [29] Hybrid Approach to Reduce Time Complexity of String Matching Algorithm Using Hashing with Chaining
    Pandey, Shivendra Kumar
    Tiwari, Hari Krishna
    Tripathi, Priyanka
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT, ICT4SD 2015, VOL 1, 2016, 408 : 185 - 193
  • [30] Towards increasing F-measure of approximate string matching in O(1) complexity
    Boguszewski, Adrian
    Szymanski, Julian
    Draszawka, Karol
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 527 - 532