The Importance of Parameters in Database Queries

被引:0
|
作者
Grohe, Martin [1 ]
Kimelfeld, Benny [2 ]
Lindner, Peter [3 ]
Standke, Christoph [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
[2] Technion Israel Inst Technol, Haifa, Israel
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
SHAP score; query parameters; Shapley value; EXPLANATIONS; TRACTABILITY; COMPLEXITY;
D O I
10.4230/LIPIcs.ICDT.2024.14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose and study a framework for quantifying the importance of the choices of parameter values to the result of a query over a database. These parameters occur as constants in logical queries, such as conjunctive queries. In our framework, the importance of a parameter is its Shap score. This score is a popular instantiation of the game-theoretic Shapley value to measuring the importance of feature values in machine learning models. We make the case for the rationale of using this score by explaining the intuition behind Shap, and by showing that we arrive at this score in two different, apparently opposing, approaches to quantifying the contribution of a parameter. The application of the Shap score requires two components in addition to the query and the database: (a) a probability distribution over the combinations of parameter values, and (b) a utility function that measures the similarity between the result for the original parameters and the result for hypothetical parameters. The main question addressed in the paper is the complexity of calculating the Shap score for different distributions and similarity measures. We first address the case of probabilistically independent parameters. The problem is hard if we consider a fragment of queries that is hard to evaluate (as one would expect), and even for the fragment of acyclic conjunctive queries. In some cases, though, one can efficiently list all relevant parameter combinations, and then the Shap score can be computed in polynomial time under reasonable general conditions. Also tractable is the case of full acyclic conjunctive queries for certain (natural) similarity functions. We extend our results to conjunctive queries with inequalities between variables and parameters. Finally, we discuss a simple approximation technique for the case of correlated parameters.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Maintaining Database Anonymity in the Presence of Queries
    Riley, Ryan
    Clifton, Chris
    Malluhi, Qutaibah
    SECURITY AND TRUST MANAGEMENT, STM 2013, 2013, 8203 : 33 - 48
  • [42] Algebraic optimization of recursive database queries
    Hansen, Michael R.
    INFOR: Information Systems and Operational Research, 1988, 26 (04): : 286 - 298
  • [43] Relativistic quantum private database queries
    Si-Jia Sun
    Yu-Guang Yang
    Ming-Ou Zhang
    Quantum Information Processing, 2015, 14 : 1443 - 1450
  • [44] Relative qualification in database flexible queries
    Tudorie, Cornelia
    Bumbaru, Severin
    Dumitriu, Luminita
    2006 3RD INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 83 - 88
  • [45] A METHOD FOR PROCESSING DISTRIBUTED DATABASE QUERIES
    PERRIZO, W
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1984, 10 (04) : 466 - 471
  • [46] A database schema for constructing visual queries
    Presser, CGM
    MSV'04 & AMCS'04, PROCEEDINGS, 2004, : 135 - 140
  • [47] Renovating Database Queries with Query AutoAwesome
    Suryavanshi, Chetna
    Dyreson, Curtis
    Adams, Jonathan
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 253 - 260
  • [48] LINEAR PROGRAMS WITH CONJUNCTIVE DATABASE QUERIES
    Capelli, Florent
    Crosetti, Nicolas
    Niehren, Joachim
    Ramon, Jan
    LOGICAL METHODS IN COMPUTER SCIENCE, 2024, 20 (01) : 1 - 9
  • [49] Dynamic Query Forms for Database Queries
    Tang, Liang
    Li, Tao
    Jiang, Yexi
    Chen, Zhiyuan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2166 - 2178
  • [50] A PRACTICAL ALGORITHM FOR EVALUATING DATABASE QUERIES
    DEVET, JHM
    SOFTWARE-PRACTICE & EXPERIENCE, 1989, 19 (05): : 491 - 504