The Importance of Parameters in Database Queries

被引:0
|
作者
Grohe, Martin [1 ]
Kimelfeld, Benny [2 ]
Lindner, Peter [3 ]
Standke, Christoph [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
[2] Technion Israel Inst Technol, Haifa, Israel
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
SHAP score; query parameters; Shapley value; EXPLANATIONS; TRACTABILITY; COMPLEXITY;
D O I
10.4230/LIPIcs.ICDT.2024.14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose and study a framework for quantifying the importance of the choices of parameter values to the result of a query over a database. These parameters occur as constants in logical queries, such as conjunctive queries. In our framework, the importance of a parameter is its Shap score. This score is a popular instantiation of the game-theoretic Shapley value to measuring the importance of feature values in machine learning models. We make the case for the rationale of using this score by explaining the intuition behind Shap, and by showing that we arrive at this score in two different, apparently opposing, approaches to quantifying the contribution of a parameter. The application of the Shap score requires two components in addition to the query and the database: (a) a probability distribution over the combinations of parameter values, and (b) a utility function that measures the similarity between the result for the original parameters and the result for hypothetical parameters. The main question addressed in the paper is the complexity of calculating the Shap score for different distributions and similarity measures. We first address the case of probabilistically independent parameters. The problem is hard if we consider a fragment of queries that is hard to evaluate (as one would expect), and even for the fragment of acyclic conjunctive queries. In some cases, though, one can efficiently list all relevant parameter combinations, and then the Shap score can be computed in polynomial time under reasonable general conditions. Also tractable is the case of full acyclic conjunctive queries for certain (natural) similarity functions. We extend our results to conjunctive queries with inequalities between variables and parameters. Finally, we discuss a simple approximation technique for the case of correlated parameters.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] PROCEDURAL LANGUAGES FOR DATABASE QUERIES AND UPDATES
    ABITEBOUL, S
    VIANU, V
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1990, 41 (02) : 181 - 229
  • [32] EXPRESSING DATABASE QUERIES WITH INTUITIONISTIC LOGIC
    BONNER, AJ
    MCCARTY, LT
    VADAPARTY, K
    LOGIC PROGRAMMING : PROCEEDINGS OF THE NORTH AMERICAN CONFERENCE, 1989, VOL 1-2, 1989, : 831 - 851
  • [33] Fuzzy Queries above Relational Database
    Smolka, Pavel
    Bradac, Vladimir
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2017 (ICCMSE-2017), 2017, 1906
  • [34] The trichotomy of HAVING queries on a probabilistic database
    Re, Christopher
    Suciu, Dan
    VLDB JOURNAL, 2009, 18 (05): : 1091 - 1116
  • [35] Efficient Protocols for Private Database Queries
    Saha, Tushar Kanti
    Mayank
    Koshiba, Takeshi
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXI, DBSEC 2017, 2017, 10359 : 337 - 348
  • [36] CryptDB: Processing Queries on an Encrypted Database
    Popa, Raluca Ada
    Redfield, Catherine M. S.
    Zeldovich, Nickolai
    Balakrishnan, Hari
    COMMUNICATIONS OF THE ACM, 2012, 55 (09) : 103 - 111
  • [37] JReq: Database Queries in Imperative Languages
    Iu, Ming-Yee
    Cecchet, Emmanuel
    Zwaenepoel, Willy
    COMPILER CONSTRUCTION, PROCEEDINGS, 2010, 6011 : 84 - +
  • [38] Information Retrieval from Database Queries
    Catao, Vladimir Soares
    Sampaio, Marcus Costa
    Schiel, Ulrich
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 507 - 514
  • [39] Database interrogation using conjunctive queries
    Bielecki, M
    Van den Bussche, J
    DATABASE THEORY ICDT 2003, PROCEEDINGS, 2003, 2572 : 259 - 269
  • [40] Scoping Monadic Relational Database Queries
    Ekblad, Anton
    PROCEEDINGS OF THE 12TH ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON HASKELL (HASKELL '19), 2019, : 114 - 124