Query-Based Data Pricing

被引:115
|
作者
Koutris, Paraschos [1 ]
Upadhyaya, Prasang [1 ]
Balazinska, Magdalena [1 ]
Howe, Bill [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Algorithms; Economics; Theory; Data pricing; arbitrage; query determinacy;
D O I
10.1145/2770870
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data is increasingly being bought and sold online, and Web-based marketplace services have emerged to facilitate these activities. However, current mechanisms for pricing data are very simple: buyers can choose only from a set of explicit views, each with a specific price. In this article, we propose a framework for pricing data on the Internet that, given the price of a few views, allows the price of any query to be derived automatically. We call this capability query-based pricing. We first identify two important properties that the pricing function must satisfy, the arbitrage-free and discount-free properties. Then, we prove that there exists a unique function that satisfies these properties and extends the seller's explicit prices to all queries. Central to our framework is the notion of query determinacy, and in particular instance-based determinacy: we present several results regarding the complexity and properties of it. When both the views and the query are unions of conjunctive queries or conjunctive queries, we show that the complexity of computing the price is high. To ensure tractability, we restrict the explicit prices to be defined only on selection views (which is the common practice today). We give algorithms with polynomial time data complexity for computing the price of two classes of queries: chain queries (by reducing the problem to network flow), and cyclic queries. Furthermore, we completely characterize the class of conjunctive queries without self-joins that have PTIME data complexity, and prove that pricing all other queries is NP-complete, thus establishing a dichotomy on the complexity of the pricing problem when all views are selection queries.
引用
收藏
页数:44
相关论文
共 50 条
  • [21] Regularizing query-based retrieval scores
    Fernando Diaz
    Information Retrieval, 2007, 10 : 531 - 562
  • [22] Query-based summarization of discussion threads
    Verberne, Suzan
    Krahmer, Emiel
    Wubben, Sander
    van den Bosch, Antal
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (01) : 3 - 29
  • [23] Asking the Right Questions - Query-Based Data Citation to Precisely Identify Subsets of Data
    Proell, Stefan
    Rauber, Andreas
    ERCIM NEWS, 2015, (100): : 26 - 27
  • [24] Effective and Robust Query-Based Stemming
    Paik, Jiaul H.
    Parui, Swapan K.
    Pal, Dipasree
    Robertson, Stephen E.
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2013, 31 (04)
  • [25] Query-based sampling of text databases
    Callan, J
    Connell, M
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (02) : 97 - 130
  • [26] Query-Based Summarization for search lists
    Ye, Xinghuo
    Wei, Hai
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 330 - 333
  • [27] From ambiguities to insights: Query-based comparisons of high-dimensional data
    Kowalski, Jeanne
    Talbot, Conover
    Tsai, Hua L.
    Prasad, Nijaguna
    Umbricht, Christopher, Jr.
    Zeiger, Martha A.
    COMPUTATIONAL MODELS FOR LIFE SCIENCES (CMLS 07), 2007, 952 : 305 - +
  • [28] Query-based biclustering of gene expression data using Probabilistic Relational Models
    Hui Zhao
    Lore Cloots
    Tim Van den Bulcke
    Yan Wu
    Riet De Smet
    Valerie Storms
    Pieter Meysman
    Kristof Engelen
    Kathleen Marchal
    BMC Bioinformatics, 12
  • [29] Intelligent Query-Based Data Aggregation Model and Optimized Query Ordering for Efficient Wireless Sensor Network
    Sarode, Prachi
    Nandhini, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 100 (04) : 1405 - 1425
  • [30] Intelligent Query-Based Data Aggregation Model and Optimized Query Ordering for Efficient Wireless Sensor Network
    Prachi Sarode
    R. Nandhini
    Wireless Personal Communications, 2018, 100 : 1405 - 1425