Efficient Evaluation of SUM Queries over Probabilistic Data

被引:10
|
作者
Akbarinia, Reza [1 ,2 ]
Valduriez, Patrick [1 ,2 ]
Verger, Guillaume [1 ,2 ]
机构
[1] INRIA, F-34095 Montpellier, France
[2] LIRMM, F-34095 Montpellier, France
关键词
Database management; systems; query processing; DATABASES;
D O I
10.1109/TKDE.2012.62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
SUM queries are crucial for many applications that need to deal with uncertain data. In this paper, we are interested in the queries, called ALL_SUM, that return all possible sum values and their probabilities. In general, there is no efficient solution for the problem of evaluating ALL_SUM queries. But, for many practical applications, where aggregate values are small integers or real numbers with small precision, it is possible to develop efficient solutions. In this paper, based on a recursive approach, we propose a new solution for those applications. We implemented our solution and conducted an extensive experimental evaluation over synthetic and real-world data sets; the results show its effectiveness.
引用
收藏
页码:764 / 775
页数:12
相关论文
共 50 条
  • [41] Evaluating Probabilistic Queries over Uncertain Matching
    Cheng, Reynold
    Gong, Jian
    Cheung, David W.
    Cheng, Jiefeng
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1096 - 1107
  • [42] Scalable Evaluation of Trajectory Queries over Imprecise Location Data
    Xie, Xike
    Yiu, Man L.
    Cheng, Reynold
    Lu, Hua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (08) : 2029 - 2044
  • [43] Efficient Routing of Subspace Skyline Queries over Highly Distributed Data
    Vlachou, Akrivi
    Doulkeridis, Christos
    Kotidis, Yannis
    Vazirgiannis, Michalis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (12) : 1694 - 1708
  • [44] Efficient Indexing Multiple Multidimensional Continuous Queries over Data Stream
    Hou, Dongfeng
    Liu, Qingbao
    Lu, Changhui
    Zhang, Weiming
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 6, 2010, : 594 - 598
  • [45] Efficient processing of multiple continuous skyline queries over a data stream
    Lee, Yu Won
    Lee, Ki Yong
    Kim, Myoung Ho
    INFORMATION SCIENCES, 2013, 221 : 316 - 337
  • [46] GDPS: An Efficient Approach for Skyline Queries over Distributed Uncertain Data
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Xiaowei
    yu, Jie
    BIG DATA RESEARCH, 2014, 1 (01) : 23 - 36
  • [47] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1448 - 1462
  • [48] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,
  • [49] Query Shredding: Efficient Relational Evaluation of Queries over Nested Multisets
    Cheney, James
    Lindley, Sam
    Wadler, Philip
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1027 - 1038
  • [50] Probabilistic verifiers: Evaluating Constrained Nearest-Neighbor queries over uncertain data
    Cheng, Reynold
    Chen, Jinchuan
    Mokbel, Mohamed
    Chow, Chi-Yin
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 973 - +