Aggregate Query Processing on Incomplete Data

被引:2
|
作者
Zhang, Anzhen [1 ]
Wang, Jinbao [1 ]
Li, Jianzhong [1 ]
Gao, Hong [1 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
来源
关键词
Aggregate query; Incomplete data; Estimation;
D O I
10.1007/978-3-319-96890-2_24
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incomplete data has been a longstanding issue in database community, and yet the subject is poorly handled by both theory and practice. In this paper, we propose to directly estimate the aggregate query result on incomplete data, rather than imputing the missing values. An interval estimation, composed of the upper and lower bound of aggregate query results among all possible interpretation of missing values, are presented to the end-users. The ground-truth aggregate result is guaranteed to be among the interval. Experimental results are consistent with the theoretical results, and suggest that the estimation is invaluable to better assess the results of aggregate queries on incomplete data.
引用
收藏
页码:286 / 294
页数:9
相关论文
共 50 条
  • [1] Aggregate Query Processing Algorithm on Incomplete Data Based on Denotational Semantics
    Zhang A.-Z.
    Li J.-Z.
    Gao H.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (02): : 406 - 420
  • [2] Skyline query processing for incomplete data
    Khalefa, Mohamed E.
    Mokbel, Mohamed F.
    Levandoski, Justin J.
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 556 - 565
  • [3] Optimizing Skyline Query Processing in Incomplete Data
    Gulzar, Yonis
    Alwan, Ali A.
    Turaev, Sherzod
    IEEE ACCESS, 2019, 7 : 178121 - 178138
  • [4] SKYLINE QUERY PROCESSING FOR INCOMPLETE DATA IN CLOUD ENVIRONMENT
    Gulzar, Yonis
    Alwan, Ali A.
    Salleh, Norsaremah
    Al-Shaikhli, Imad Fakhri
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATICS: EMBRACING ECO-FRIENDLY COMPUTING, 2017, : 567 - 576
  • [5] Priority-Based Skyline Query Processing for Incomplete Data
    Liu, Chuang-Ming
    Pak, Denis
    Castellanos, Ari Ernesto Ortiz
    IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, 2021, : 204 - 211
  • [6] Optimizing Performance of Aggregate Query Processing with Histogram Data Structure
    Liang Yong
    Mu Zhaonan
    SOFTWARE ENGINEERING METHODS IN INTELLIGENT ALGORITHMS, VOL 1, 2019, 984 : 342 - 350
  • [7] Query processing over incomplete autonomous databases: query rewriting using learned data dependencies
    Wolf, Garrett
    Kalavagattu, Aravind
    Khatri, Hemal
    Balakrishnan, Raju
    Chokshi, Bhaumik
    Fan, Jianchun
    Chen, Yi
    Kambhampati, Subbarao
    VLDB JOURNAL, 2009, 18 (05): : 1167 - 1190
  • [8] Query processing over incomplete autonomous databases: query rewriting using learned data dependencies
    Garrett Wolf
    Aravind Kalavagattu
    Hemal Khatri
    Raju Balakrishnan
    Bhaumik Chokshi
    Jianchun Fan
    Yi Chen
    Subbarao Kambhampati
    The VLDB Journal, 2009, 18 : 1167 - 1190
  • [9] Probabilistic Threshold Range Aggregate Query Processing over Uncertain Data
    Yang, Shuxiang
    Zhang, Wenjie
    Zhang, Ying
    Lin, Xuemin
    ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2009, 5446 : 51 - +
  • [10] An incomplete database approach to global query processing
    Otsuka, S
    Miyazaki, N
    TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN-12), PROCEEDINGS, 1998, : 337 - 342