Querying Incomplete Numerical Data: Between Certain and Possible Answers

Times cited: 0
Authors
Console, Marco [1]
Libkin, Leonid [2, 3, 4]
Peterfreund, Liat [5]
Affiliations
[1] Sapienza Univ Rome, Rome, Italy
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] PSL Univ, RelationalAI, Paris, France
[4] PSL Univ, ENS, Paris, France
[5] Univ Gustave Eiffel, LIGM, CNRS, Champs Sur Marne, France
Funding
UK Engineering and Physical Sciences Research Council (EPSRC); EU Horizon 2020
Keywords
Nulls; numerical attributes; aggregate queries; probabilistic databases; approximations; certain and possible answers; LANGUAGES
DOI
10.1145/3584372.3588660
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Queries with aggregation and arithmetic operations, as well as incomplete data, are common in real-world databases, but we lack a good understanding of how they should interact. On the one hand, systems based on SQL provide ad-hoc rules for numerical nulls; on the other, theoretical research largely concentrates on the standard notions of certain and possible answers. In the presence of numerical attributes and aggregates, however, these answers are often meaningless, returning either too little or too much. Our goal is to define a principled framework for databases with numerical nulls and for answering queries with arithmetic and aggregation over them. Towards this goal, we assume that missing values in numerical attributes are given by probability distributions associated with marked nulls. This yields a model of probabilistic bag databases in which tuples are not necessarily independent, since nulls can repeat. We provide a general compositional framework for query answering and then concentrate on queries that resemble standard SQL with arithmetic and aggregation. We show that these queries are measurable and that their outputs have a finite representation. Moreover, since the classical forms of answers provide little information in the numerical setting, we look at the probability that numerical values in output tuples belong to specific intervals. Even though exact computation of these probabilities is intractable, we give efficient approximation algorithms for computing them.
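The interval-probability idea in the abstract can be illustrated with a naive Monte Carlo sketch. This is not the paper's approximation algorithm; it is only a plain sampling estimate under assumptions made here for illustration: the `Null` class, the `estimate_interval_probability` function, the toy table, and the choice of a uniform distribution on [0, 10] are all hypothetical. A repeated marked null receives the same sampled value in every tuple, which is what makes the tuples dependent.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Null:
    """A marked null (hypothetical helper); equal names denote the same unknown value."""
    name: str


def estimate_interval_probability(rows, samplers, query, low, high,
                                  trials=20000, seed=0):
    """Estimate P(low <= query(completed table) <= high) by repeated sampling."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Draw one value per marked null; every occurrence shares that value.
        valuation = {name: draw(rng) for name, draw in samplers.items()}
        completed = [
            tuple(valuation[v.name] if isinstance(v, Null) else v for v in row)
            for row in rows
        ]
        if low <= query(completed) <= high:
            hits += 1
    return hits / trials


# Toy bag of tuples: the null "x" occurs twice, so SUM = 10 + 2x.
rows = [("a", 10.0), ("b", Null("x")), ("c", Null("x"))]
samplers = {"x": lambda rng: rng.uniform(0.0, 10.0)}  # assumed distribution


def total(table):
    """SUM over the numerical (second) attribute."""
    return sum(value for _, value in table)


estimate = estimate_interval_probability(rows, samplers, total, 15.0, 20.0)
print(estimate)
```

In this toy instance the exact probability is 0.25, because SUM lies in [15, 20] exactly when x lies in [2.5, 5]; the printed estimate should be close to that value.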
Pages: 349-358
Number of pages: 10