Discovery of Genuine Functional Dependencies from Relational Data with Missing Values

被引:30
|
作者
Berti-Equille, Laure [1 ]
Harmouch, Nazar [2 ]
Naumann, Felix [2 ]
Novelli, Noel [1 ]
Saravanan [3 ]
机构
[1] Aix Marseille Univ, CNRS, LIS, Marseille, France
[2] Univ Potsdam, Hasso Plattner Inst, Potsdam, Germany
[3] HBKU, QCRI, Doha, Qatar
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2018年 / 11卷 / 08期
关键词
IMPUTATION;
D O I
10.14778/3204028.3204032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Functional dependencies (FDs) play an important role in maintaining data quality. They can be used to enforce data consistency and to guide repairs over a database. In this work, we investigate the problem of missing values and its impact on FD discovery. When using existing FD discovery algorithms, some genuine FDs could not be detected precisely due to missing values or some non-genuine FDs can be discovered even though they are caused by missing values with a certain NULL semantics. We define a notion of genuineness and propose algorithms to compute the genuineness score of a discovered FD. This can be used to identify the genuine FDs among the set of all valid dependencies that hold on the data. We evaluate the quality of our method over various real-world and semi-synthetic datasets with extensive experiments. The results show that our method performs well for relatively large FD sets and is able to accurately capture genuine FDs.
引用
收藏
页码:880 / 892
页数:13
相关论文
共 50 条
  • [21] Missing Values and Indeterminable Values in Fuzzy Relational Compositions
    Cao, Nhung
    PROCEEDINGS OF THE 11TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT 2019), 2019, 1 : 313 - 320
  • [22] SPECTRA FROM DATA WITH MISSING VALUES
    HARRIS, RW
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 1987, 1 (01) : 97 - 104
  • [23] Learning Models over Relational Data Using Sparse Tensors and Functional Dependencies
    Khamis, Mahmoud Abo
    Ngo, Hung Q.
    Nguyen, Xuanlong
    Olteanu, Dan
    Schleich, Maximilian
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2020, 45 (02):
  • [24] Discovery of fuzzy inclusion dependencies in fuzzy relational databases
    Sharma, AK
    Goswami, A
    Gupta, DK
    ISCC2004: NINTH INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 128 - 133
  • [25] Towards relational inconsistent databases with functional dependencies
    Greco, Sergio
    Molinaro, Cristian
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 695 - 702
  • [26] FUNCTIONAL DEPENDENCIES IN A RELATIONAL DATABASE AND PROPOSITIONAL LOGIC
    FAGIN, R
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1977, 21 (06) : 534 - 544
  • [27] Capturing Relational Schemas and Functional Dependencies in RDFS
    Calvanese, Diego
    Fischl, Wolfgang
    Pichler, Reinhard
    Sallinger, Emanuel
    Simkus, Mantas
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1003 - 1011
  • [28] DISCOVERING FUNCTIONAL AND INCLUSION DEPENDENCIES IN RELATIONAL DATABASES
    KANTOLA, M
    MANNILA, H
    RAIHA, KJ
    SIIRTOLA, H
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1992, 7 (07) : 591 - 607
  • [29] Relational decomposition through partial functional dependencies
    Berzal, F
    Cubero, JC
    Cuenca, F
    Medina, JM
    DATA & KNOWLEDGE ENGINEERING, 2002, 43 (02) : 207 - 234
  • [30] Functional dependencies with null values, fuzzy values, and crisp values
    Liao, SY
    Wang, HQ
    Liu, WY
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (01) : 97 - 103