Approximate integrity constraints in incomplete databases with limited domains

被引:0
|
作者
Al-atar, Munqath [1 ]
Sali, Attila [2 ,3 ]
机构
[1] Univ Kufa, ITRDC, Kufa, Iraq
[2] Budapest Univ Technol & Econ, Dept Comp Sci & Informat Theory, Budapest, Hungary
[3] Alfred Renyi Inst Math, Budapest, Hungary
关键词
Strongly possible functional dependencies; Strongly possible keys; Cross joins; Multivalued dependency; Incomplete databases; Data imputation; Approximate functional dependencies; Approximate keys; DEPENDENCIES; DISCOVERY;
D O I
10.1007/s10472-025-09967-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A strongly possible constraint is an intermediate concept between possible and certain constraints, based on the strongly possible world approach (a strongly possible world is obtained by replacing NULL's by a value from the ones appearing in the corresponding attribute of the table). In the present paper, we introduce strongly possible versions of multivalued dependencies and cross joins, and we analyse the complexity of checking the validity of a given strongly possible cross joins. We also study two approximation measures, g(3) and g(5), of strongly possible keys (spKeys), functional dependencies (spFDs), multivalued dependencies (spMVDs) and cross joins (spCJs). For spKeys and spFDs, we show that the g(3) value is always an upper bound of the g(5) value for a given constraint in a table. However, there are tables of arbitrarily large number of tuples and a constant number of attributes that satisfy g(3)-g(5)=p/q for any rational number 0 <= p/q<1. On the other hand, we show that the values of measures g(3) and g(5) are independent of each other in the case of spMVDs and spCJs. We prove that checking whether a given strongly possible cross join holds in an incomplete table is NP-complete, in sharp contrast to the fact that checking a given cross join in a complete table is easily seen to be polynomially solvable. We also treat complexity questions of determination of the approximation values, namely we show that both, determining g(3) and g(5) for spCJs are NP-complete.
引用
收藏
页数:28
相关论文
共 50 条