Efficiently Labeling and Retrieving Temporal Anomalies in Relational Databases

被引:0
|
作者
Khnaisser, Christina [1 ]
Hamrouni, Hind [2 ]
Blumenthal, David B. [3 ]
Dignos, Anton [2 ]
Gamper, Johann [2 ]
机构
[1] Univ Sherbrooke, Fac Med & Hlth Sci, 3001,12e Ave Nord, Sherbrooke, PQ J1H 5N4, Canada
[2] Free Univ Bozen Bolzano, Fac Engn, Piazza Domenicani 3, I-39100 Bolzano, Italy
[3] Friedrich Alexander Univ Erlangen Nurnberg, Dept Artificial Intelligence Biomed Engn, Werner von Siemens Str 61, D-91052 Erlangen, Germany
关键词
Temporal database; Temporal anomalies; Temporal queries; Temporal constraints;
D O I
10.1007/s10796-024-10495-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Time and temporal constraints are implicit in most databases. To facilitate data analysis and quality assessment, a database should provide explicit operations to identify the violation of temporal constraints. Against this background, the purpose of this paper is threefold: (1) we identify and provide a formal definition of five common anomalies in temporal databases, (2) we propose two new relational operations that allow, respectively, to label anomalous tuples in and to retrieve the anomalous tuples from a dataset, and (3) we provide three different SQL implementations of these operations for current relational database management systems. The healthcare domain is used to illustrate the usage and utility of the temporal anomalies. Finally, an experimental evaluation on real-world and synthetic data analyses the performance of the different implementations of the anomaly operators.
引用
收藏
页数:25
相关论文
共 50 条