Representing data quality for streaming and static data

被引:23
|
作者
Klein, Anja [1 ]
Do, Hong-Hai [1 ]
Hackenbroich, Gregor [1 ]
Karnstedt, Marcel [2 ]
Lehner, Wolfgang [3 ]
机构
[1] SAP AG, SAP Res CEC Dresden, Dresden, Germany
[2] TU Ilmenau, Dept Comp Sci & Automat, Ilmenau, Germany
[3] Tech Univ Dresden, Database Technol Grp, Dresden, Germany
关键词
D O I
10.1109/ICDEW.2007.4400967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In smart item environments, multitude of sensors are applied to capture data about product conditions and usage to guide business decisions as well as production automation processes. A big issue in this application area is posed by the restricted quality of sensor data due to limited sensor precision as well as sensor failures and malfunctions. Decisions derived on incorrect or misleading sensor data are likely to be faulty. The issue of how to efficiently provide applications with information about data quality (DQ) is still an open research problem. In this paper, we present a flexible model for the efficient transfer and management of data quality for streaming as well as static data. We propose a data stream metamodel to allow for the propagation of data quality from the sensors up to the respective business application without a significant overhead of data. Furthermore, we present the extension of the traditional RDBMS metamodel to permit the persistent storage of data quality information in a relational database. Finally, we demonstrate a data quality metadata mapping to close the gap between the streaming environment and the target database. Our solution maintains a flexible number of DQ dimensions and supports applications directly consuming streaming data or processing data filed in a persistent database.
引用
收藏
页码:3 / +
页数:2
相关论文
共 50 条
  • [21] Coconut: sortable summarizations for scalable indexes over static and streaming data series
    Haridimos Kondylakis
    Niv Dayan
    Kostas Zoumpatianos
    Themis Palpanas
    The VLDB Journal, 2019, 28 : 847 - 869
  • [22] BPF: a novel cluster boundary points detection method for static and streaming data
    Khalique, Vijdan
    Kitagawa, Hiroyuki
    Amagasa, Toshiyuki
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (07) : 2991 - 3022
  • [23] Coconut Palm: Static and Streaming Data Series Exploration Now in your Palm
    Kondylakis, Haridimos
    Dayan, Niv
    Zoumpatianos, Kostas
    Palpanas, Themis
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1941 - 1944
  • [24] BPF: a novel cluster boundary points detection method for static and streaming data
    Vijdan Khalique
    Hiroyuki Kitagawa
    Toshiyuki Amagasa
    Knowledge and Information Systems, 2023, 65 : 2991 - 3022
  • [25] Automated Bayesian quality control of streaming rain gauge data
    Hill, David J.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2013, 40 : 289 - 301
  • [26] Streaming-data algorithms for high-quality clustering
    O'Callaghan, L
    Mishra, N
    Meyerson, A
    Guha, S
    Motwani, R
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 685 - 694
  • [27] Quality of Data Delivery in Peer-to-Peer Video Streaming
    Lou, Xiaosong
    Hwang, Kai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2012, 8 (01)
  • [28] Implementation of an Architecture for Streaming Data Acquisition Incorporating Quality Attributes
    e Silva, Ricardo Gamba
    Lopes, Fabio Silva
    PROCEEDINGS OF THE EURO AMERICAN CONFERENCE ON TELEMATICS AND INFORMATION SYSTEMS (EATIS '18), 2018,
  • [29] On the Evaluation, Management and Improvement of Data Quality in Streaming Time Series
    Gomez-Omella, Meritxell
    Sierra, Basilio
    Ferreiro, Susana
    IEEE ACCESS, 2022, 10 : 81458 - 81475
  • [30] DEA with streaming data
    Dula, J. H.
    Lopez, F. J.
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2013, 41 (01): : 41 - 47