Representing data quality for streaming and static data

被引:23
|
作者
Klein, Anja [1 ]
Do, Hong-Hai [1 ]
Hackenbroich, Gregor [1 ]
Karnstedt, Marcel [2 ]
Lehner, Wolfgang [3 ]
机构
[1] SAP AG, SAP Res CEC Dresden, Dresden, Germany
[2] TU Ilmenau, Dept Comp Sci & Automat, Ilmenau, Germany
[3] Tech Univ Dresden, Database Technol Grp, Dresden, Germany
关键词
D O I
10.1109/ICDEW.2007.4400967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In smart item environments, multitude of sensors are applied to capture data about product conditions and usage to guide business decisions as well as production automation processes. A big issue in this application area is posed by the restricted quality of sensor data due to limited sensor precision as well as sensor failures and malfunctions. Decisions derived on incorrect or misleading sensor data are likely to be faulty. The issue of how to efficiently provide applications with information about data quality (DQ) is still an open research problem. In this paper, we present a flexible model for the efficient transfer and management of data quality for streaming as well as static data. We propose a data stream metamodel to allow for the propagation of data quality from the sensors up to the respective business application without a significant overhead of data. Furthermore, we present the extension of the traditional RDBMS metamodel to permit the persistent storage of data quality information in a relational database. Finally, we demonstrate a data quality metadata mapping to close the gap between the streaming environment and the target database. Our solution maintains a flexible number of DQ dimensions and supports applications directly consuming streaming data or processing data filed in a persistent database.
引用
收藏
页码:3 / +
页数:2
相关论文
共 50 条
  • [1] Multivariate Correlations Discovery in Static and Streaming Data
    Minartz, Koen
    d'Hondt, Jens E.
    Papapetrou, Odysseas
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (06): : 1266 - 1278
  • [2] Semantic access to streaming and static data at Siemens
    Kharlamov, Evgeny
    Mailis, Theofilos
    Mehdi, Gulnar
    Neuenstadt, Christian
    Oezcep, Oezguer
    Roshchin, Mikhail
    Solomakhina, Nina
    Soylu, Ahmet
    Svingos, Christoforos
    Brandt, Sebastian
    Giese, Martin
    Ioannidis, Yannis
    Lamparter, Steffen
    Moeller, Ralf
    Kotidis, Yannis
    Waaler, Arild
    JOURNAL OF WEB SEMANTICS, 2017, 44 : 54 - 74
  • [3] Static and Streaming Data Structures for Frechet Distance Queries
    Filtser, Arnold
    Filtser, Omrit
    PROCEEDINGS OF THE 2021 ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2021, : 1150 - 1170
  • [4] Static and Streaming Data Structures for Frechet Distance Queries
    Filtser, Arnold
    Filtser, Omrit
    ACM TRANSACTIONS ON ALGORITHMS, 2023, 19 (04)
  • [5] Quality assurance of streaming oceanographic data sets: using the data stream as a metric of quality
    Walsh, Ian D.
    Murphy, David J.
    Martini, Kim
    OCEANS 2017 - ANCHORAGE, 2017,
  • [6] Warehousing and Analyzing Streaming Data Quality Information
    Olbrich, Sebastian
    Klein, Anja
    AMCIS 2010 PROCEEDINGS, 2010,
  • [7] Dynamic Data Quality for Static Blockchains
    Labouseur, Alan G.
    Matheus, Carolyn C.
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 19 - 21
  • [8] Reducing Data Request Contentions for Improved Streaming Quality
    Liu, Yao
    Li, Fei
    Guo, Lei
    Chen, Songqing
    NOSSDAV 2010: PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO, 2010, : 33 - 38
  • [9] Link Quality Estimation for Adaptive Data Streaming in WSN
    Jayasri, T.
    Hemalatha, M.
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 94 (03) : 1543 - 1562
  • [10] Efficiently mining high utility sequential patterns in static and streaming data
    Zihayat, Morteza
    Wu, Cheng-Wei
    An, Aijun
    Tseng, Vincent S.
    Lin, Chien
    INTELLIGENT DATA ANALYSIS, 2017, 21 : S103 - S135