A FRAMEWORK FOR DATA CLEANING IN DATA WAREHOUSES

Author
Peng, Taoxin [1]
Affiliation
[1] Napier Univ, Sch Comp, Edinburgh EH10 5DT, Midlothian, Scotland
Keywords
Data Cleaning; Data Quality; Data Integration; Data Warehousing
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Achieving high data quality in data warehouses is a persistent challenge, and data cleaning is a crucial task in meeting it. A set of methods and tools has been developed for this purpose. However, at least two questions still need to be answered: How can the efficiency of data cleaning be improved? How can the degree of automation of data cleaning be improved? This paper addresses these two questions by presenting a novel framework, which provides an approach to managing data cleaning in data warehouses by focusing on the use of data quality dimensions and decoupling a cleaning process into several sub-processes. An initial test run of the processes in the framework demonstrates that the approach presented is efficient and scalable for data cleaning in data warehouses.
Pages: 473-478
Number of pages: 6