A FRAMEWORK FOR DATA CLEANING IN DATA WAREHOUSES

被引:0
|
作者
Peng, Taoxin [1 ]
机构
[1] Napier Univ, Sch Comp, Edinburgh EH10 5DT, Midlothian, Scotland
关键词
Data Cleaning; Data Quality; Data Integration; Data Warehousing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is a persistent challenge to achieve a high quality of data in data warehouses. Data cleaning is a crucial task for such a challenge. To deal with this challenge, a set of methods and tools has been developed. However, there are still at least two questions needed to be answered: How to improve the efficiency while performing data cleaning? How to improve the degree of automation when performing data cleaning? This paper challenges these two questions by presenting a novel framework, which provides an approach to managing data cleaning in data warehouses by focusing on the use of data quality dimensions, and decoupling a cleaning process into several sub-processes. Initial test run of the processes in the framework demonstrates that the approach presented is efficient and scalable for data cleaning in data warehouses.
引用
收藏
页码:473 / 478
页数:6
相关论文
共 50 条
  • [21] Populating Data Warehouses with Semantic Data
    Nebot, V.
    Berlanga, R.
    IEEE LATIN AMERICA TRANSACTIONS, 2010, 8 (02) : 150 - 157
  • [22] Identifying data sources for data warehouses
    Koncilia, C
    Pozewaunig, H
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 213 - 218
  • [23] Querying Compressed Data in Data Warehouses
    Anindya Datta
    Helen Thomas
    Information Technology and Management, 2002, 3 (4) : 353 - 386
  • [24] DATA ANALYTICAL PROCESSING IN DATA WAREHOUSES
    Rostek, Katarzyna
    FOUNDATIONS OF MANAGEMENT, 2010, 2 (01) : 99 - 116
  • [25] Data mining and data warehouses - An overview
    Gray, P
    ASSOCIATION FOR INFORMATION SYSTEMS PROCEEDING OF THE AMERICAS CONFERENCE ON INFORMATION SYSTEMS, 1997, : 857 - 859
  • [26] Minimizing detail data in data warehouses
    Akinde, MO
    Jensen, OG
    Böhlen, MH
    ADVANCES IN DATABASE TECHNOLOGY - EDBT'98, 1998, 1377 : 293 - 307
  • [27] A Data Masking Technique for Data Warehouses
    Santos, Ricardo Jorge
    Bernardino, Jorge
    Vieira, Marco
    PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 61 - 69
  • [28] Building Data Warehouses in the Era of Big Data An Approach for Scalable and Flexible Big Data Warehouses
    Costa, Carlos
    Santos, Maribel Yasmina
    ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2019), 2019, 11483 : 693 - 695
  • [29] Stonebraker on Data Warehouses
    Stonebraker, Michael
    COMMUNICATIONS OF THE ACM, 2011, 54 (05) : 10 - 11
  • [30] PHILIPS DATA WAREHOUSES
    DALY, J
    PROCEEDINGS : SEAS ANNIVERSARY MEETING 1989, VOLS 1 AND 2: THE CORPORATE NETWORK, 1989, : 1099 - 1105