Ontology-based conceptual design of ETL processes for both structured and semi-structured data

Cited by: 62
Authors
Skoutas, Dimitrios [1]
Simitsis, Alkis [1]
Affiliation
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, GR-10682 Athens, Greece
Keywords
conceptual design; data semantics; data warehousing; ETL; ontology; semantic matching; workflow diagram;
DOI
10.4018/jswis.2007100101
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One of the main tasks in the early stages of a data warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the data sources to the data warehouse. In this article, we propose an ontology-based approach to facilitate the conceptual design of the backstage of a data warehouse. A graph-based representation is used as a conceptual model for the datastores, so that both structured and semi-structured data are supported and handled in a uniform way. The proposed approach is based on the use of Semantic Web technologies to semantically annotate the data sources and the data warehouse, so that mappings between them can be inferred, thereby resolving the issue of heterogeneity. Specifically, a suitable application ontology is created and used to annotate the datastores. The language used for describing the ontology is OWL-DL. Based on the provided annotations, a DL reasoner is employed to infer semantic correspondences and conflicts among the datastores, and to propose a set of conceptual operations for transforming data from the source datastores to the data warehouse.
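The idea in the abstract, annotating datastore elements with ontology classes and deriving a conceptual operation from how those classes relate, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy class hierarchy, the attribute names, the operation labels, and the hand-written subsumption check, which stands in for a real DL reasoner over an OWL-DL ontology. It is not the authors' algorithm.

```python
# Hypothetical toy ontology: each class maps to its direct superclass.
# A real system would load an OWL-DL ontology and query a DL reasoner.
SUBCLASS_OF = {
    "EuroPrice": "Price",
    "DollarPrice": "Price",
    "Price": "Thing",
    "CustomerName": "Thing",
}
ROOT = "Thing"


def ancestors(cls):
    """Return cls followed by all of its superclasses, nearest first."""
    chain = [cls]
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        chain.append(cls)
    return chain


def propose_operation(source_cls, target_cls):
    """Suggest an illustrative operation label for a source/target pair.

    The labels are made up for this sketch; None means no correspondence.
    """
    if source_cls == target_cls:
        return "copy"
    if target_cls in ancestors(source_cls):
        # Source is more specific than target: values fit as they are.
        return "generalize"
    if source_cls in ancestors(target_cls):
        # Source is more general: only some instances qualify.
        return "filter"
    shared = (set(ancestors(source_cls)) & set(ancestors(target_cls))) - {ROOT}
    if shared:
        # Sibling concepts under a common non-root class: a conversion is needed.
        return "convert"
    return None


def propose_mappings(source_annotations, target_annotations):
    """Pair every annotated source attribute with each compatible target."""
    mappings = []
    for s_attr, s_cls in source_annotations.items():
        for t_attr, t_cls in target_annotations.items():
            op = propose_operation(s_cls, t_cls)
            if op is not None:
                mappings.append((s_attr, t_attr, op))
    return mappings


# Hypothetical annotations of one source datastore and the warehouse.
source = {"orders.price_usd": "DollarPrice", "orders.cust": "CustomerName"}
warehouse = {"dw.sales.price_eur": "EuroPrice", "dw.sales.customer": "CustomerName"}
mappings = propose_mappings(source, warehouse)
```

With these made-up annotations, the price attributes are related only through the shared superclass, so the sketch flags a conversion, while the customer attributes share a class and map directly.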
Pages: 1 - 24
Number of pages: 24