Ontology-based conceptual design of ETL processes for both structured and semi-structured data

Cited by: 62
Authors
Skoutas, Dimitrios [1]
Simitsis, Alkis [1]
Affiliation
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, GR-10682 Athens, Greece
Keywords
conceptual design; data semantics; data warehousing; ETL; ontology; semantic matching; workflow diagram
DOI
10.4018/jswis.2007100101
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One of the main tasks in the early stages of a data warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the data sources to the data warehouse. In this article, we propose an ontology-based approach to facilitate the conceptual design of the backstage of a data warehouse. A graph-based representation is used as a conceptual model for the datastores, so that both structured and semi-structured data are supported and handled in a uniform way. The proposed approach is based on the use of Semantic Web technologies to semantically annotate the data sources and the data warehouse, so that mappings between them can be inferred, thereby resolving the issue of heterogeneity. Specifically, a suitable application ontology is created and used to annotate the datastores. The language used for describing the ontology is OWL-DL. Based on the provided annotations, a DL reasoner is employed to infer semantic correspondences and conflicts among the datastores, and to propose a set of conceptual operations for transforming data from the source datastores to the data warehouse.
Pages: 1-24
Page count: 24