INTEGRATION OF DATA FROM HETEROGENEOUS SOURCES USING ETL TECHNOLOGY

Cited by: 9
Author
Macura, Marek [1]
Affiliation
[1] AGH Univ Sci & Technol, Krakow, Poland
Source
COMPUTER SCIENCE-AGH | 2014 / Vol. 15 / No. 02
Keywords
data integration; integration approaches; ETL technology; knowledge discovery from data; business intelligence;
DOI
10.7494/csci.2014.15.2.109
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Data integration is a crucial issue in environments with heterogeneous data sources, and such heterogeneity is becoming widespread. To gain useful information and knowledge from various data sources, we must first solve data integration problems so that appropriate analytical methods can be applied to comprehensive and uniform data. This activity is known as the knowledge discovery from data process. Approaches to the data integration problem are therefore of great interest and bring us closer to the "age of information". This paper presents an architecture that implements the knowledge discovery from data process. The solution combines ETL technology with a wrapper layer known from mediated systems. It also provides semantic integration through a connection mechanism between data elements. The solution allows for the integration of any data sources and the implementation of analytical methods in one environment. The proposed environment is verified by applying it to data sources in the foundry industry.
Pages: 109 - 132
Page count: 24