Active XML-based Web data integration

Cited by: 16
Authors
Salem, Rashed [1]
Boussaid, Omar [1]
Darmont, Jerome [1]
Affiliations
[1] Univ Lyon, ERIC Lyon 2, F-69676 Bron, France
Keywords
Real-time Web data integration; Metadata; Integration services; Active rules; Event mining; DATA WAREHOUSES; ISSUES; OLAP
DOI
10.1007/s10796-012-9405-6
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Today, the Web is the largest source of information worldwide. There is currently a strong trend for decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) to move onto the Web, especially into the cloud. Integrating data into DW/BI applications is a critical and time-consuming task. To support better decisions in DW/BI applications, next-generation data integration imposes new requirements on data integration systems, beyond those of traditional data integration. In this paper, we propose a generic, metadata-based, service-oriented, and event-driven approach for integrating Web data in a timely and autonomous manner. Besides handling data heterogeneity, distribution, and interoperability, our approach satisfies near real-time requirements and realizes active data integration. To this end, we design and develop a framework that uses Web standards (e.g., XML and Web services) to tackle data heterogeneity, distribution, and interoperability issues. Moreover, our framework uses Active XML (AXML) to warehouse passive data, as well as services to integrate active and dynamic data on the fly. AXML embedded services and change detection services ensure near real-time data integration. Furthermore, the idea of integrating Web data actively and autonomously revolves around mining the events logged by the data integration environment. Therefore, we propose an incremental XML-based algorithm for mining association rules from logged events. We then define active rules dynamically from the mined data to automate and reactivate integration tasks. Finally, as a proof of concept, we implement a framework prototype as a Web application using open-source tools.
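The abstract's last steps (mining association rules from logged events incrementally, then deriving active rules from them) can be illustrated with a minimal Python sketch. This is not the authors' XML-based algorithm: it uses simple pairwise counting over in-memory event sets, and every name in it (`IncrementalRuleMiner`, `to_active_rule`, the event names, the threshold values, the `on`/`if`/`do` record layout) is a hypothetical choice made for illustration only.

```python
from collections import Counter
from itertools import combinations


class IncrementalRuleMiner:
    """Keeps running itemset counts so new log batches need no rescan.

    Illustrative only: handles event pairs, not arbitrary itemsets.
    """

    def __init__(self, min_support=0.5, min_confidence=0.8):
        self.min_support = min_support
        self.min_confidence = min_confidence
        self.counts = Counter()  # frozenset of events -> transaction count
        self.total = 0           # number of logged transactions seen so far

    def add_batch(self, transactions):
        """Fold a new batch of logged event sets into the running counts."""
        for events in transactions:
            items = sorted(set(events))
            self.total += 1
            for size in (1, 2):
                for combo in combinations(items, size):
                    self.counts[frozenset(combo)] += 1

    def rules(self):
        """Return sorted (antecedent, consequent, support, confidence) tuples."""
        found = []
        for itemset, count in self.counts.items():
            if len(itemset) != 2 or count / self.total < self.min_support:
                continue
            a, b = sorted(itemset)
            for lhs, rhs in ((a, b), (b, a)):
                confidence = count / self.counts[frozenset([lhs])]
                if confidence >= self.min_confidence:
                    found.append((lhs, rhs, count / self.total, confidence))
        return sorted(found)


def to_active_rule(rule):
    """Map a mined rule to a hypothetical event-condition-action record."""
    lhs, rhs, _support, confidence = rule
    return {"on": lhs, "if": f"confidence >= {confidence:.2f}", "do": rhs}
```

Calling `add_batch` once per new chunk of the event log and then `rules()` yields the rules supported by all events seen so far; only the counters are updated, so earlier batches are never re-read. A real system along the lines of the abstract would read events from XML logs and attach the derived rules to integration services, which this sketch does not attempt.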
Pages: 371-398
Page count: 28
Related papers
50 in total
  • [31] XML-Based specification for web services document security
    Bhatti, R
    Bertino, E
    Ghafoor, A
    Joshi, JBD
    COMPUTER, 2004, 37 (04) : 41 - +
  • [32] An XML-based wrapper generator for Web information extraction
    Liu, L
    Han, W
    Buttler, D
    Pu, C
    Tang, W
    SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 540 - 543
  • [33] XML-based integration for Internet publishers - the case of BertelsmannSpringer
    Rawolle, J
    Ade, J
    Schumann, M
    WIRTSCHAFTSINFORMATIK, 2002, 44 (01): 19 - 28
  • [34] XML-based monitoring and operating for Web Services in automation
    Braune, Annerose
    Hennig, Stefan
    Schaft, Torsten
    2007 5TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2007, : 797 - 802
  • [35] Coyote: An XML-based framework for web services testing
    Tsai, WT
    Paul, R
    Song, WW
    Cao, ZB
    7TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING, PROCEEDINGS, 2002, : 173 - 174
  • [36] Web Services: XML-based system integrated techniques
    Yu, SC
    Chen, RS
    ELECTRONIC LIBRARY, 2003, 21 (04): 358 - 366
  • [37] XML-Based Network Integration of Information in CAD Systems
    Shchekin A.V.
    Tribushinin I.N.
    Shchekin, A.V. (schekin@inbox.ru); Tribushinin, I.N. (tribushinin@mail.ru), 1600, Pleiades journals (40): 1073 - 1077
  • [38] A review of XML-based supply-chain integration
    Nurmilaakso, JM
    Kotinurmi, P
    PRODUCTION PLANNING & CONTROL, 2004, 15 (06) : 608 - 621
  • [39] Web-based Real-Time Decision Support System Active XML-based Metadata
    Alrefae, Abdullah
    Cao, Jinli
    2014 GLOBAL SUMMIT ON COMPUTER & INFORMATION TECHNOLOGY (GSCIT), 2014,
  • [40] XML-based visual data mining in medicine
    Dugas, M
    Hoffmann, E
    Janko, S
    Hahnewald, S
    Matis, T
    Überla, K
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 1324 - 1328