Logical big data integration and near real-time data analytics

被引:6
|
作者
Silva, Bruno [1 ]
Moreira, Jose [1 ,2 ]
Costa, Rogerio Luis de C. [3 ]
机构
[1] Univ Aveiro, Inst Elect & Informat Engn IEETA, LASI, P-3810193 Aveiro, Portugal
[2] Univ Aveiro, Dept Elect Telecommun & Informat DETI, P-3810193 Aveiro, Portugal
[3] Polytech Leiria, Comp Sci & Commun Res Ctr CIIC, P-2411901 Leiria, Portugal
关键词
Big data integration; Distributed databases; Near real-time OLAP;
D O I
10.1016/j.datak.2023.102185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of decision-making, there is a growing demand for near real-time data that traditional solutions, like data warehousing based on long-running ETL processes, cannot fully meet. On the other hand, existing logical data integration solutions are challenging because users must focus on data location and distribution details rather than on data analytics and decision-making. EasyBDI is an open-source system that provides logical integration of data and high-level business-oriented abstractions. It uses schema matching, integration, and mapping techniques, to automatically identify partitioned data and propose a global schema. Users can then specify star schemas based on global entities and submit analytical queries to retrieve data from distributed data sources without knowing the organization and other technical details of the underlying systems. This work presents the algorithms and methods for global schema creation and query execution. Experimental results show that the overhead imposed by logical integration layers is relatively small compared to the execution times of distributed queries.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] An Event-Based Near Real-Time Data Integration Architecture
    Naeem, M. Asif
    Dobbie, Gillian
    Weber, Gerald
    EDOCW: 2008 12TH ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE WORKSHOPS, 2008, : 472 - 475
  • [42] Real-Time Big Data Analytics and Proactive Traffic Safety Management Visualization System
    Abdel-Aty, Mohamed
    Zheng, Ou
    Wu, Yina
    Abdelraouf, Amr
    Rim, Heesub
    Li, Pei
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2023, 149 (08)
  • [43] Toward a smart health: big data analytics and IoT for real-time miscarriage prediction
    Asri, Hiba
    Jarir, Zahi
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [44] Real-Time Large-Scale Big Data Networks Analytics and Visualization Architecture
    Chopade, Pravin
    Zhan, Justin
    Roy, Kaushik
    Flurchick, Kenneth
    2015 12TH INTERNATIONAL CONFERENCE & EXPO ON EMERGING TECHNOLOGIES FOR A SMARTER WORLD (CEWIT), 2015,
  • [45] HBelt: Integrating an Incremental ETL Pipeline with a Big Data Store for Real-Time Analytics
    Qu, Weiping
    Shankar, Sahana
    Ganza, Sandy
    Dessloch, Stefan
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2015, 2015, 9282 : 123 - 137
  • [46] MOLESTRA: A Multi-Task Learning Approach for Real-Time Big Data Analytics
    Demertzis, Konstantinos
    Iliadis, Lazaros
    Anezakis, Vardis-Dimitris
    2018 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2018,
  • [47] Toward a smart health: big data analytics and IoT for real-time miscarriage prediction
    Hiba Asri
    Zahi Jarir
    Journal of Big Data, 10
  • [48] Real-Time Tweet Analytics Using Hybrid Hashtags on Twitter Big Data Streams
    Gupta, Vibhuti
    Hewett, Rattikorn
    INFORMATION, 2020, 11 (07)
  • [49] Real-time misfire detection of large gas engine using big data analytics
    Szabo, Jozsef Z.
    Bakucz, Peter
    2018 IEEE 16TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY 2018), 2018, : 215 - 220
  • [50] Exploiting IoT and big data analytics: Defining Smart Digital City using real-time urban data
    Rathore, M. Mazhar
    Paul, Anand
    Hong, Won-Hwa
    Seo, HyunCheol
    Awan, Imtiaz
    Saeed, Sharjil
    SUSTAINABLE CITIES AND SOCIETY, 2018, 40 : 600 - 610