Efficient incremental loading in ETL processing for real-time data integration

Authors
Neepa Biswas
Anamitra Sarkar
Kartick Chandra Mondal
Affiliation
[1] Jadavpur University, Department of Information Technology
Keywords
Data warehouse; Code-based ETL; ETL tools; Pygrametl; Petl; Scriptella; Incremental load; Bulk load; CDC
Abstract
ETL (extract, transform, load) is the widely used standard process for creating and maintaining a data warehouse (DW). It is the most resource-, cost-, and time-demanding process in DW implementation and maintenance. Many graphical user interface (GUI)-based solutions are now available to facilitate ETL processes. Despite the popularity of GUI-based tools, this approach still has some drawbacks. This paper focuses on an alternative approach to ETL development: hand coding. In some contexts, such as research and academic work, a custom-coded solution is appropriate and can be cheaper, faster, and more maintainable than GUI-based tools. This article studies some well-known code-based open-source ETL tools developed by the academic community and describes their architecture and implementation details. The aim of the paper is to present a comparative evaluation of these code-based ETL tools. Finally, an efficient ETL model is designed to meet present-day near-real-time requirements.
Pages: 53-61
Number of pages: 8