BigDimETL with NoSQL Database

被引:10
|
作者
Mallek, Hana [1 ]
Ghozzi, Faiza [1 ]
Teste, Olivier [2 ]
Gargouri, Faiez [1 ]
机构
[1] Inst Comp Sci & Multimedia Sfax, MIRACL Lab, BP 1030, Sfax, Tunisia
[2] Univ Toulouse, IRIT 5505, 118 Route Narbonne, F-31062 Toulouse, France
关键词
ETL; Hbase; BigData; Twitter; Join operation;
D O I
10.1016/j.procs.2018.08.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last decade, we have witnessed an explosion of data volume available on the Web. This is due to the rapid technological advances with the availability of smart devices and social networks such as Twitter, Facebook, Instagram, etc. Hence, the concept of Big Data was created to face this constant increase. In this context, many domains should take in consideration this growth of data, especially, the Business Intelligence (BI) domain. Where, it is full of important knowledge that is crucial for effective decision making. However, new problems and challenges have appeared for the Decision Support System that must be addressed. Accordingly, the purpose of this paper is to adapt Extract-Transform-Load (ETL) processes with Big Data technologies, in order to support decision-making and knowledge discovery. In this paper, we propose a new approach called Big Dimensional ETL (BigDimETL) dealing with ETL development process and taking into account the Multidimensional structure. In addition, in order to accelerate data handling we used the MapReduce paradigm and Hbase as a distributed storage mechanism that provides data warehousing capabilities. Experimental results show that our ETL operation adaptation can perform well especially with Join operation. (C) 2018 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:798 / 807
页数:10
相关论文
共 50 条
  • [1] BASE analysis of NoSQL database
    Chandra, Deka Ganesh
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 52 : 13 - 21
  • [2] Renormalization of NoSQL Database Schemas
    Mior, Michael J.
    Salem, Kenneth
    CONCEPTUAL MODELING, ER 2018, 2018, 11157 : 479 - 487
  • [3] Standardization of NoSQL Database Languages
    Bach, Malgorzata
    Werner, Aleksandra
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2014, 2014, 424 : 50 - 60
  • [4] Database Design for NoSQL Systems
    Bugiotti, Francesca
    Cabibbo, Luca
    Atzeni, Paolo
    Torlone, Riccardo
    CONCEPTUAL MODELING, 2014, 8824 : 223 - 231
  • [5] Application of NoSQL Database MongoDB
    Gu, Yunhua
    Shen, Shu
    Wang, Jin
    Kim, Jeong-Uk
    2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 158 - 159
  • [6] Analysis of Various NoSql Database
    Srivastava, Pragati Prakash
    Goyal, Saumya
    Kumar, Anil
    2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015, : 539 - 544
  • [7] A study of NoSQL Database for enterprises
    Chen, Jeang-Kuo
    Lee, Wei-Zhe
    2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 436 - 440
  • [8] Biological Database in NoSQL Document-oriented Database
    Lopes, Ingrid Santana
    Holanda, Maristela
    2018 13TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2018,
  • [9] MEDICAL GRAPH DATABASE ON NOSQL ENVIRONMENT
    Mocean, Loredana
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY (IE 2017): EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2017, : 303 - 310
  • [10] DISTRIBUTED DATABASE OPTIMIZATIONS WITH NoSQL MEMBERS
    Popa, George Dan
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2015, 77 (02): : 55 - 64