EVALUATING THE SCALABILITY OF BIG DATA FRAMEWORKS

被引:1
|
作者
Sanchez, David [2 ]
Solarte, Oswaldo [2 ]
Bucheli, Victor [2 ]
Ordonez, Hugo [1 ]
机构
[1] Univ San Buenaventura, Cali, Colombia
[2] Univ Valle, Cali, Colombia
来源
关键词
Scalability; Isoefficiency; Big Data; Hadoop; Spark; MapReduce;
D O I
10.12694/scpe.v19i3.1402
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The aim of this paper is to present a method based on the isoefficiency model for assessing the scalability in big data environments. The programs word count and sort were implemented and compared in Hadoop and Spark. The results confirm that isoefficiency presented a linear growth as the size of the data sets was increased. They were checked experimentally to ensure that the evaluated frameworks are scalable and a sublinear function was obtained. This paper discusses how scalability in big data is governed by a constant of scalability (beta).
引用
收藏
页码:301 / 307
页数:7
相关论文
共 50 条
  • [21] Challenges in High Performance Big Data Frameworks
    Papadopoulos, Alessandro V.
    Maggio, Martina
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2018, : 153 - 156
  • [22] Towards an analysis of the epistemic frameworks of big data
    Becerra, Gaston
    Castorina, Jose Antonio
    CINTA DE MOEBIO, 2023, 76 : 50 - 63
  • [23] Unstructured medical frameworks using big data
    Banu, A. Arjuman
    Reshmy, A. K.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 : 234 - 241
  • [24] Big Data Security Survey on Frameworks and Algorithms
    Chandra, Sudipta
    Ray, Soumya
    Goswami, R. T.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 48 - 54
  • [25] Experiences Evaluating OpenStack Network Data Plane Performance and Scalability
    Karacali, Bengi
    Tracey, John M.
    NOMS 2016 - 2016 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2016, : 901 - 906
  • [26] Database Sharding: To Provide Fault Tolerance and Scalability of Big Data on the Cloud
    Bagui, Sikha
    Nguyen, Loi Tang
    INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2015, 5 (02) : 36 - 52
  • [27] BIG DATA SCALABILITY, METHODS AND ITS IMPLICATIONS: A SURVEY OF CURRENT PRACTICE
    Amudhavel, J.
    Sathian, D.
    Raghav, R. S.
    Rao, Dhanawada Nirmala
    Dhavachelvan, P.
    Kumar, K. Prem
    ICARCSET'15: PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH IN COMPUTER SCIENCE ENGINEERING & TECHNOLOGY (ICARCSET - 2015), 2015,
  • [28] Big Data Processing: Scalability with Extreme Single-Node Performance
    Govindaraju, Venkatraman
    Idicula, Sam
    Agrawal, Sandeep
    Vardarajan, Venkatanathan
    Raghavan, Arun
    Wen, Jarod
    Balkesen, Cagri
    Giannikis, Georgios
    Agarwal, Nipun
    Sedlar, Eric
    2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 129 - 136
  • [29] Scalability and Parallelization of Sequential Processing: Big Data Demands and Information Algebras
    Golubtsov, Peter
    ADVANCES IN INTELLIGENT SYSTEMS, COMPUTER SCIENCE AND DIGITAL ECONOMICS, 2020, 1127 : 274 - 298
  • [30] A Survey on Big Data Processing Frameworks for Mobility Analytics
    Doulkeridis C.
    Vlachou A.
    Pelekis N.
    Theodoridis Y.
    SIGMOD Record, 2021, 50 (02): : 18 - 29