Performance of Scalable Off-The-Shelf Hardware for Data-intensive Parallel Processing using MapReduce

被引:0
|
作者
Fadzil, Ahmad Firdaus Ahmad [1 ]
Khalid, Noor Elaiza Abdul [1 ]
Manaf, Mazani [1 ]
机构
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci, Shah Alam, Malaysia
关键词
MapReduce; Parallel processing; Off-the-shelf hardware; scalability;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large data and information processing requires high processing power that usually involve supercomputers which are costly. MapReduce parallel framework introduces an automated way of distributing these large processes to many computers. This paper proposes to conduct preliminary studies on scalability using MapReduce as an automated parallel processing running on low-cost off-the-shelf hardware. The system architecture is built with collections of off-the-shelf hardware. The scalability test will be conducted by adding an off-the-shelf hardware one at a time to the architecture. MapReduce tool is used as a parallel framework to automatically distribute tasks according to available resources. Performance will be evaluated based on improvement in speedup. It is found that MapReduce is able to accommodate scalability of off-the-shelf hardware resources by automatically distributing tasks regardless of the number of hardware being added to the architecture.
引用
收藏
页码:379 / 384
页数:6
相关论文
共 50 条
  • [31] ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis
    Oehmen, Christopher
    Nieplocha, Jarek
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2006, 17 (08) : 740 - 749
  • [32] A high-performance distributed parallel file system for data-intensive computations
    Shen, XH
    Choudhary, A
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (10) : 1157 - 1167
  • [33] Performance Evaluation of IEEE 802.11p Vehicle to Infrastructure Communication Using Off-the-Shelf IEEE 802.11a Hardware
    Zhao, Yuhang
    Zhang, Hesheng
    Sun, Wei
    Bai, Zhe
    Pan, Cheng
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 3004 - 3009
  • [34] DEVELOPING A PRODUCTION DATA MANAGEMENT (PDM) SYSTEM USING OFF-THE-SHELF SOFTWARE
    MCGINNIS, B
    FLANDERS, WA
    JOURNAL OF PETROLEUM TECHNOLOGY, 1988, 40 (10): : 1321 - 1329
  • [35] Using off-the-shelf data-human interface platforms: traps and tricks
    Alessia Angeli
    Gustavo Marfia
    Norman Riedel
    Multimedia Tools and Applications, 2021, 80 : 12907 - 12929
  • [36] Using off-the-shelf data-human interface platforms: traps and tricks
    Angeli, Alessia
    Marfia, Gustavo
    Riedel, Norman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 12907 - 12929
  • [37] Using Off-the-Shelf Graphic Design Software for Validating the Operation of an Image Processing System
    Chrzaszcz, Jerzy
    SENSORS, 2021, 21 (15)
  • [38] Scalable, high-performance data mining with parallel processing
    Freitas, AA
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 477 - 477
  • [39] Parallel Data Processing in Dynamic Hybrid Computing Environment Using MapReduce
    Tang, Bing
    He, Haiwu
    Fedak, Gilles
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT II, 2014, 8631 : 1 - 14
  • [40] Analysis of Massive Industrial Data using MapReduce Framework for Parallel Processing
    Aly, Mohab
    Yacout, Soumaya
    Shaban, Yasser
    2017 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2017,