IMapC: Inner MAPping Combiner to Enhance the Performance of MapReduce in Hadoop

被引:4
|
作者
Kavitha, C. [1 ]
Srividhya, S. R. [1 ]
Lai, Wen-Cheng [2 ,3 ]
Mani, Vinodhini [1 ]
机构
[1] Sathyabama Inst Sci & Technol, Dept Comp Sci & Engn, Chennai 600119, Tamil Nadu, India
[2] Natl Yunlin Univ Sci & Technol, Bachelor Program Ind Projects, Touliu 640301, Yunlin, Taiwan
[3] Natl Yunlin Univ Sci & Technol, Dept Elect Engn, Touliu 640301, Yunlin, Taiwan
关键词
big data; combiner; distributed storage; hadoop; mapreduce; sort; task failure resilience; wordcount;
D O I
10.3390/electronics11101599
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hadoop is a framework for storing and processing huge amounts of data. With HDFS, large data sets can be managed on commodity hardware. MapReduce is a programming model for processing vast amounts of data in parallel. Mapping and reducing can be performed by using the MapReduce programming framework. A very large amount of data is transferred from Mapper to Reducer without any filtering or recursion, resulting in overdrawn bandwidth. In this paper, we introduce an algorithm called Inner MAPping Combiner (IMapC) for the map phase. This algorithm in the Mapper combines the values of recurring keys. In order to test the efficiency of the algorithm, different approaches were tested. According to the test, MapReduce programs that are implemented with the Default Combiner (DC) of IMapC will be 70% more efficient than those that are implemented without one. To make computations significantly faster, this work can be combined with MapReduce.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Scalable Performance Tuning of Hadoop MapReduce: A Noisy Gradient Approach
    Kumar, Sandeep
    Padakandla, Sindhu
    Chandrashekar, L.
    Parihar, Priyank
    Gopinath, K.
    Bhatnagar, Shalabh
    2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 375 - 382
  • [22] IDaPS - Improved data-locality aware data placement strategy based on Markov clustering to enhance MapReduce performance on Hadoop
    Vengadeswaran, S.
    Balasundaram, S. R.
    Dhavakumar, P.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (03)
  • [23] Performance Analysis of MapReduce on OpenStack-based Hadoop Virtual Cluster
    Ahmad, Nazrul M.
    Yaacob, Asrul Hadi
    Amin, Anang Hudaya Muhamad
    Kannan, Subarmaniam
    2014 IEEE 2ND INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATION TECHNOLOGIES (ISTT), 2014, : 132 - 137
  • [24] Comparison and Improvement of Hadoop MapReduce Performance Prediction Models in the Private Cloud
    Wang, Nini
    Yang, Jian
    Lu, Zhihui
    Li, Xiaoyan
    Wu, Jie
    ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 77 - 91
  • [25] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [26] Enhancing Performance of Hadoop and Mapreduce for Scientific Data using NoSQL Database
    Alshammari, Hamoud
    Bajwa, Hassan
    Lee, Jeongkyu
    2015 IEEE LONG ISLAND SYSTEMS, APPLICATIONS AND TECHNOLOGY CONFERENCE (LISAT), 2015,
  • [27] Performance Analysis of Hadoop MapReduce on an OpenNebula Cloud with KVM and OpenVZ Virtualizations
    Magalhaes Vasconcelos, Pedro Roger
    de Araujo Freitas, Gisele Azevedo
    2014 9TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2014, : 471 - 476
  • [28] An Open Source Project for Tuning and Analyzing MapReduce Performance in Hadoop and Spark
    Chen, Donghua
    Zhang, Runtong
    IEEE SOFTWARE, 2022, 39 (01) : 61 - 69
  • [29] Performance Evaluation and Tuning for MapReduce Computing in Hadoop Distributed File System
    Kim, Jongyeop
    Kumar, Ashwin T. K.
    George, K. M.
    Park, Nohpill
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2015, : 62 - 68
  • [30] Observations on Factors Affecting Performance of MapReduce based Apriori on Hadoop Cluster
    Singh, Sudhakar
    Garg, Rakhi
    Mishra, P. K.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 87 - 94