MrHeter: improving MapReduce performance in heterogeneous environments

被引:0
|
作者
Xiao Zhang
Yanjun Wu
Chen Zhao
机构
来源
Cluster Computing | 2016年 / 19卷
关键词
MapReduce; Heterogeneous cluster; Scheduling; Performance;
D O I
暂无
中图分类号
学科分类号
摘要
As GPUs, ARM CPUs and even FPGAs are widely used in modern computing, a data center gradually develops towards the heterogeneous clusters. However, many well-known programming models such as MapReduce are designed for homogeneous clusters and have poor performance in heterogeneous environments. In this paper, we reconsider the problem and make four contributions: (1) We analyse the causes of MapReduce poor performance in heterogeneous clusters, and the most important one is unreasonable task allocation between nodes with different computing ability. (2) Based on this, we propose MrHeter, which separates MapReduce process into map-shuffle stage and reduce stage, then constructs optimization model separately for them and gets different task allocation mlij,mrij,rij\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ml_{ij}, mr_{ij}, r_{ij}$$\end{document} for heterogeneous nodes based on computing ability.(3) In order to make it suitable for dynamic execution, we propose D-MrHeter, which includes monitor and feedback mechanism. (4) Finally, we prove that MrHeter and D-MrHeter can greatly decrease total execution time of MapReduce from 30 to 70 % in heterogeneous cluster comparing with original Hadoop, having better performance especially in the condition of heavy-workload and large-difference between nodes computing ability.
引用
收藏
页码:1691 / 1701
页数:10
相关论文
共 50 条
  • [31] Performance analysis of MapReduce program in heterogeneous cloud computing
    Lin, Wenhui
    Liu, Jun
    Journal of Networks, 2013, 8 (08) : 1734 - 1741
  • [32] Data Preloading and Data Placement for MapReduce Performance Improving
    Spivak, Anton
    Nasonov, Denis
    5TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, YSC 2016, 2016, 101 : 379 - 387
  • [33] Improving the performance of aggregate queries with cached tuples in mapReduce
    Peng, Dunlu
    Duan, Kai
    Xie, Lei
    International Journal of Database Theory and Application, 2013, 6 (01): : 13 - 24
  • [34] HAT: history-based auto-tuning MapReduce in heterogeneous environments
    Chen, Quan
    Guo, Minyi
    Deng, Qianni
    Zheng, Long
    Guo, Song
    Shen, Yao
    JOURNAL OF SUPERCOMPUTING, 2013, 64 (03): : 1038 - 1054
  • [35] HAT: history-based auto-tuning MapReduce in heterogeneous environments
    Quan Chen
    Minyi Guo
    Qianni Deng
    Long Zheng
    Song Guo
    Yao Shen
    The Journal of Supercomputing, 2013, 64 : 1038 - 1054
  • [36] Novel Scheduling Algorithms for Efficient Deployment of MapReduce Applications in Heterogeneous Computing Environments
    Hsieh, Sun-Yuan
    Chen, Chi-Ting
    Chen, Chi-Hao
    Yen, Tzu-Hsiang
    Hsiao, Hung-Chang
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (04) : 1080 - 1095
  • [37] Improving performance of heterogeneous agents
    Özcan, F
    Subrahmanian, VS
    Dix, J
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2004, 41 (2-4) : 339 - 395
  • [38] Improving Performance of Heterogeneous Agents
    Fatma Özcan
    V.S. Subrahmanian
    Jürgen Dix
    Annals of Mathematics and Artificial Intelligence, 2004, 41 : 339 - 395
  • [39] Dynamic Token Based Improving MapReduce Performance in Cloud Computing
    Zhou, Mosong
    Chen, Heng
    Dong, Xiaoshe
    Zhu, Zhengdong
    PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 180 - 186
  • [40] Task failure resilience technique for improving the performance of MapReduce in Hadoop
    Kavitha, C.
    Anita, X.
    ETRI JOURNAL, 2020, 42 (05) : 751 - 763