On the energy (In)efficiency of Hadoop clusters

被引:41
|
作者
Leverich J. [1 ]
Kozyrakis C. [1 ]
机构
[1] Computer Systems Laboratory, Stanford University
来源
Operating Systems Review (ACM) | 2010年 / 44卷 / 01期
关键词
D O I
10.1145/1740390.1740405
中图分类号
学科分类号
摘要
Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.
引用
收藏
页码:61 / 65
页数:4
相关论文
共 50 条
  • [41] Bigscale: Automatic service provisioning for hadoop clusters
    Huru, Dan (alexandru.huru2208@cti.pub.ro), 2018, Politechnica University of Bucharest (80):
  • [42] Optimizing data placement in heterogeneous Hadoop clusters
    Xiong, Runqun
    Luo, Junzhou
    Dong, Fang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04): : 1465 - 1480
  • [43] On MapReduce Scheduling in Hadoop Yarn on Heterogeneous Clusters
    Wang, Meng
    Wu, Chase Q.
    Cao, Huiyan
    Liu, Yang
    Wang, Yonggiang
    Hou, Aiqin
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1747 - 1754
  • [44] Performance and energy efficiency of big data applications in cloud environments: A Hadoop case study
    Feller, Eugen
    Ramakrishnan, Lavanya
    Morin, Christine
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 79-80 : 80 - 89
  • [45] Hadoop Scalability and Performance Testing in Homogeneous Clusters
    Manike, Chiranjeevi
    Nanda, Ashok Kumar
    Gajulagudem, Tejashwini
    PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 907 - 917
  • [46] Performance Implications of SSDs in Virtualized Hadoop Clusters
    Ahn, Sungyong
    Park, Sangkyu
    Hong, Jae-Ki
    Chang, Wooseok
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 585 - 592
  • [47] Efficiency Experiments on Hadoop and Giraph with PageRank
    Koschel, Arne
    Heine, Felix
    Astrova, Irina
    Korte, Fred
    Rossow, Thomas
    Stipkovic, Sebastian
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 328 - 331
  • [48] Towards Energy Awareness in Hadoop
    Krish, K. R.
    Iqbal, M. Safdar
    Rafique, M. Mustafa
    Butt, Ali R.
    2014 FOURTH INTERNATIONAL WORKSHOP ON NETWORK-AWARE DATA MANAGEMENT (NDM), 2014, : 16 - 22
  • [49] Reconstruction of the gray belt objects based on energy efficiency clusters
    Talipova, Liliia
    Shonina, Ekaterina
    Strelets, Ksenia
    Lapteva, Svetlana
    INTERNATIONAL SCIENCE CONFERENCE SPBWOSCE-2018: BUSINESS TECHNOLOGIES FOR SUSTAINABLE URBAN DEVELOPMENT, 2019, 110
  • [50] A taxonomy of software-based and hardware-based approaches for energy efficiency management in the Hadoop
    Shabestari, Fatemeh
    Rahmani, Amir Masoud
    Navimipour, Nima Jafari
    Jabbehdari, Sam
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 126 : 162 - 177