Characterizing Cloud Applications on a Google Data Center

被引:69
|
作者
Di, Sheng [1 ]
Kondo, Derrick [2 ]
Cappello, Franck [1 ,3 ]
机构
[1] INRIA, Saclay, France
[2] INRIA, Grenoble, France
[3] Univ Illinois, Urbana, IL USA
关键词
COMPUTING ENVIRONMENTS;
D O I
10.1109/ICPP.2013.56
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e. g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the K-means clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.
引用
收藏
页码:468 / 473
页数:6
相关论文
共 50 条
  • [1] Characterizing and modeling cloud applications/jobs on a Google data center
    Di, Sheng
    Kondo, Derrick
    Cappello, Franck
    JOURNAL OF SUPERCOMPUTING, 2014, 69 (01): : 139 - 160
  • [2] Characterizing and modeling cloud applications/jobs on a Google data center
    Sheng Di
    Derrick Kondo
    Franck Cappello
    The Journal of Supercomputing, 2014, 69 : 139 - 160
  • [3] Data Analysis of a Google Data Center
    Minet, Pascale
    Renault, Eric
    Khoufi, Ines
    Boumerdassi, Selma
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 342 - 343
  • [4] Cloud Dependability Analysis: Characterizing Google Cluster Infrastructure Reliability
    Mesbahi, Mohammad Reza
    Rahmani, Amir Masoud
    Hosseinzadeh, Mehdi
    2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2017, : 56 - 61
  • [5] Characterizing machines lifecycle in Google data centers
    Sebastio, Stefano
    Trivedi, Kishor S.
    Alonso, Javier
    PERFORMANCE EVALUATION, 2018, 126 : 39 - 63
  • [6] Regression Cloud Models and Their Applications in Energy Consumption of Data Center
    Zhou, Yanshuang
    Li, Na
    Li, Hong
    Zhang, Yongqiang
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2015, 2015 (2015)
  • [7] A Novel Approach for Simulation and Analysis of Cloud Data Center Applications
    Atwal, Kuldip Singh
    Bassiouni, Mostafa
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 164 - 169
  • [8] Google Cloud and Analysis of Realtime Process Data
    Langmann, R.
    PROCEEDINGS OF 2015 12TH INTERNATIONAL CONFERENCE ON REMOTE ENGINEERING AND VIRTUAL INSTRUMENTATION (REV), 2015, : 81 - 85
  • [9] Energy-Aware Consolidation Scheme for Data Center Cloud Applications
    Carrega, A.
    Repetto, M.
    2017 29TH INTERNATIONAL TELETRAFFIC CONGRESS (ITC 29), VOL 2, 2017, : 24 - 29
  • [10] Characterizing Energy per Job in Cloud Applications
    Ho, Thi Thao Nguyen
    Gribaudo, Marco
    Pernici, Barbara
    ELECTRONICS, 2016, 5 (04)