Characterizing and modeling cloud applications/jobs on a Google data center

被引:0
|
作者
Sheng Di
Derrick Kondo
Franck Cappello
机构
[1] INRIA,
[2] Argonne National Laboratory,undefined
来源
关键词
Google data center; Cloud task; Characterization and analysis; Large-scale system trace;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we characterize and model Google applications and jobs, based on a 1-month Google trace from a large-scale Google data center. We address four contributions: (1) we compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources and execution types; (2) we analyze the classification of applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage; (3) we study the correlation of Google application properties and running features (e.g., job priority and scheduling class); (4) we finally build a model that can simulate Google jobs/tasks and dynamic events, in accordance with Google trace. Experiments show that the tasks simulated based on our model exhibit fairly analogous features with those in Google trace. 95+ % of tasks’ simulation errors are <\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<$$\end{document}20 %, confirming a high accuracy of our simulation model.
引用
收藏
页码:139 / 160
页数:21
相关论文
共 50 条
  • [41] Power Modeling for Energy-Efficient Resource Management in a Cloud Data Center
    Tarafdar, Anurina
    Sarkar, Soumi
    Das, Rajib K.
    Khatua, Sunirmal
    JOURNAL OF GRID COMPUTING, 2023, 21 (01)
  • [42] Green and Heuristics-Based Consolidation Scheme for Data Center Cloud Applications
    Carrega, Alessandro
    Repetto, Matteo
    DIGITAL COMMUNICATION: TOWARDS A SMART AND SECURE FUTURE INTERNET, TIWDC 2017, 2017, 766 : 230 - 250
  • [43] Training Data Reduction for Performance Models of Data Analytics Jobs in the Cloud
    Will, Jonathan
    Arslan, Onur
    Bader, Jonathan
    Scheinert, Dominik
    Thamsen, Lauritz
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3141 - 3146
  • [44] Cloud based Text extraction using Google Cloud Vison for Visually Impaired applications
    Vaithiyanathan, D.
    Muniraj, Manigandan
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019), 2019, : 90 - 96
  • [45] Modeling and Testing of Cloud Applications
    Chan, W. K.
    Mei, Lijun
    Zhang, Zhenyu
    2009 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC 2009), 2009, : 107 - +
  • [46] Characterizing and Evaluating Different Deployment Approaches for Cloud Applications
    Wettinger, Johannes
    Andrikopoulos, Vasilios
    Strauch, Steve
    Leymann, Frank
    2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2014, : 205 - 214
  • [47] Characterizing the Cost-Accuracy Performance of Cloud Applications
    Rathnayake, Sunimal
    Ramapantulu, Lavanya
    Teo, Yong Meng
    49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS, ICPP 2020, 2020,
  • [48] MPEC: Distributed Matrix Multiplication Performance Modeling on a Scale-Out Cloud Environment for Data Mining Jobs
    Kim, Jeongchul
    Son, Myungjun
    Lee, Kyungyong
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (01) : 521 - 538
  • [49] Transcriptomics and epigenetic data integration learning module on Google Cloud
    Ruprecht, Nathan A.
    Kennedy, Joshua D.
    Bansal, Benu
    Singhal, Sonalika
    Sens, Donald
    Maggio, Angela
    Doe, Valena
    Hawkins, Dale
    Campbel, Ross
    O'Connell, Kyle
    Gill, Jappreet Singh
    Schaefer, Kalli
    Singhal, Sandeep K.
    BRIEFINGS IN BIOINFORMATICS, 2024, 25
  • [50] Formally modeling and analyzing cost-aware job scheduling for cloud data center
    Fan, Guisheng
    Chen, Liqiong
    Yu, Huiqun
    Liu, Dongmei
    SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09): : 1536 - 1559