Dynamic K-Means Clustering of Workload and Cloud Resource Configuration for Cloud Elastic Model

被引:2
|
作者
Daradkeh, Tariq [1 ]
Agarwal, Anjali [1 ]
Zaman, Marzia [2 ]
Goel, Nishith [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Cistech Ltd, Ottawa, ON K2E 7K3, Canada
来源
IEEE ACCESS | 2020年 / 8卷
基金
加拿大自然科学与工程研究理事会;
关键词
Cloud computing; Data centers; Task analysis; Clustering methods; Servers; Internet; Hardware; Elastic model; kernel density estimator; dynamic k-means clustering; workload; data center configuration; logs analysis;
D O I
10.1109/ACCESS.2020.3042716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud elasticity involves timely provisioning and de-provisioning of computing resources and adjusting resources size to meet the dynamic workload demand. This requires fast, and accurate resource scaling methods at minimum cost (e.g. pay as you go) that match with workload demands. Two dynamic changing parameters must be defined in an elastic model, the workload resource demand classes, and the data center resource reconfiguration classes. These parameters are not labeled for cloud management system while data center logs are being captured. Building an advance elastic model is a critical task, which defines multiple classes under these two categories i.e. for workload and for provisioning. A dynamic method is therefore required to define (during configuration time window) the workload classes and resource provisioning classes. Unsupervised learning model such as K-Means has many challenges such as time complexity, selection of optimum number of clusters (representing the classes), and determining centroid values of the clusters. All clustering methods depend on minimizing mean square error between center of population in same class member. These methods are often enhanced using guidelines to find out the centroids, but they suffer from K-Means limitations. For the application of clustering cloud log traces, most of the reported work use K-Means clustering to label workload types. However, there is no work reported that label data center scaling classes. In this work, a novel method is proposed to analyze the characteristics of both workloads and datacenter configurations using clustering method, and is based on random variable model transformation (kernel density estimator) guide. This method enhances K-Means clustering by automatically determining optimum number of classes and finding the mean centroids for the clusters. In addition, it improves the accuracy and the time complexity of standard K-Means clustering model, by best correlating between clustering attributes using statistical correlation methods.
引用
收藏
页码:219430 / 219446
页数:17
相关论文
共 50 条
  • [21] Based on K-means clustering and CNN algorithm research in hail cloud determination
    Wang Xue
    Liao Feijia
    Xu Wenxia
    Guo Kun
    Li Guodong
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 232 - 235
  • [22] Dynamic Incremental K-means Clustering
    Aaron, Bryant
    Tamir, Dan E.
    Rishe, Naphtali D.
    Kandel, Abraham
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 308 - 313
  • [23] K-Means algorithm based on Cloud Computing
    Xu, Yunfeng
    Zhang, Yan
    Ma, Rui
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 2, 2012, : 363 - 365
  • [24] SA-rough sets K-means resource dynamic allocation strategy based on cloud computing environment
    Ying, M. (kfmengying@126.com), 1600, Universitas Ahmad Dahlan (10):
  • [25] An Improved Task Allocation Strategy in Cloud using Modified K-means Clustering Technique
    Sharma, Vrajesh
    Bala, Manju
    EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (04) : 201 - 208
  • [26] Scattered Point Cloud Simplification Algorithm Integrating k-means Clustering and Hausdorff Distance
    Li J.
    Cao Y.
    Wang Z.
    Wang G.
    Cao, Yao (772440651@qq.com), 1600, Editorial Board of Medical Journal of Wuhan University (45): : 250 - 257
  • [27] Hybrid Resource Scaling for Dynamic Workload in Cloud Computing
    Daraje, Megersa
    Shaikh, Javed
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,
  • [28] Cloud Classification Using Ground Based Images Using CBIR and K-Means Clustering
    Rudrappa, Gujanatti
    Vijapur, Nataraj
    Jadhav, Sushant
    Manage, Prabhakar
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (13): : 95 - 99
  • [29] A single plant segmentation method of maize point cloud based on Euclidean clustering and K-means clustering
    Miao, Yanlong
    Li, Shuai
    Wang, Liuyang
    Li, Han
    Qiu, Ruicheng
    Zhang, Man
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 210
  • [30] Workload forecasting based elastic resource management in edge cloud
    Liu, Boyun
    Guo, Jingjing
    Li, Chunlin
    Luo, Youlong
    COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 139