Clustering-based data placement in cloud computing: a predictive approach

被引:0
|
作者
Mokhtar Sellami
Haithem Mezni
Mohand Said Hacid
Mohamed Moshen Gammoudi
机构
[1] University of Jendouba,
[2] Taibah University,undefined
[3] SMART Lab,undefined
[4] ISG de Tunis,undefined
[5] Univ. Lyon,undefined
[6] University Claude Bernard Lyon 1,undefined
[7] LIRIS,undefined
[8] Higher Institute of Multimedia Arts of Manouba,undefined
[9] RIADI,undefined
来源
Cluster Computing | 2021年 / 24卷
关键词
Data placement; Resource usage; Intensive jobs; Prediction; Kernel Density Estimation; Fuzzy FCA; SOA; Autonomic computing;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, cloud computing environments have become a natural choice to host and process a huge volume of data. The combination of cloud computing and big data frameworks is an effective way to run data-intensive applications and tasks. Also, an optimal arrangement of data partitions can improve the tasks executions, which is not the case in most big data frameworks. For example, the default distribution of data partitions in Hadoop-based clouds causes several problems, which are mainly related to the load balancing and the resource usage. In addition, most existing data placement solutions are static and lack precision in the placement of data partitions. To overcome these issues, we propose a data placement approach based on the prediction of the future resources usage. We exploit Kernel Density Estimation (KDE) and Fuzzy FCA techniques to, first, forecast the workers’ and tasks’ future resource consumption and, second, cluster data partitions and intensive jobs according to the estimated resource usage. Fuzzy FCA is also used to exclude partitions and jobs that require less resources, which will reduce the needless migrations. To allow monitoring and predicting the workers’ states and the data partitions’ consumption, we modeled the big data cluster as an autonomic service-based system. The obtained results have shown that our solution outperformed existing approaches in terms of migrations rate and resource consumption.
引用
收藏
页码:3311 / 3336
页数:25
相关论文
共 50 条
  • [41] A Data Placement Strategy Based on Genetic Algorithm in Cloud Computing Platform
    Guo, Wei
    Wang, Xinjun
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 369 - 372
  • [42] Clustering-Based Numerosity Reduction for Cloud Workload Forecasting
    Rossi, Andrea
    Visentin, Andrea
    Prestwich, Steven
    Brown, Kenneth N.
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2023, 2024, 14053 : 115 - 132
  • [43] ACLNet: an attention and clustering-based cloud segmentation network
    Makwana, Dhruv
    Nag, Subhrajit
    Susladkar, Onkar
    Deshmukh, Gayatri
    Teja, Sai Chandra R.
    Mittal, Sparsh
    Mohan, C. Krishna
    REMOTE SENSING LETTERS, 2022, 13 (09) : 865 - 875
  • [44] FcVcA: A Fuzzy Clustering-based Vehicular Cloud Architecture
    Arkian, Hamid Reza
    Atani, Reza Ebrahimi
    Kamali, Saman
    2014 7TH INTERNATIONAL WORKSHOP ON COMMUNICATION TECHNOLOGIES FOR VEHICLES (NETS4CARS-FALL), 2014, : 24 - 28
  • [45] Point cloud clustering compression algorithm for spatial statistical data based on cloud computing
    Pan, Shaoming
    Li, Hong
    Tang, Ge
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2014, 42 (04): : 64 - 67
  • [46] OpenK: An Elastic Data Cleansing System with A Clustering-based Data Anomaly Detection Approach
    Tran Khanh Dang
    Dinh Khuong Nguyen
    Luc Minh Tuan
    2021 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP 2021), 2021, : 120 - 127
  • [47] Optimal Placement of Gateways in Multi-Hop Wireless Mesh Networks: A Clustering-based Approach
    Benyamina, Djohara
    Hafid, Abdelhakim
    Gendreau, Michel
    2009 IEEE 34TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2009), 2009, : 625 - +
  • [48] Novel Clustering-based Hidden Markov Model (CHMM) optimization approach for optimal PMU placement
    Girish V.
    Ananthapadmanabha T.
    Russian Electrical Engineering, 2017, 88 (3) : 178 - 184
  • [49] A Clustering-based Collaborative Filtering Approach for Mashups Recommendation over Big Data
    Hu, Rong
    Dou, Wanchun
    Liu, Jianxun
    2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 810 - 817
  • [50] A clustering-based approach to ocean model-data comparison around Antarctica
    Sun, Qiang
    Little, Christopher M.
    Barthel, Alice M.
    Padman, Laurie
    OCEAN SCIENCE, 2021, 17 (01) : 131 - 145