Scheduling of big data applications on distributed cloud based on QoS parameters

被引:0
|
作者
Rajinder Sandhu
Sandeep K. Sood
机构
[1] Guru Nanak Dev University,
来源
Cluster Computing | 2015年 / 18卷
关键词
Big data; Cloud computing; Quality of Service (QoS); Hadoop; Self organizing maps; K nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is one of the major technology usages for business operations in today’s competitive market. It provides organizations a powerful tool to analyze large unstructured data to make useful decisions. Result quality, time, and price associated with big data analytics are very important aspects for its success. Selection of appropriate cloud infrastructure at coarse and fine grained level will ensure better results. In this paper, a global architecture is proposed for QoS based scheduling for big data application to distributed cloud datacenter at two levels which are coarse grained and fine grained. At coarse grain level, appropriate local datacenter is selected based on network distance between user and datacenter, network throughput and total available resources using adaptive K nearest neighbor algorithm. At fine grained level, probability triplet (C, I, M) is predicted using naïve Bayes algorithm which provides probability of new application to fall in compute intensive (C), input/output intensive (I) and memory intensive (M) categories. Each datacenter is transformed into a pool of virtual clusters capable of executing specific category of jobs with specific (C, I, M) requirements using self organized maps. Novelty of study is to represent whole datacenter resources in a predefined topological ordering and executing new incoming jobs in their respective predefined virtual clusters based on their respective QoS requirements. Proposed architecture is tested on three different Amazon EMR datacenters for resource utilization, waiting time, availability, response time and estimated time to complete the job. Results indicated better QoS achievement and 33.15 % cost gain of the proposed architecture over traditional Amazon methods.
引用
收藏
页码:817 / 828
页数:11
相关论文
共 50 条
  • [21] Performance Prediction of Cloud-Based Big Data Applications
    Ardagna, Danilo
    Barbierato, Enrico
    Evangelinou, Athanasia
    Gianniti, Eugenio
    Gribaudo, Marco
    Pinto, Tulio B. M.
    Guimaraes, Anna
    da Silva, Ana Paula Couto
    Almeida, Jussara M.
    PROCEEDINGS OF THE 2018 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '18), 2018, : 192 - 199
  • [22] Research on Ship Data Big Data Parallel Scheduling Algorithm Based on Cloud Computing
    Li, Xin
    Guo, Jingjing
    JOURNAL OF COASTAL RESEARCH, 2019, : 535 - 539
  • [23] Cost-efficient dynamic scheduling of big data applications in apache spark on cloud
    Islam, Muhammed Tawfiqul
    Srirama, Satish Narayana
    Karunasekera, Shanika
    Buyya, Rajkumar
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 162
  • [24] Adaptive cache policy scheduling for big data applications on distributed tiered storage system
    Gu, Rong
    Li, Chongjie
    Shu, Peng
    Yuan, Chunfeng
    Huang, Yihua
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (15):
  • [25] PSO-based novel resource scheduling technique to improve QoS parameters in cloud computing
    Kumar, Mohit
    Sharma, S. C.
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16): : 12103 - 12126
  • [26] Multiple QoS Priority Based Scheduling in Cloud Computing
    Meriam, Essaies
    Tabbane, Nabil
    2016 INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2016, : 276 - 281
  • [27] PSO-based novel resource scheduling technique to improve QoS parameters in cloud computing
    Mohit Kumar
    S. C. Sharma
    Neural Computing and Applications, 2020, 32 : 12103 - 12126
  • [28] Modified scheduling algorithm for cloud workflow based on QoS
    Wang, Y. (wangyan3215931@163.com), 1600, Northeast University (35):
  • [29] QoS Based Dynamic Task Scheduling in IaaS Cloud
    Anbazhagi
    Tamilselvan, Latha
    Shakkeera
    2014 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2014,
  • [30] Energy-Aware Cloud Workflow Applications Scheduling With Geo-Distributed Data
    Li, Xiaoping
    Yu, Wei
    Ruiz, Ruben
    Zhu, Jie
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (02) : 891 - 903