Scheduling of big data applications on distributed cloud based on QoS parameters

被引:0
|
作者
Rajinder Sandhu
Sandeep K. Sood
机构
[1] Guru Nanak Dev University,
来源
Cluster Computing | 2015年 / 18卷
关键词
Big data; Cloud computing; Quality of Service (QoS); Hadoop; Self organizing maps; K nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is one of the major technology usages for business operations in today’s competitive market. It provides organizations a powerful tool to analyze large unstructured data to make useful decisions. Result quality, time, and price associated with big data analytics are very important aspects for its success. Selection of appropriate cloud infrastructure at coarse and fine grained level will ensure better results. In this paper, a global architecture is proposed for QoS based scheduling for big data application to distributed cloud datacenter at two levels which are coarse grained and fine grained. At coarse grain level, appropriate local datacenter is selected based on network distance between user and datacenter, network throughput and total available resources using adaptive K nearest neighbor algorithm. At fine grained level, probability triplet (C, I, M) is predicted using naïve Bayes algorithm which provides probability of new application to fall in compute intensive (C), input/output intensive (I) and memory intensive (M) categories. Each datacenter is transformed into a pool of virtual clusters capable of executing specific category of jobs with specific (C, I, M) requirements using self organized maps. Novelty of study is to represent whole datacenter resources in a predefined topological ordering and executing new incoming jobs in their respective predefined virtual clusters based on their respective QoS requirements. Proposed architecture is tested on three different Amazon EMR datacenters for resource utilization, waiting time, availability, response time and estimated time to complete the job. Results indicated better QoS achievement and 33.15 % cost gain of the proposed architecture over traditional Amazon methods.
引用
收藏
页码:817 / 828
页数:11
相关论文
共 50 条
  • [41] NoSQL Distributed Big Data Storage Technology and Application Based on Cloud Platform
    Lu Zheng-Wu
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON ADVANCED DESIGN AND MANUFACTURING ENGINEERING (ICADME 2017), 2017, 136 : 334 - 340
  • [42] Knowledge Integration of Distributed Enterprises using Cloud based Big Data Analytics
    Bohlouli, Mahdi
    Merges, Fabian
    Fathi, Madjid
    2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2014, : 612 - 617
  • [43] Intelligent Digital Envelope for Distributed Cloud-Based Big Data Security
    Chelladurai S.P.
    Rajagopalan T.
    Computer Systems Science and Engineering, 2023, 46 (01): : 951 - 960
  • [44] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [45] APPLICATIONS OF BIG DATA IN RENEWABLE ENERGY SYSTEMS BASED ON CLOUD COMPUTING
    Sreedhar, Tarun Shakthi
    Islam, Saiful
    Atmosa, Meron
    Yazdandoust, Elaheh
    Elnaim, Mohamed Suliman
    Mishra, Shomesh
    Naresh, Venkata
    Bajpai, Vemparala Rupali
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2024, 16 (03): : 121 - 128
  • [46] Trusted Data Acquisition Mechanism for Cloud Resource Scheduling Based on Distributed Agents
    Li Xiaoyong
    Yang Yuehua
    CHINA COMMUNICATIONS, 2011, 8 (06) : 108 - 116
  • [47] Distributed In Situ Processing of Big Raster Data in the Cloud
    Zalipynis, Ramon Antonio Rodriges
    PERSPECTIVES OF SYSTEM INFORMATICS, PSI 2017, 2018, 10742 : 337 - 351
  • [48] Boafft: Distributed Deduplication for Big Data Storage in the Cloud
    Luo, Shengmei
    Zhang, Guangyan
    Wu, Chengwen
    Khan, Samee U.
    Li, Keqin
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (04) : 1199 - 1211
  • [49] Predicting the performance of big data applications on the cloud
    Ardagna, D.
    Barbierato, E.
    Gianniti, E.
    Gribaudo, M.
    Pinto, T. B. M.
    da Silva, A. P. C.
    Almeida, J. M.
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (02): : 1321 - 1353
  • [50] A Study on Big Data Reliable Combination Evaluation Method based on the Cloud Service Qos Model
    Li Xiating
    Song Rong
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (12): : 213 - 222