Scheduling of big data applications on distributed cloud based on QoS parameters

被引:0
|
作者
Rajinder Sandhu
Sandeep K. Sood
机构
[1] Guru Nanak Dev University,
来源
Cluster Computing | 2015年 / 18卷
关键词
Big data; Cloud computing; Quality of Service (QoS); Hadoop; Self organizing maps; K nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is one of the major technology usages for business operations in today’s competitive market. It provides organizations a powerful tool to analyze large unstructured data to make useful decisions. Result quality, time, and price associated with big data analytics are very important aspects for its success. Selection of appropriate cloud infrastructure at coarse and fine grained level will ensure better results. In this paper, a global architecture is proposed for QoS based scheduling for big data application to distributed cloud datacenter at two levels which are coarse grained and fine grained. At coarse grain level, appropriate local datacenter is selected based on network distance between user and datacenter, network throughput and total available resources using adaptive K nearest neighbor algorithm. At fine grained level, probability triplet (C, I, M) is predicted using naïve Bayes algorithm which provides probability of new application to fall in compute intensive (C), input/output intensive (I) and memory intensive (M) categories. Each datacenter is transformed into a pool of virtual clusters capable of executing specific category of jobs with specific (C, I, M) requirements using self organized maps. Novelty of study is to represent whole datacenter resources in a predefined topological ordering and executing new incoming jobs in their respective predefined virtual clusters based on their respective QoS requirements. Proposed architecture is tested on three different Amazon EMR datacenters for resource utilization, waiting time, availability, response time and estimated time to complete the job. Results indicated better QoS achievement and 33.15 % cost gain of the proposed architecture over traditional Amazon methods.
引用
收藏
页码:817 / 828
页数:11
相关论文
共 50 条
  • [31] Optimization of Task Scheduling Algorithm through QoS Parameters for Cloud Computing
    Monika
    Jindal, Abhimanyu
    4TH INTERNATIONAL CONFERENCE ON ADVANCEMENTS IN ENGINEERING & TECHNOLOGY (ICAET-2016), 2016, 57
  • [32] Balancing the load and scheduling the tasks using zebra optimizer in IoT based cloud computing for big-data applications
    Vijayaraj, V.
    Balamurugan, M.
    Oberoi, Monisha
    REVISTA INTERNACIONAL DE METODOS NUMERICOS PARA CALCULO Y DISENO EN INGENIERIA, 2024, 40 (02):
  • [33] AutoScale: Adaptive QoS-Aware Container-based Cloud Applications Scheduling Framework
    Sun, Yao
    Meng, Lun
    Song, Yunkui
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (06): : 2824 - 2837
  • [34] Selection of Services for Data-Centric Cloud Applications: A QoS Based Approach
    Mandal, Amit Kr
    Changder, Suvamoy
    Sarkar, Anirban
    2013 SECOND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND SECURITY (ADCONS 2013), 2013, : 102 - 107
  • [35] An Improved Container Scheduling Algorithm Based on PSO for Big Data Applications
    Li, Jiawei
    Liu, Bo
    Lin, Weiwei
    Li, Pengfei
    Gao, Qian
    CYBERSPACE SAFETY AND SECURITY, PT I, 2020, 11982 : 516 - 530
  • [36] An Efficient Data Scheduling Scheme for Cloud-based Big Data Framework for Smart City
    Nasser, Nidal
    Khan, Nargis
    ElAttar, Mohamed
    Saleh, Kassem
    Abujamous, Amjad
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [37] Swarm Intelligence (SI) based Profiling and Scheduling of Big Data Applications
    Soinasundaram, Thamarai Selvi
    Govindarajan, Kannan
    Kumar, Vivekanandan Suresh
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1875 - 1880
  • [38] An Intelligent Task Scheduling Model for Hybrid Internet of Things and Cloud Environment for Big Data Applications
    Pal, Souvik
    Jhanjhi, N. Z.
    Abdulbaqi, Azmi Shawkat
    Akila, D.
    Alsubaei, Faisal S.
    Almazroi, Abdulaleem Ali
    SUSTAINABILITY, 2023, 15 (06)
  • [39] An Iterative Hierarchical Key Exchange Scheme for Secure Scheduling of Big Data Applications in Cloud Computing
    Liu, Chang
    Zhang, Xuyun
    Liu, Chengfei
    Yang, Yun
    Ranjan, Rajiv
    Georgakopoulos, Dimitrios
    Chen, Jinjun
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 9 - 16
  • [40] A Distributed Collaborative Urban Traffic Big Data System Based on Cloud Computing
    Zhang, Jianqin
    Chen, Zhihong
    Xu, Zhijie
    Du, Mingyi
    Yang, Weijun
    Guo, Liang
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2019, 11 (04) : 37 - 47