Efficient deadline-aware scheduling for the analysis of Big Data streams in public Cloud

被引:14
|
作者
Mortazavi-Dehkordi, Mahmood [1 ]
Zamanifar, Kamran [1 ]
机构
[1] Univ Isfahan, Comp Engn Fac, Software Dept, Esfahan, Iran
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2020年 / 23卷 / 01期
关键词
Streaming Big Data analysis query; Deadline-aware scheduling; Cloud-based stream processing; REAL-TIME; RESOURCE-MANAGEMENT; SIMULATION;
D O I
10.1007/s10586-019-02908-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of Big Data has had a profound impact on how data are analyzed. Open source distributed stream processing platforms have gained popularity for analyzing streaming Big Data as they provide low latency required for streaming Big Data applications using Cloud resources. However, existing resource schedulers are still lacking the efficiency and deadline meeting that Big Data analytical applications require. Recent works have already considered streaming Big Data characteristics to improve the efficiency and the likelihood of deadline meeting for scheduling in the platforms. Nevertheless, they have not taken into account the specific attributes of analytical application, public Cloud utilization cost and delays caused by performance degradation of leasing public Cloud resources. This study, therefore, presents BCframework, an efficient deadline-aware scheduling framework used by streaming Big Data analysis applications based on public Cloud resources. BCframework proposes a scheduling model which considers public Cloud utilization cost, performance variation, deadline meeting and latency reduction requirements of streaming Big Data analytical applications. Furthermore, it introduces two operator scheduling algorithms based on both a novel partitioning algorithm and an operator replication method. BCframework is highly adaptable to the fluctuation of streaming Big Data and the performance degradation of public Cloud resources. Experiments with the benchmark and real-world queries show that BCframework can significantly reduce the latency and utilization cost and also minimize deadline violations and provisioned virtual machine instances.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 50 条
  • [1] Efficient deadline-aware scheduling for the analysis of Big Data streams in public Cloud
    Mahmood Mortazavi-Dehkordi
    Kamran Zamanifar
    Cluster Computing, 2020, 23 : 241 - 263
  • [2] A Deadline-aware Coflow Scheduling Approach for Big Data Applications
    Tang, Wenda
    Wang, Song
    Li, Duanchao
    Huang, Taigui
    Dou, Wanchun
    Yu, Shui
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [3] Modelling and Analysis of A Novel Deadline-Aware Scheduling Scheme for Cloud Computing Data Centers
    Khabbaz, Maurice
    Assi, Chadi M.
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (01) : 141 - 155
  • [4] Task Deadline-Aware Energy-Efficient Scheduling Model for a Virtualized Cloud
    Garg, Neha
    Goraya, Major Singh
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (02) : 829 - 841
  • [5] Task Deadline-Aware Energy-Efficient Scheduling Model for a Virtualized Cloud
    Neha Garg
    Major Singh Goraya
    Arabian Journal for Science and Engineering, 2018, 43 : 829 - 841
  • [6] Brief Announcement: Deadline-Aware Scheduling of Big-Data Processing Jobs
    Bodik, Peter
    Menache, Ishai
    Naor, Joseph
    Yaniv, Jonathan
    PROCEEDINGS OF THE 26TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'14), 2014, : 211 - 213
  • [7] Deadline-Aware Scheduling and Flexible Bandwidth Allocation for Big-Data Transfers
    Srinivasan, Srinikethan Madapuzi
    Tram Truong-Huu
    Gurusamy, Mohan
    IEEE ACCESS, 2018, 6 : 74400 - 74415
  • [8] An energy-efficient and deadline-aware workflow scheduling algorithm in the fog and cloud environment
    Khaledian, Navid
    Khamforoosh, Keyhan
    Akraminejad, Reza
    Abualigah, Laith
    Javaheri, Danial
    COMPUTING, 2024, 106 (01) : 109 - 137
  • [9] An energy-efficient and deadline-aware workflow scheduling algorithm in the fog and cloud environment
    Navid Khaledian
    Keyhan Khamforoosh
    Reza Akraminejad
    Laith Abualigah
    Danial Javaheri
    Computing, 2024, 106 : 109 - 137
  • [10] Deadline-Aware Programming and Scheduling
    Burns, Alan
    Wellings, Andy
    RELIABLE SOFTWARE TECHNOLOGIES - ADA-EUROPE 2014, 2014, 8454 : 107 - 118