Robust Task Scheduling Strategy for Big Data Clusters

被引:2
|
作者
Wang, Zixiang [1 ]
Liu, Zhoubin [1 ]
Huan, Zhan [2 ]
Kong, Xiaoyun [3 ]
Yuan, Xiaolu [4 ]
机构
[1] State Grid Zhejiang Elect Power Res Inst, Hangzhou, Zhejiang, Peoples R China
[2] Changzhou Univ, Changzhou, Peoples R China
[3] State Grid Zhejiang Elect Power Corp, Hangzhou, Zhejiang, Peoples R China
[4] Xi An Jiao Tong Univ, Xian, Shaanxi, Peoples R China
关键词
task scheduling; big data; robust; uncertainty; feedback; FEEDBACK-CONTROL; ALGORITHMS;
D O I
10.1109/BIGCOM.2017.30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Task scheduling problem has been an active area in big data framework. Arriving tasks should be assigned to suitable nodes in a big data cluster for load balancing. The system can become more efficient when the workload is well distributed among different nodes. However, the existing researches consider little about the uncertainties in the resource and service provision. Since effective processor speed is changing over time, the computational ability of a node is not consistent. The load balancing may not be well achieved due to the computing uncertainty. In this paper, we propose a robust task scheduling algorithm for big data clusters. The computational uncertainty is modeled as perturbation on the processing speed, and our task scheduling approach is designed to deal with the potential computing uncertainties. The simulation results demonstrate that our scheduling strategy can reject perturbation and provide the stable computing service.
引用
收藏
页码:305 / 312
页数:8
相关论文
共 50 条
  • [21] A Data Distribution Aware Task Scheduling Strategy for MapReduce System
    Guo, Leitao
    Sun, Hongwei
    Luo, Zhiguo
    CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 : 694 - 699
  • [22] Scheduling strategies for mixed data and task parallelism on heterogeneous clusters and grids
    Beaumont, O
    Legrand, A
    Robert, Y
    ELEVENTH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2003, : 209 - 216
  • [23] Data-Intensive Task Scheduling for Heterogeneous Big Data Analytics in IoT System
    Li, Xin
    Wang, Liangyuan
    Abawajy, Jemal H.
    Qin, Xiaolin
    Pau, Giovanni
    You, Ilsun
    ENERGIES, 2020, 13 (17)
  • [24] Leveraging Big Data for Adaptive Robust Optimization of Scheduling under Uncertainty
    Ning, Chao
    You, Fengqi
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 3783 - 3788
  • [25] Adaptive Task Scheduling Strategy Based on Dynamic Workload Adjustment for Heterogeneous Hadoop Clusters
    Xu, Xiaolong
    Cao, Lingling
    Wang, Xinheng
    IEEE SYSTEMS JOURNAL, 2016, 10 (02): : 471 - 482
  • [26] Storage-aware Task Scheduling for Performance Optimization of Big Data Workflows
    Ye, Qianwen
    Wu, Chase Q.
    Cao, Huiyan
    Rao, Nageswara S. V.
    Hou, Aiqin
    2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 1095 - 1102
  • [27] Trident: Task Scheduling over Tiered Storage Systems in Big Data Platforms
    Herodotou, Herodotos
    Kakoulli, Elena
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (09): : 1570 - 1582
  • [28] A Reliable Task Assignment Strategy for Spatial Crowdsourcing in Big Data Environment
    Gu, Liqiu
    Wang, Kun
    Liu, Xiulong
    Guo, Song
    Liu, Bo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [29] Performance optimization of computing task scheduling based on the Hadoop big data platform
    Li, Yang
    Hei, Xinhong
    NEURAL COMPUTING & APPLICATIONS, 2022, 37 (13): : 8181 - 8192
  • [30] Thermal-Aware and DVFS-Enabled Big Data Task Scheduling for Data Centers
    Liu, Huazhong
    Liu, Baoshun
    Yang, Laurence T.
    Lin, Man
    Deng, Yuhui
    Bilal, Kashif
    Khan, Samee U.
    IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (02) : 177 - 190