Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centres

被引:17
|
作者
Gill, Sukhpal Singh [1 ]
Ouyang, Xue [2 ]
Garraghan, Peter [3 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
[2] Natl Univ Def Technol, Sch Elect Sci, Changsha, Peoples R China
[3] Univ Lancaster, Sch Comp & Commun, Lancaster, England
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 12期
基金
英国工程与自然科学研究理事会;
关键词
Computing; Stragglers; Cloud computing; Straggler management; Distributed systems; Cloud data centres;
D O I
10.1007/s11227-020-03241-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing systems are splitting compute- and data-intensive jobs into smaller tasks to execute them in a parallel manner using clusters to improve execution time. However, such systems at increasing scale are exposed to stragglers, whereby abnormally slow running tasks executing within a job substantially affect job performance completion. Such stragglers are a direct threat towards attaining fast execution of data-intensive jobs within cloud computing. Researchers have proposed an assortment of different mechanisms, frameworks, and management techniques to detect and mitigate stragglers both proactively and reactively. In this paper, we present a comprehensive review of straggler management techniques within large-scale cloud data centres. We provide a detailed taxonomy of straggler causes, as well as proposed management and mitigation techniques based on straggler characteristics and properties. From this systematic review, we outline several outstanding challenges and potential directions of possible future work for straggler research.
引用
收藏
页码:10050 / 10089
页数:40
相关论文
共 50 条
  • [41] LGDCloudSim: A resource management simulation system for large-scale geographically distributed cloud data center scenarios
    Liu, Jiawen
    Xu, Yuehao
    Feng, Binbin
    Ding, Zhijun
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 194 - 204
  • [42] Survey of Large-Scale Data Management Systems for Big Data Applications
    Wu, Lengdong
    Yuan, Liyan
    You, Jiahuai
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (01) : 163 - 183
  • [43] Survey of Large-Scale Data Management Systems for Big Data Applications
    Lengdong Wu
    Liyan Yuan
    Jiahuai You
    Journal of Computer Science and Technology, 2015, 30 : 163 - 183
  • [44] Efficient Data Delivery Scheme for Large-Scale Microservices in Distributed Cloud Environment
    Pham, Van-Nam
    Hossain, Md. Delowar
    Lee, Ga-Won
    Huh, Eui-Nam
    APPLIED SCIENCES-BASEL, 2023, 13 (02):
  • [45] MatrixDCN: a high performance network architecture for large-scale cloud data centers
    Sun, Yantao
    Chen, Min
    Peng, Limei
    Hassan, Mohammad Mehedi
    Alelaiwi, Abdulhameed
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2016, 16 (08): : 942 - 959
  • [46] RESEARCH ON THE INCOMPLETE POINT CLOUD DATA REPAIRING OF THE LARGE-SCALE SCENE BUILDINGS
    Li, Yongqiang
    Li, Lixue
    Niu, Lubiao
    Huang, Tengda
    Li, Youpeng
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 6726 - 6729
  • [47] RESEARCH BASED ON LARGE-SCALE DATA QUERY WITH MAPREDUCE TECHNOLOGY IN CLOUD COMPUTING
    Wang, Feiping
    Gu, Xiaofeng
    2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 243 - 245
  • [48] MITIGATION OF LARGE-SCALE RDF DATA LOADING WITH THE EMPLOYMENT OF A CLOUD COMPUTING SERVICE
    Namgoong, Hyun
    Kumar, Harshit
    Kim, Hong-Gee
    KEOD 2010: Proceedings of the International Conference on Knowledge Engineering and Ontology Development, 2010, : 489 - 492
  • [49] Data Center Networks Recovery for Large-scale Disasters in Optical Cloud Networks
    Yang, Yuan
    Yin, Shan
    Guo, Bingli
    Zhang, Yu
    Wang, Chenge
    Huang, Shanguo
    2016 15TH INTERNATIONAL CONFERENCE ON OPTICAL COMMUNICATIONS AND NETWORKS (ICOCN), 2016,
  • [50] A High Performance Network Architecture for Large-scale Cloud Media Data Centers
    Sun, Yantao
    Chen, Min
    Liu, Qiang
    Cheng, Jing
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 1760 - 1766