Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centres

被引:17
|
作者
Gill, Sukhpal Singh [1 ]
Ouyang, Xue [2 ]
Garraghan, Peter [3 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
[2] Natl Univ Def Technol, Sch Elect Sci, Changsha, Peoples R China
[3] Univ Lancaster, Sch Comp & Commun, Lancaster, England
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 12期
基金
英国工程与自然科学研究理事会;
关键词
Computing; Stragglers; Cloud computing; Straggler management; Distributed systems; Cloud data centres;
D O I
10.1007/s11227-020-03241-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing systems are splitting compute- and data-intensive jobs into smaller tasks to execute them in a parallel manner using clusters to improve execution time. However, such systems at increasing scale are exposed to stragglers, whereby abnormally slow running tasks executing within a job substantially affect job performance completion. Such stragglers are a direct threat towards attaining fast execution of data-intensive jobs within cloud computing. Researchers have proposed an assortment of different mechanisms, frameworks, and management techniques to detect and mitigate stragglers both proactively and reactively. In this paper, we present a comprehensive review of straggler management techniques within large-scale cloud data centres. We provide a detailed taxonomy of straggler causes, as well as proposed management and mitigation techniques based on straggler characteristics and properties. From this systematic review, we outline several outstanding challenges and potential directions of possible future work for straggler research.
引用
收藏
页码:10050 / 10089
页数:40
相关论文
共 50 条
  • [21] Autonomous and Energy-Aware Management of Large-Scale Cloud Infrastructures
    Feller, Eugen
    Morin, Christine
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2542 - 2545
  • [22] An Automatic Cloud Service Platform for Learning from Large-Scale Data
    Xiong, Li
    Tong, Hengqing
    RESOURCES AND SUSTAINABLE DEVELOPMENT, PTS 1-4, 2013, 734-737 : 3085 - +
  • [23] Clustered Multicast Source Routing for Large-Scale Cloud Data Centers
    Alqahtani, Jarallah
    Sinky, Hassan H.
    Hamdaoui, Bechir
    IEEE ACCESS, 2021, 9 (09): : 12693 - 12705
  • [24] Rapid Data Evacuation for Large-Scale Disasters in Optical Cloud Networks
    Ferdousi, Sifat
    Habib, M. Farhan
    Tornatore, Massimo
    Mukherjee, Biswanath
    2015 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2015,
  • [25] Improving System and Software Deployment on a Large-Scale Cloud Data Center
    Wu, Yu-Sheng
    Juang, Tong-Ying
    Chang, Yue-Shan
    Wang, Wei-Jen
    Lu, Jun-Ting
    2013 FIFTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2013, : 82 - 87
  • [26] Process virtualization of large-scale lidar data in a cloud computing environment
    Guan, Haiyan
    Li, Jonathan
    Zhong, Liang
    Yu, Yongtao
    Chapman, Michael
    COMPUTERS & GEOSCIENCES, 2013, 60 : 109 - 116
  • [27] Using Cloud Technologies for Large-Scale House Data in Smart City
    Yamamoto, Shintaro
    Matsumoto, Shinsuke
    Nakamura, Masahide
    2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [28] Rapid Data Evacuation for Large-Scale Disasters in Optical Cloud Networks
    Ferdousi, Sifat
    Tornatore, Massimo
    Habib, M. Farhan
    Mukherjee, Biswanath
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2015, 7 (12) : B163 - B172
  • [29] Large-scale, realistic cloud visualization based on weather forecast data
    Hufnagel, Roland
    Held, Martin
    Schroeder, Florian
    PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS AND IMAGING, 2007, : 54 - 59
  • [30] Remote Attestation of Large-scale Virtual Machines in the Cloud Data Center
    Chene, Jie
    Zhang, Kun
    Tu, Bibo
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 180 - 187