A Data and Task Co-Scheduling Algorithm for Scientific Cloud Workflows

被引:15
|
作者
Deng, Kefeng [1 ]
Ren, Kaijun [1 ]
Zhu, Min [1 ]
Song, Junqiang [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Cloud computing; scientific workflow; co-scheduling; data placement; task scheduling; DATA PLACEMENT; STRATEGY;
D O I
10.1109/TCC.2015.2511745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing has emerged as a promising computational infrastructure for cost-efficient workflow execution by provisioning on-demand resources in a pay-as-you-go manner. While scientific workflows require accessing community-wide resources, they usually need to be performed in collaborative cloud environments composed of multiple datacenters. Although such environments facilitate scientific collaboration, the movements of input and intermediate datasets across geographically distributed datacenters may cause intolerable latency that would hinder efficient execution of large-scale data-intensive scientific workflows. To address the problem, in this article we propose a novel multi-level K-cut graph partitioning algorithm to minimize the volume of data transfer across datacenters while satisfying load balancing and fixed data constraints. The algorithm first contracts the fixed input datasets in the same datacenter and their consuming tasks, and coarsens the contracted graph to a predefined scale in a level-by-level manner. Then, a K-cut algorithm is used to partition the resulted graph into K parts such that the cut size is minimized. After that, the partitioned graph is projected back to the original workflow graph, during which the load balancing constraint is maintained. We evaluate our algorithm using three real-world workflow applications and the results demonstrate that the proposed algorithm outperforms other state-of-the-art algorithms.
引用
收藏
页码:349 / 362
页数:14
相关论文
共 50 条
  • [41] Energy Efficient Task Scheduling for Parallel Workflows in Cloud Environment
    Kumar, Mallari Harish
    Peddoju, Sateesh K.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1298 - 1303
  • [42] Structure-Aware Scheduling Methods for Scientific Workflows in Cloud
    Albtoush, Alaa
    Yunus, Farizah
    Almi'ani, Khaled
    Noor, Noor Maizura Mohamad
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [43] Fair-Share Methods for Scheduling Scientific Workflows in Cloud
    Aldabaybah, Balqees
    Alrawashdeh, Tawfiq
    Butt, Talal Ashraf
    Almiani, Khaled
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [44] Scheduling Scientific Workflows on Clouds Using a Task Duplication Approach
    Genez, Thiago A. L.
    Sakellariou, Rizos
    Bittencourt, Luiz F.
    Madeira, Edmundo R. M.
    Braun, Torsten
    2018 IEEE/ACM 11TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2018, : 83 - 92
  • [45] Fault-Tolerant Scheduling for Scientific Workflows in Cloud Environments
    Vinay, K.
    Kumar, S. M. Dilip
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 150 - 155
  • [46] Scheduling multiple scientific workflows using containers on IaaS cloud
    Rajasekar, P.
    Palanichamy, Yogesh
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (07) : 7621 - 7636
  • [47] Security-Aware Scheduling of Multiple Scientific Workflows in Cloud
    Roy, Shubhro
    Gharote, Mangesh
    Ramamurthy, Arun
    Pawar, Anand
    Lodha, Sachin
    CLOUD COMPUTING AND SERVICES SCIENCE, CLOSER 2022, CLOSER 2023, 2024, 1845 : 1 - 24
  • [48] Energy efficient partitioning and scheduling approach for Scientific Workflows in the Cloud
    Bousselmi, Khadija
    Brahmi, Zaki
    Gammoudi, Mohamed Mohsen
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016, : 146 - 154
  • [49] Scheduling multiple scientific workflows using containers on IaaS cloud
    P. Rajasekar
    Yogesh Palanichamy
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 7621 - 7636
  • [50] Cache-aware scheduling of scientific workflows in a multisite cloud
    Heidsieck, Gaetan
    de Oliveira, Daniel
    Pacitti, Esther
    Pradal, Christophe
    Tardieu, Francois
    Valduriez, Patrick
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 122 : 172 - 186