Replica-Aware Job Scheduling in Distributed Systems

被引:0
|
作者
Liao, Wei-Cheng [1 ]
Wu, Jan-Jan [2 ,3 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[3] Acad Sinica, Res Ctr Informat Technol Innova, Taipei, Taiwan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an effective replica-aware scheduling algorithm for independent jobs in Grid and distributed systems. The proposed algorithm considers not only the execution time of jobs but also the location and transfer time of data and data replica that these jobs require. We propose a cost model to estimate the starting time and earliest completion time of a job and its associated data (original or replicated). Based on the estimated time, the scheduling algorithm finds a proper execution sequence for the jobs and the data with the goal to minimize the makespan of the jobs. Our experiment results demonstrate that the proposed algorithm is scalable and outperforms a random job selection strategy. We also show that the proposed algorithm performs well compared to a conservative theoretical lower bound, with performance within 15% of the lower bound on average and within 40% in the worst case.
引用
收藏
页码:290 / +
页数:2
相关论文
共 50 条
  • [1] Replica-Aware Partitioning Design in Parallel Database Systems
    Dong, Liming
    Liu, Weidong
    Li, Renchuan
    Zhang, Tiejun
    Zhao, Weiguo
    EURO-PAR 2017: PARALLEL PROCESSING, 2017, 10417 : 303 - 316
  • [2] Replica-aware caching for Web proxies
    Bahn, H
    Lee, H
    Noh, SH
    Min, SL
    Koh, K
    COMPUTER COMMUNICATIONS, 2002, 25 (03) : 183 - 188
  • [3] Replica-aware, multi-dimensional range queries in Distributed Hash Tables
    Chazapis, Antony
    Asiki, Athanasia
    Tsoukalas, Georgios
    Tsoumakos, Dimitrios
    Koziris, Nectarios
    COMPUTER COMMUNICATIONS, 2010, 33 (08) : 984 - 996
  • [4] Replica-aware task scheduling and load balanced cache placement for delay reduction in multi-cloud environment
    Li, Chunlin
    Zhang, Jing
    Tang, Hengliang
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (05): : 2805 - 2836
  • [5] Replica-aware data recovery performance improvement for Hadoop system with NVM
    Xin Li
    Huijie Li
    Youyou Lu
    Yanchao Zhao
    Xiaolin Qin
    CCF Transactions on High Performance Computing, 2021, 3 : 144 - 156
  • [6] Replica-aware data recovery performance improvement for Hadoop system with NVM
    Li, Xin
    Li, Huijie
    Lu, Youyou
    Zhao, Yanchao
    Qin, Xiaolin
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (02) : 144 - 156
  • [7] Replica-aware task scheduling and load balanced cache placement for delay reduction in multi-cloud environment
    Chunlin Li
    Jing Zhang
    Hengliang Tang
    The Journal of Supercomputing, 2019, 75 : 2805 - 2836
  • [8] Resource intensity aware job scheduling in a distributed cloud
    Huang Daochao
    Zhu Chunge
    Zhang Hong
    Liu Xinran
    CHINA COMMUNICATIONS, 2014, 11 (02) : 175 - 184
  • [9] Job scheduling in heterogeneous distributed systems
    Karatza, HD
    JOURNAL OF SYSTEMS AND SOFTWARE, 2001, 56 (03) : 203 - 212
  • [10] A simulation environment for job scheduling on distributed systems
    Santoso, J
    van Albada, GD
    Basaruddin, T
    Sloot, PMA
    COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 653 - 662