Makespan reduction for dynamic workloads in cluster-based data grids using reinforcement-learning based scheduling

Cited by: 14
Authors
Moghadam, Mahshid Helali [1]
Babamir, Seyed Morteza [1]
Affiliations
[1] Univ Kashan, Dept Comp Engn, Kashan, Iran
Keywords
Data grid; Data-intensive task scheduling algorithm; Data communication cost; Reinforcement learning; DATA REPLICATION; COMPUTING SYSTEMS; INDEPENDENT TASKS; RELIABILITY; MANAGEMENT; ALGORITHM
DOI
10.1016/j.jocs.2017.09.016
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Scheduling is one of the important problems in the control and management of grid and cloud-based systems. The data grid, still a primary solution for processing data-intensive tasks, manages large amounts of data distributed across multiple nodes. In this paper, a two-phase learning-based scheduling algorithm is proposed for scheduling data-intensive tasks in cluster-based data grids. The algorithm applies a hierarchical multi-agent system, consisting of one global broker agent and several local agents, to the scheduling procedure. In the first step, the global broker agent selects the cluster with the minimum data cost according to the data communication cost measure. In the second step, the local agent of the selected cluster uses an adaptive policy based on Q-learning to schedule the task to a suitable node of that cluster. The impact of three action selection strategies on the proposed algorithm has been investigated, and the performance of the resulting versions of the algorithm has been evaluated under three types of workloads with heterogeneous tasks. Experimental results show that, for dynamic workloads with varying task submission patterns, the proposed learning-based scheduling algorithm performs better than four common scheduling algorithms, Queue Length (Shortest Queue), Access Cost, Queue Access Cost (QAC) and HCS, which use regular combinations of primary parameters such as data communication cost and queue length. Applying a learning-based strategy makes the scheduling algorithm more adaptable to changing conditions in the environment. © 2017 Elsevier B.V. All rights reserved.
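The two phases described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the state representation, the reward, or the three action selection strategies, so the epsilon-greedy choice, the tabular Q-values, and the names `select_cluster`, `LocalAgent`, `choose_node` and `update` are all assumptions made for illustration.

```python
import random


def select_cluster(data_cost):
    """Phase 1 (global broker): pick the cluster with the minimum data cost.

    `data_cost` maps a cluster id to its data communication cost for the task.
    How that cost is computed is not given in the abstract (assumed input).
    """
    return min(data_cost, key=data_cost.get)


class LocalAgent:
    """Phase 2 (local agent): tabular Q-learning over the nodes of one cluster.

    Epsilon-greedy is an assumed action selection strategy; the abstract only
    says that three strategies were compared, without naming them.
    """

    def __init__(self, nodes, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.nodes = list(nodes)
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration probability
        self.q = {}             # (state, node) -> estimated value
        self.rng = random.Random(seed)

    def choose_node(self, state):
        """Explore with probability epsilon, otherwise take the best-known node."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.nodes)
        return max(self.nodes, key=lambda n: self.q.get((state, n), 0.0))

    def update(self, state, node, reward, next_state):
        """Standard Q-learning update for the (state, node) pair."""
        best_next = max(self.q.get((next_state, n), 0.0) for n in self.nodes)
        old = self.q.get((state, node), 0.0)
        target = reward + self.gamma * best_next
        self.q[(state, node)] = old + self.alpha * (target - old)


if __name__ == "__main__":
    # Hypothetical costs and node names, for illustration only.
    cluster = select_cluster({"cluster-a": 5.0, "cluster-b": 2.0})
    agent = LocalAgent(["node-1", "node-2"])
    node = agent.choose_node(state="idle")
    # Assumed reward: negative task completion time, so shorter is better.
    agent.update("idle", node, reward=-3.0, next_state="idle")
    print(cluster, node, agent.q)
```

In this sketch the broker's decision is a plain minimum over costs, while the local agent is the only learning component, which mirrors the division of roles in the abstract. A reward tied to task completion time is one plausible way to connect the learning signal to makespan, but that link is an assumption here.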
Pages: 402-412
Number of pages: 11