SpotDAG: An RL-Based Algorithm for DAG Workflow Scheduling in Heterogeneous Cloud Environments

被引:2
|
作者
Lin, Liduo [1 ]
Pan, Li [1 ]
Liu, Shijun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
基金
国家重点研发计划;
关键词
Data processing; Job shop scheduling; Costs; Cloud computing; Optimization; Task analysis; Data models; Heterogeneous cloud environments; spot instance; on-demand instance; IaaS; TASKS;
D O I
10.1109/TSC.2024.3422828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As increasingly complex functions are implemented in applications, directed acyclic graphs (DAGs) are widely used to model the inter-dependencies between individual functions. Cloud-based data processing platforms need to consider the complex topology of DAGs and arbitrary deadlines given by users for job scheduling, leading to an NP-hard decision-making problem. Leveraging spot instances in data processing platforms can achieve significant cost savings, but the unpredictable interruption of spot instances makes the problem of VM scaling and job scheduling more difficult. In this paper, a Reinforcement Learning (RL) based approach called SpotDAG is proposed to solve the auto-scaling problem for jobs modeled as DAGs on a data processing platform where spot instances are introduced. SpotDAG makes cluster scaling and job scheduling decisions at the same time by mapping its output to several meta-policies. This paper introduces the self-attention mechanism for feature extraction to help the intelligent agent learn faster. A mask layer after the output of the proposed RL-based algorithm circumvents illegal actions to ensure that a job is completed by its deadline. Extensive experimental results show that the proposed approach can significantly reduce the cost of instances for data processing platforms while ensuring that jobs are completed in time.
引用
收藏
页码:2904 / 2917
页数:14
相关论文
共 50 条
  • [11] Hierarchical Cross-Organizational Workflow Scheduling Algorithm in Cloud Environments
    Tan, Wen'an
    Peng, Jia
    Sun, Yong
    Chen, Senbo
    Tang, Anqiong
    Tang, Shan
    HUMAN CENTERED COMPUTING, HCC 2014, 2015, 8944 : 522 - 533
  • [12] Modified scheduling algorithm for cloud workflow based on QoS
    Wang, Y. (wangyan3215931@163.com), 1600, Northeast University (35):
  • [13] A workflow scheduling algorithm based on cloud computing environment
    Zhang, X.-M., 1600, CESER Publications, Post Box No. 113, Roorkee, 247667, India (45):
  • [14] Genetic Algorithm for DAG Scheduling in Grid Environments
    Pop, Florin
    Dobre, Ciprian
    Cristea, Valentin
    2009 IEEE 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING, PROCEEDINGS, 2009, : 299 - 305
  • [15] Cuckoo Search based Workflow Scheduling on Heterogeneous Cloud Resources
    Singh, Harshpreet
    Randhawa, Rajneesh
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 65 - 70
  • [16] A New DAG Scheduling Algorithm for Heterogeneous Platforms
    Nasri, Wahid
    Nafti, Wafa
    2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 114 - 119
  • [17] DAG Scheduling in Heterogeneous Computing and Grid Environments Using Variable Neighborhood Search Algorithm
    Selvi, S.
    Manimegalai, D.
    APPLIED ARTIFICIAL INTELLIGENCE, 2017, 31 (02) : 134 - 173
  • [18] An HC-CSO Algorithm for Workflow Scheduling in Heterogeneous Cloud Computing System
    Bhagwan, Jai
    Kumar, Sanjeev
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (06) : 484 - 492
  • [19] An Effective Algorithm for Cloud Workflow Scheduling
    Chou, Yu-Ting
    Liu, Shih-Jui
    Wu, Tzu-Chuan
    Wu, Chia-Lin
    Tsai, Chun-Wei
    Chiang, Ming-Chao
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3603 - 3608
  • [20] Granularity-based workflow scheduling algorithm for cloud computing
    Kumar, Madhu Sudan
    Gupta, Indrajeet
    Panda, Sanjaya K.
    Jana, Prasanta K.
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (12): : 5440 - 5464