Efficient Online Resource Allocation in Heterogeneous Clusters with Machine Variability

被引:0
|
作者
Xu, Huanle [1 ]
Liu, Yang [2 ]
Lau, Wing Cheong [2 ]
Guo, Jun [1 ]
Liu, Alex [1 ]
机构
[1] Dongguan Univ Technol, Coll Comp Sci & Network Secur, Dongguan, Peoples R China
[2] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-armed bandit; online optimization; approximation jobs; regret bound;
D O I
10.1109/infocom.2019.8737511
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Approximation jobs that allow partial execution of their many tasks to achieve valuable results have played an important role in today's large-scale data analytics [1] [2]. This fact can be utilized to maximize the system utility of a big data computing cluster by choosing proper tasks in scheduling for each approximation job. A fundamental challenge herein, however, is that the machine service capacity may fluctuate substantially during a job's lifetime, which makes it difficult to assign valuable tasks to well-performed machines. In addition, the cluster scheduler needs to make online scheduling decisions without knowing future job arrivals according to machine availabilities. In this paper, we tackle this online resource allocation problem for approximation jobs in parallel computing clusters. In particular, we model a cluster with heterogeneous machines as a multi armed bandit where each machine is treated as an arm. By making estimations on machine service rates while balancing the exploration-exploitation trade-off, we design an efficient online resource allocation algorithm from a bandit perspective. The proposed algorithm extends existing online convex optimization techniques and yields a sublinear regret bound. Moreover, we also examine the performance of the proposed algorithm via extensive trace-driven simulations and demonstrate that it outperforms the baselines substantially.
引用
收藏
页码:478 / 486
页数:9
相关论文
共 50 条
  • [1] Online Resource Allocation With Machine Variability: A Bandit Perspective
    Xu, Huanle
    Liu, Yang
    Lau, Wing Cheong
    Zeng, Tiantong
    Guo, Jun
    Liu, Alex X.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (05) : 2243 - 2256
  • [2] Adaptive Multi-Criteria Selection for Efficient Resource Allocation in Frugal Heterogeneous Hadoop Clusters
    Qureshi, Basit
    ELECTRONICS, 2024, 13 (10)
  • [3] On the use of HCM to develop a resource allocation algorithm for heterogeneous clusters
    Soares, Thiago Marques
    dos Santos, Rodrigo Weber
    Lobosco, Marcelo
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (24):
  • [4] Resource Selection and Allocation for Dynamic Adaptive Computing in Heterogeneous Clusters
    Duselis, John U.
    Cauich, E. Enrique
    Wang, Richert K.
    Scherson, Isaac D.
    2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 365 - 373
  • [5] Efficient Algorithms for Resource Allocation in Heterogeneous OFDMA Networks
    Bashar, Shafi
    Ding, Zhi
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [6] An Efficient Resource Allocation Algorithm for Heterogeneous Wireless Networks
    Mathonsi, Topside E.
    Tshilongamulenzhe, Tshimangadzo M.
    Buthelezi, Bongisizwe E.
    2019 OPEN INNOVATIONS CONFERENCE (OI), 2019, : 15 - 19
  • [7] Energy Efficient Resource Allocation for Heterogeneous Cloud Workloads
    Kaur, Prabhjot
    Kaur, Pankaj Deep
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1319 - 1322
  • [8] A Bandit Approach to Online Pricing for Heterogeneous Edge Resource Allocation
    Cheng, Jiaming
    Nguyen, Duong Thuy Anh
    Wang, Lele
    Nguyen, Duong Tung
    Bhargava, Vijay K.
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 277 - 281
  • [9] Simple, efficient allocation of modelling runs on heterogeneous clusters with MPI
    Donato, David I.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2017, 88 : 48 - 57
  • [10] Tarema: Adaptive Resource Allocation for Scalable Scientific Workflows in Heterogeneous Clusters
    Bader, Jonathan
    Thamsen, Lauritz
    Kulagina, Svetlana
    Will, Jonathan
    Meyerhenke, Henning
    Kao, Odej
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 65 - 75