Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster

被引:1
|
作者
Das, Soumen Kumar [1 ]
Sudhakaran, G. [1 ]
Ashok, V. [1 ]
机构
[1] ISRO, Vikram Sarabhai Space Ctr, Govt India, Dept Space, Trivandrum, Kerala, India
关键词
High performance Cluster; CRM; Moldable Scheduler; Collocation; Resource Manager; petascale; green computing;
D O I
10.1109/EAIT.2014.58
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The recent success in building petascale High Performance Computing (HPC) systems have produced the demand for efficient and optimized use of resources to increase the performance and reduce the power consumption. Including the above, the heterogeneous architectures of nowadays HPCs comprising a multicore CPU and many-core Accelerator like GPU(s) are facing another concern for using optimum utilization of each of these components. This paper presents the scheduling mechanism of the Cluster Resource Manager (CRM): i. Moldable job Scheduler (MS) which is able to mold the jobs with respect to the number of machines based on an preliminary initialized and auto updated heuristic knowledge-base of problem size, optimum machine count, execution duration to increase the utilization of the full cluster facility. ii) Collocation Aware and Power Efficient Resource Manager (CAPE-RM) manages collocation of CPU only and GPU accelerated jobs by monitoring the CPU load and memory usage. The emerging computation ability is followed by the huge amount of power consumption. Though the use of GPU(s) itself cut down the power to be needed by the only CPU based cluster but to make a green computing facility more power efficiency is desired. The CAPE-RM is designed to support the above by powering off the idle nodes by monitoring the total load to the facility and based on a simple statistic of the frequency of job submission.
引用
收藏
页码:161 / 166
页数:6
相关论文
共 50 条
  • [21] An efficient, model-based CPU-GPU heterogeneous FFT library
    Ogata, Yasuhito
    Endo, Toshio
    Maruyama, Naoya
    Matsuoka, Satoshi
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 380 - +
  • [22] Allok: a machine learning approach for efficient graph execution on CPU-GPU clusters
    Moori, Marcelo Koji
    Rocha, Hiago Mayk G. de A.
    Lorenzon, Arthur F.
    Beck, Antonio Carlos S.
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 18838 - 18865
  • [23] Implementation of Cubic Spline Interpolation on Parallel Skeleton using Pipeline Model on CPU-GPU Cluster
    Mohanty, Prasant Kumar
    Reza, Motahar
    Kumar, Piyush
    Kumar, Praveen
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 747 - 751
  • [24] A distributed in-memory key-value store system on heterogeneous CPU-GPU cluster
    Zhang, Kai
    Wang, Kaibo
    Yuan, Yuan
    Guo, Lei
    Li, Rubao
    Zhang, Xiaodong
    He, Bingsheng
    Hu, Jiayu
    Hua, Bei
    VLDB JOURNAL, 2017, 26 (05): : 729 - 750
  • [25] DHCRF: A Distributed Conditional Random Field Algorithm on A Heterogeneous CPU-GPU Cluster for Big Data
    Ai, Wei
    Li, Kenli
    Chen, Cen
    Peng, Jiwu
    Li, Keqin
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 2372 - 2379
  • [26] High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms
    Teodoro, George
    Pan, Tony
    Kurc, Tahsin M.
    Kong, Jun
    Cooper, Lee A. D.
    Podhorszki, Norbert
    Klasky, Scott
    Saltz, Joel H.
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 103 - 114
  • [27] 3D DSMC Computations on a Heterogeneous CPU-GPU Cluster with a Large Number of GPUs
    Kashkovsky, Alexander
    PROCEEDINGS OF THE 29TH INTERNATIONAL SYMPOSIUM ON RAREFIED GAS DYNAMICS, 2014, 1628 : 192 - 198
  • [28] EFFICIENT PARALLEL PROCESSING BY IMPROVED CPU-GPU INTERACTION
    Khatter, Harsh
    Aggarwal, Vaishali
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 159 - 161
  • [29] Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
    Yu, Yuanhang
    Wen, Dong
    Zhang, Ying
    Wang, Xiaoyang
    Zhang, Wenjie
    Lin, Xuemin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1871 - 1876
  • [30] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403