Online Resource Management in Thermal and Energy Constrained Heterogeneous High Performance Computing

被引:6
|
作者
Oxley, Mark A. [1 ]
Pasricha, Sudeep [2 ]
Maciejewski, Anthony A. [1 ]
Siegel, Howard Jay [1 ,2 ]
Burns, Patrick J. [3 ]
机构
[1] Colorado State Univ, Dept Elect & Comp Engn, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
[3] Colorado State Univ, Informat Technol, Ft Collins, CO 80523 USA
关键词
heterogeneous computing; resource management; thermal-aware computing; energy-aware computing; HPC; DVFS; DATA CENTERS; POWER;
D O I
10.1109/DASC-PICom-DataCom-CyberSciTec.2016.111
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Operators of high-performance computing (HPC) facilities face conflicting trade-offs between the operating temperature of the facility, reliability of compute nodes, energy costs, and computing performance. Intelligent management of the HPC facility typically involves taking a proactive approach by predicting the thermal implications of allocating tasks to different cores around the facility. This offers the benefit of operating the HPC facility at a hotter CRAC temperature while avoiding hotspots. However, such an approach can be a time-consuming process that requires complicated air flow models to be calculated for every mapping decision. We propose a framework in which offline analysis is used to assist an online resource manager by predicting the thermal implications of mapping a given workload. The goal is to maximize the reward earned from completing tasks by their individual deadlines throughout the day, while adhering to a daily energy budget and temperature threshold constraints. We show that our proposed techniques can earn significantly greater reward than traditional load balancing and thermal management schemes.
引用
收藏
页码:604 / 611
页数:8
相关论文
共 50 条
  • [21] Reliability-oriented resource management for High-Performance Computing
    Massari, Giuseppe
    Peta, Miriam
    Campi, Alessandro
    Reghenzani, Federico
    Terraneo, Federico
    Agosta, Giovanni
    Fornaciari, William
    Ciesielski, Sebastian
    Kulczewski, Michal
    Piatek, Wojciech
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2023, 39
  • [22] Resource management and scheduling for high performance computing application based on WSRF
    Weng, CL
    Li, ML
    Lu, XD
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 718 - 729
  • [23] Energy and Area Efficiency in Neuromorphic Computing for Resource Constrained Devices
    Chakma, Gangotree
    Skuda, Nicholas D.
    Schuman, Catherine D.
    Plank, James S.
    Dean, Mark E.
    Rose, Garrett S.
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 379 - 383
  • [24] Resource-constrained management of heterogeneous assets with stochastic deterioration
    Ohlmann, Jeffrey W.
    Bean, James C.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 199 (01) : 198 - 208
  • [25] High Performance Computing Algorithm and Software for Heterogeneous Computing
    Xu S.
    Wang W.
    Zhang J.
    Jiang J.-R.
    Jin Z.
    Chi X.-B.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (08): : 2365 - 2376
  • [26] Morphing Bus: A rapid deployment computing architecture for high performance, resource-constrained robots
    D'Souza, Colin
    Kim, Byung Hwa
    Voyles, Richard
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 311 - 316
  • [27] ENERGY AND PERFORMANCE CHARACTERIZATION OF MOBILE HETEROGENEOUS COMPUTING
    Wang, Yi-Chu
    Cheng, Kwang-Ting
    2012 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2012, : 312 - 317
  • [28] Energy Efficient Resource Allocation for Heterogeneous Workload in Cloud Computing
    Malik, Surbhi
    Saini, Poonam
    Rani, Sudesh
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 89 - 97
  • [29] Intelligent control agents for resource management of heterogeneous parallel computing
    Lambert, AB
    King, RL
    Russ, SH
    Reese, DS
    COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - EVOLUTIONARY COMPUTATION & FUZZY LOGIC FOR INTELLIGENT CONTROL, KNOWLEDGE ACQUISITION & INFORMATION RETRIEVAL, 1999, 55 : 116 - 121
  • [30] Utility Functions and Resource Management in an Oversubscribed Heterogeneous Computing Environment
    Khemka, Bhavesh
    Friese, Ryan
    Briceno, Luis D.
    Siegel, Howard Jay
    Maciejewski, Anthony A.
    Koenig, Gregory A.
    Groer, Chris
    Okonski, Gene
    Hilton, Marcia M.
    Rambharos, Rajendra
    Poole, Steve
    IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (08) : 2394 - 2407