Online Resource Management in Thermal and Energy Constrained Heterogeneous High Performance Computing

被引:6
|
作者
Oxley, Mark A. [1 ]
Pasricha, Sudeep [2 ]
Maciejewski, Anthony A. [1 ]
Siegel, Howard Jay [1 ,2 ]
Burns, Patrick J. [3 ]
机构
[1] Colorado State Univ, Dept Elect & Comp Engn, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
[3] Colorado State Univ, Informat Technol, Ft Collins, CO 80523 USA
关键词
heterogeneous computing; resource management; thermal-aware computing; energy-aware computing; HPC; DVFS; DATA CENTERS; POWER;
D O I
10.1109/DASC-PICom-DataCom-CyberSciTec.2016.111
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Operators of high-performance computing (HPC) facilities face conflicting trade-offs between the operating temperature of the facility, reliability of compute nodes, energy costs, and computing performance. Intelligent management of the HPC facility typically involves taking a proactive approach by predicting the thermal implications of allocating tasks to different cores around the facility. This offers the benefit of operating the HPC facility at a hotter CRAC temperature while avoiding hotspots. However, such an approach can be a time-consuming process that requires complicated air flow models to be calculated for every mapping decision. We propose a framework in which offline analysis is used to assist an online resource manager by predicting the thermal implications of mapping a given workload. The goal is to maximize the reward earned from completing tasks by their individual deadlines throughout the day, while adhering to a daily energy budget and temperature threshold constraints. We show that our proposed techniques can earn significantly greater reward than traditional load balancing and thermal management schemes.
引用
收藏
页码:604 / 611
页数:8
相关论文
共 50 条
  • [1] Resource and Energy Management in High-Performance Computing: From Heterogeneous to Exascale Systems
    Ahmad, Ishfaq
    2017 INTERNATIONAL CONFERENCE ON INFOCOM TECHNOLOGIES AND UNMANNED SYSTEMS (TRENDS AND FUTURE DIRECTIONS) (ICTUS), 2017, : 70 - 70
  • [2] Dynamic resource management in energy constrained heterogeneous computing systems using voltage scaling
    Kim, Jong-Kook
    Siegel, Howard Jay
    Maciejewski, Anthony A.
    Eigenmann, Rudolf
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, 19 (11) : 1445 - 1457
  • [3] Utility maximizing dynamic resource management in an oversubscribed energy-constrained heterogeneous computing system
    Khemka, Bhavesh
    Friese, Ryan
    Pasricha, Sudeep
    Maciejewski, Anthony A.
    Siegel, Howard Jay
    Koenig, Gregory A.
    Powers, Sarah
    Hilton, Marcia
    Rambharos, Rajendra
    Poole, Steve
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2015, 5 : 14 - 30
  • [4] Deadline and energy constrained dynamic resource allocation in a heterogeneous computing environment
    B. Dalton Young
    Jonathan Apodaca
    Luis Diego Briceño
    Jay Smith
    Sudeep Pasricha
    Anthony A. Maciejewski
    Howard Jay Siegel
    Bhavesh Khemka
    Shirish Bahirat
    Adrian Ramirez
    Yong Zou
    The Journal of Supercomputing, 2013, 63 : 326 - 347
  • [5] Deadline and energy constrained dynamic resource allocation in a heterogeneous computing environment
    Young, B. Dalton
    Apodaca, Jonathan
    Briceno, Luis Diego
    Smith, Jay
    Pasricha, Sudeep
    Maciejewski, Anthony A.
    Siegel, Howard Jay
    Khemka, Bhavesh
    Bahirat, Shirish
    Ramirez, Adrian
    Zou, Yong
    JOURNAL OF SUPERCOMPUTING, 2013, 63 (02): : 326 - 347
  • [6] Online Distributed Offloading and Computing Resource Management With Energy Harvesting for Heterogeneous MEC-Enabled IoT
    Xia, Shichao
    Yao, Zhixiu
    Li, Yun
    Mao, Shiwen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (10) : 6743 - 6757
  • [7] Predictive Resource Management for Next-Generation High-Performance Computing Heterogeneous Platforms
    Massari, Giuseppe
    Pupykina, Anna
    Agosta, Giovanni
    Fornaciari, William
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 470 - 483
  • [8] Thermal, Power, and Co-location Aware Resource Allocation in Heterogeneous High Performance Computing Systems
    Oxley, Mark A.
    Jonardi, Eric
    Pasricha, Sudeep
    Maciejewski, Anthony A.
    Koenig, Gregory A.
    Siegel, Howard Jay
    2014 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2014,
  • [9] Resilient heterogeneous power and energy constrained computing
    Lynar, T. M.
    Steer, K. C. B.
    Eng, F.
    Smith, O.
    ENERGY SYSTEMS-OPTIMIZATION MODELING SIMULATION AND ECONOMIC ASPECTS, 2014, 5 (01): : 145 - 161
  • [10] Energy and thermal models for simulation of workload and resource management in computing systems
    Piatek, Wojciech
    Oleksiak, Ariel
    Da Costa, Georges
    SIMULATION MODELLING PRACTICE AND THEORY, 2015, 58 : 40 - 54