RLP: Power Management Based on a Latency-Aware Roofline Model

Authors
Wang, Bo [1]
Kozhokanova, Anara [1]
Terboven, Christian [1]
Mueller, Matthias [1]
Affiliations
[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany
Keywords
power management; memory access latency; roofline model; PERFORMANCE;
DOI
10.1109/IPDPS54959.2023.00052
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The ever-growing power draw of high-performance computing (HPC) clusters and rising energy costs create a pressing need for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. RLP employs rigorously selected but generally available hardware performance events to construct rooflines with minimal overhead. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth-, latency-, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on server-grade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Without power capping, RLP reduces the energy-to-solution by up to 22% compared to system default settings, with negligible performance degradation. Under power capping, it accelerates execution by up to 14.7%. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.
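For orientation, the original roofline model that the abstract builds on can be sketched as follows. This is only the classic two-regime model (attainable performance is the minimum of peak compute and bandwidth times arithmetic intensity); it is not RLP itself, and it does not include the latency-bound regime, whose formulation the abstract does not give. The function names and the example numbers are illustrative assumptions.

```python
def roofline_bound(peak_flops, mem_bandwidth, arithmetic_intensity):
    """Attainable FLOP/s under the classic (latency-unaware) roofline model.

    peak_flops: peak compute throughput in FLOP/s
    mem_bandwidth: peak memory bandwidth in bytes/s
    arithmetic_intensity: FLOPs performed per byte moved
    """
    return min(peak_flops, mem_bandwidth * arithmetic_intensity)


def classify(peak_flops, mem_bandwidth, arithmetic_intensity):
    """Label a kernel relative to the ridge point of the classic roofline."""
    ridge = peak_flops / mem_bandwidth  # intensity where both limits meet
    if arithmetic_intensity >= ridge:
        return "compute-bound"
    return "bandwidth-bound"


# Hypothetical machine: 1000 FLOP/s peak, 100 bytes/s bandwidth (ridge at 10).
low = roofline_bound(1000.0, 100.0, 2.0)
high = roofline_bound(1000.0, 100.0, 20.0)
```

A kernel left of the ridge point is limited by memory bandwidth, one right of it by compute; according to the abstract, RLP adds memory access latency as a third limiting factor.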
Pages: 446-456
Number of pages: 11