RLP: Power Management Based on a Latency-Aware Roofline Model

被引：0

作者：

Wang, Bo ^{[1
]}

Kozhokanova, Anara ^{[1
]}

Terboven, Christian ^{[1
]}

Mueller, Matthias ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany

来源：

2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS | 2023年

关键词：

power management; memory access latency; roofline model; PERFORMANCE;

D O I：

10.1109/IPDPS54959.2023.00052

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.

引用

页码：446 / 456

页数：11

共 50 条

[31] Green latency-aware data placement in data centers
Fan, Yuqi
Ding, Hongli
Wang, Lusheng
Yuan, Xiaojing
COMPUTER NETWORKS, 2016, 110 : 46 - 57
[32] Latency-Aware Industrial Fog Application Orchestration with Kubernetes
Eidenbenz, Raphael
Pignolet, Yvonne-Anne
Ryser, Alain
2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2020, : 164 - 171
[33] Optimization of latency-aware flow allocation in NGFI networks
Klinkowski, Miroslaw
COMPUTER COMMUNICATIONS, 2020, 161 : 344 - 359
[34] Latency-aware Scheduling in the Cloud-Edge Continuum
Chiaro, Cristopher
Monaco, Doriana
Sacco, Alessio
Casetti, Claudio
Marchetto, Guido
PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
[35] Latency-Aware Offloading for Mobile Edge Computing Networks
Feng, Wei
Liu, Hao
Yao, Yingbiao
Cao, Diqiu
Zhao, Mingxiong
IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2673 - 2677
[36] Latency-aware virtual desktops optimization in distributed clouds
Tian Guo
Prashant Shenoy
K. K. Ramakrishnan
Vijay Gopalakrishnan
Multimedia Systems, 2018, 24 : 73 - 94
[37] Energy and Latency-aware Resource Reconfiguration in Fog Environments
Godinho, Noe
Silva, Henrique
Curado, Marilia
Paquete, Luis
2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
[38] Latency-Aware Kubernetes Scheduling for Microservices Orchestration at the Edge
Centofanti, C.
Tiberti, W.
Marotta, A.
Graziosi, F.
Cassioli, D.
2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 426 - 431
[39] Latency-Aware Accelerator of SIMECK Lightweight Block Cipher
Alharbi, Adel R.
Tariq, Hassan
Aljaedi, Amer
Aljuhni, Abdullah
APPLIED SCIENCES-BASEL, 2023, 13 (01):
[40] Simulation Study on Latency-aware Network in Edge Computing
Zheng, Qinling
Ping, Zhan
2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019, : 150 - 155

← 1 2 3 4 5 →