LARA: Locality-aware resource allocation to improve GPU memory-access time

被引：0

作者：

Hossein BiTalebi

Farshad Safaei

机构：

[1] Shahid Beheshti University,Faculty of Computer Science and Engineering

来源：

The Journal of Supercomputing | 2021年 / 77卷

关键词：

Cache contention; Memory divergence; Graphics Processing Unit (GPU); GPU-NoC; Interconnection network; Locality; Memory; Priority; Row access; Stall time;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Memory access as a primary performance bottleneck of each processing unit also plays a significant role in GPU performance. In addition to high challenging parts of GPU’s memory access path, the low locality property among the requests considerably increases the memory access delay. Despite the GPU’s immense processing power, they cannot reach their maximum throughput values because of the memory access bottlenecks. Memory divergence and miss locality among the L1 missed requests significantly impose the Last-Level-Cache contention and main memory row switching overheads. In addition, interconnection network routes the request packets regardless of locality properties, such routing algorithm considerably disrupts the locality among the requests.

引用

页码：14438 / 14460

页数：22

共 50 条

[1] LARA: Locality-aware resource allocation to improve GPU memory-access time
BiTalebi, Hossein
Safaei, Farshad
JOURNAL OF SUPERCOMPUTING, 2021, 77 (12): : 14438 - 14460
[2] Locality-Aware GPU Register File
Jeon, Hyeran
Esfeden, Hodjat Asghari
Abu-Ghazaleh, Nael B.
Wong, Daniel
Elango, Sindhuja
IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (02) : 153 - 156
[3] Locality-aware Optimizations for Improving Remote Memory Latency in Multi-GPU Systems
Belayneh, Leul
Ye, Haojie
Chen, Kuan-Yu
Blaauw, David
Mudge, Trevor
Dreslinski, Ronald
Talati, Nishil
PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 304 - 316
[4] Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET
Pan, Cheng
Wang, Xiaolin
Luo, Yingwei
Wang, Zhenlin
ACM TRANSACTIONS ON STORAGE, 2021, 17 (02)
[5] Locality-Aware Vertex Scheduling for GPU-based Graph Computation
Park, Hyunsun
Ahn, Junwhan
Park, Eunhyeok
Yoo, Sungjoo
2015 IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2015, : 195 - 200
[6] SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
Kal, Hongju
Lee, Seokmin
Ko, Gun
Ro, Won Woo
2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 679 - 691
[7] Memory-Access Aware DVFS for Network-on-Chip in CMPs
Yao, Yuan
Lu, Zhonghai
PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1433 - 1436
[8] Optimizing Locality-Aware Memory Management of Key-Value Caches
Hu, Xiameng
Wang, Xiaolin
Zhou, Lan
Luo, Yingwei
Ding, Chen
Jiang, Song
Wang, Zhenlin
IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (05) : 862 - 875
[9] Locality-Aware Memory Association for Multi-Target Worksharing in OpenMP
Scogland, Thomas R. W.
Feng, Wu-Chun
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 515 - 516
[10] Locality-aware allocation of multi-dimensional correlated files on the cloud platform
Zhang, Xiaofei
Tong, Yongxin
Chen, Lei
Wang, Min
Feng, Shicong
DISTRIBUTED AND PARALLEL DATABASES, 2015, 33 (03) : 353 - 380

← 1 2 3 4 5 →