LARA: Locality-aware resource allocation to improve GPU memory-access time

被引:0
|
作者
Hossein BiTalebi
Farshad Safaei
机构
[1] Shahid Beheshti University,Faculty of Computer Science and Engineering
来源
关键词
Cache contention; Memory divergence; Graphics Processing Unit (GPU); GPU-NoC; Interconnection network; Locality; Memory; Priority; Row access; Stall time;
D O I
暂无
中图分类号
学科分类号
摘要
Memory access as a primary performance bottleneck of each processing unit also plays a significant role in GPU performance. In addition to high challenging parts of GPU’s memory access path, the low locality property among the requests considerably increases the memory access delay. Despite the GPU’s immense processing power, they cannot reach their maximum throughput values because of the memory access bottlenecks. Memory divergence and miss locality among the L1 missed requests significantly impose the Last-Level-Cache contention and main memory row switching overheads. In addition, interconnection network routes the request packets regardless of locality properties, such routing algorithm considerably disrupts the locality among the requests.
引用
收藏
页码:14438 / 14460
页数:22
相关论文
共 50 条
  • [1] LARA: Locality-aware resource allocation to improve GPU memory-access time
    BiTalebi, Hossein
    Safaei, Farshad
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (12): : 14438 - 14460
  • [2] Locality-Aware GPU Register File
    Jeon, Hyeran
    Esfeden, Hodjat Asghari
    Abu-Ghazaleh, Nael B.
    Wong, Daniel
    Elango, Sindhuja
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (02) : 153 - 156
  • [3] Locality-aware Optimizations for Improving Remote Memory Latency in Multi-GPU Systems
    Belayneh, Leul
    Ye, Haojie
    Chen, Kuan-Yu
    Blaauw, David
    Mudge, Trevor
    Dreslinski, Ronald
    Talati, Nishil
    PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 304 - 316
  • [4] Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET
    Pan, Cheng
    Wang, Xiaolin
    Luo, Yingwei
    Wang, Zhenlin
    ACM TRANSACTIONS ON STORAGE, 2021, 17 (02)
  • [5] Locality-Aware Vertex Scheduling for GPU-based Graph Computation
    Park, Hyunsun
    Ahn, Junwhan
    Park, Eunhyeok
    Yoo, Sungjoo
    2015 IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2015, : 195 - 200
  • [6] SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
    Kal, Hongju
    Lee, Seokmin
    Ko, Gun
    Ro, Won Woo
    2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 679 - 691
  • [7] Memory-Access Aware DVFS for Network-on-Chip in CMPs
    Yao, Yuan
    Lu, Zhonghai
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1433 - 1436
  • [8] Optimizing Locality-Aware Memory Management of Key-Value Caches
    Hu, Xiameng
    Wang, Xiaolin
    Zhou, Lan
    Luo, Yingwei
    Ding, Chen
    Jiang, Song
    Wang, Zhenlin
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (05) : 862 - 875
  • [9] Locality-Aware Memory Association for Multi-Target Worksharing in OpenMP
    Scogland, Thomas R. W.
    Feng, Wu-Chun
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 515 - 516
  • [10] Locality-aware allocation of multi-dimensional correlated files on the cloud platform
    Zhang, Xiaofei
    Tong, Yongxin
    Chen, Lei
    Wang, Min
    Feng, Shicong
    DISTRIBUTED AND PARALLEL DATABASES, 2015, 33 (03) : 353 - 380