Selective Caching: Avoiding Performance Valleys in Massively Parallel Architectures

被引:0
|
作者
Jadidi, Amin [1 ]
Kandemir, Mahmut T. [2 ]
Das, Chita R. [2 ]
机构
[1] Cadence Design Syst, San Jose, CA 95134 USA
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
来源
2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020) | 2020年
关键词
D O I
10.1109/PDP50117.2020.00051
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Emerging general purpose graphics processing units (GPGPU) make use of a memory hierarchy very similar to that of modern multi-core processors they typically have multiple levels of on-chip caches and a DDR-like off-chip main memory. In such massively parallel architectures, caches are expected to reduce the average data access latency by reducing the number of off-chip memory accesses; however, our extensive experimental studies confirm that not all applications utilize the on-chip caches in an efficient manner. Even though GPGPUs are adopted to run a wide range of general purpose applications, the conventional cache management policies are incapable of achieving the optimal performance over different memory characteristics of the applications. This paper first investigates the underlying reasons for inefficiency of common cache management policies in GPGPUs. To address and resolve those issues, we then propose (i) a characterization mechanism to analyze each kernel at runtime and, (ii) a selective caching policy to manage the flow of cache accesses. Evaluation results of the studied platform show that our proposed dynamically reconfigurable cache hierarchy improves the system performance by up to 105% (average of 27%) over a wide range of modern GPGPU applications, which is within 10% of the optimal improvement.
引用
收藏
页码:290 / 298
页数:9
相关论文
共 50 条
  • [31] A massively parallel adaptive fast multipole method on heterogeneous architectures
    Institute for Scientific Computing Research, Lawrence Livermore National Laboratory, Livermore, CA, United States
    不详
    不详
    不详
    不详
    不详
    Commun ACM, 5 (101-109):
  • [32] Solving the Bethe-Salpeter equation on massively parallel architectures
    Zhang, Xiao
    Achilles, Sebastian
    Winkelmann, Jan
    Haas, Roland
    Schleife, Andre
    Di Napoli, Edoardo
    COMPUTER PHYSICS COMMUNICATIONS, 2021, 267
  • [33] Fast Direct Solver for Integral Equations on Massively Parallel Architectures
    Augonnet, C.
    Pujols, A.
    Sesques, M.
    2015 USNC-URSI RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM) PROCEEDINGS, 2015, : 149 - 149
  • [34] N-BODY SIMULATIONS ON MASSIVELY-PARALLEL ARCHITECTURES
    STILLER, L
    DAEMEN, LL
    GUBERNATIS, JE
    JOURNAL OF COMPUTATIONAL PHYSICS, 1994, 115 (02) : 550 - 552
  • [35] Massively Parallel EEG Algorithms for Pre-exascale Architectures
    Wang, Zeyu
    Juhasz, Zoltan
    EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT II, EURO-PAR 2023, 2024, 14352 : 290 - 295
  • [36] Accelerating Constrained Sparse Tensor Factorization on Massively Parallel Architectures
    Soh, Yongseok
    Sao, Piyush
    Kannan, Ramakrishnan
    Choi, Jee
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 107 - 116
  • [37] Web caching: Architectures and performance evaluation survey
    Lai, GP
    Liu, MK
    Wang, FY
    Zeng, D
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 3039 - 3044
  • [38] Parallel sphere detector algorithm providing optimal MIMO detection on massively parallel architectures
    Jozsa, Csaba M.
    Kolumban, Geza
    Vidal, Antonio M.
    Martinez-Zaldivar, Francisco J.
    Gonzalez, Alberto
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 4993 - 5018
  • [39] Interior Point Methods for the Inverse Medium Problem on Massively Parallel Architectures
    Grote, M. J.
    Huber, J.
    Schenk, O.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1466 - 1474
  • [40] A Design of Autonomous Error-Tolerant Architectures for Massively Parallel Computing
    Liu, Lizheng
    Jin, Yi
    Liu, Yi
    Ma, Ning
    Huan, Yuxiang
    Zou, Zhuo
    Zheng, Lirong
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (10) : 2143 - 2154