Page Placement Strategies for GPUs within Heterogeneous Memory Systems

被引:0
|
作者
Agarwal, Neha [1 ]
Nellans, David [2 ]
Stephenson, Mark [2 ]
O'Connor, Mike [2 ]
Keckler, Stephen W. [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] NVIDIA, Westford, MA USA
关键词
Bandwidth; Page placement; Linux; Program annotation; DRAM CACHE; MANAGEMENT;
D O I
10.1145/2775054.2694381
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Systems from smartphones to supercomputers are increasingly heterogeneous, being composed of both CPUs and GPUs. To maximize cost and energy efficiency, these systems will increasingly use globally-addressable heterogeneous memory systems, making choices about memory page placement critical to performance. In this work we show that current page placement policies are not sufficient to maximize GPU performance in these heterogeneous memory systems. We propose two new page placement policies that improve GPU performance: one application agnostic and one using application profile information. Our application agnostic policy, bandwidth-aware (BW-AWARE) placement, maximizes GPU throughput by balancing page placement across the memories based on the aggregate memory bandwidth available in a system. Our simulation-based results show that BW-AWARE placement outperforms the existing Linux INTERLEAVE and LOCAL policies by 35% and 18% on average for GPU compute workloads. We build upon BW-AWARE placement by developing a compiler-based profiling mechanism that provides programmers with information about GPU application data structure access patterns. Combining this information with simple program-annotated hints about memory placement, our hint-based page placement approach performs within 90% of oracular page placement on average, largely mitigating the need for costly dynamic page tracking and migration.
引用
收藏
页码:607 / 618
页数:12
相关论文
共 50 条
  • [1] Hierarchical Page Eviction Policy for Unified Memory in GPUs
    Yu, Qi
    Childers, Bruce
    Huang, Libo
    Qian, Cheng
    Wang, Zhiying
    2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 149 - 150
  • [2] ANALYSIS OF PAGE ALLOCATION STRATEGIES FOR MULTIPROGRAMMING SYSTEMS WITH VIRTUAL MEMORY
    CHAMBERLIN, DD
    FULLER, SH
    LIU, LY
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1973, 17 (05) : 404 - 412
  • [3] Adaptive Security Support for Heterogeneous Memory on GPUs
    Yuan, Shougang
    Awad, Amro
    Yudha, Ardhi Wiratama Baskara
    Solihin, Yan
    Zhou, Huiyang
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 213 - 228
  • [4] On-the-fly Page Migration and Address Reconciliation for Heterogeneous Memory Systems
    Islam, Mahzabeen
    Adavally, Shashank
    Scrbak, Marko
    Kavi, Krishna
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2020, 16 (01)
  • [5] HPE: Hierarchical Page Eviction Policy for Unified Memory in GPUs
    Yu, Qi
    Childers, Bruce
    Huang, Libo
    Qian, Cheng
    Wang, Zhiying
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (10) : 2461 - 2474
  • [6] Coordinated Page Prefetch and Eviction for Memory Oversubscription Management in GPUs
    Yu, Qi
    Childers, Bruce
    Huang, Libo
    Qian, Cheng
    Guo, Hui
    Wang, Zhiying
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 472 - 482
  • [7] Overcoming Memory Capacity Wall of GPUs With Heterogeneous Memory Stack
    Hong, Jeongmin
    Cho, Sungjun
    Kim, Gwangsun
    IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (02) : 61 - 64
  • [8] Optimal Data Placement for Heterogeneous Cache, Memory, and Storage Systems
    Zhang, Lei
    Karimi, Reza
    Ahmad, Irfan
    Vigfusson, Ymir
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2020, 4 (01)
  • [9] Compiler-assisted Data Placement for Heterogeneous Memory Systems
    Kim, Hwajung
    IEICE ELECTRONICS EXPRESS, 2024, 21 (19):
  • [10] Optimal Data Placement for Heterogeneous Cache, Memory, and Storage Systems
    Zhang L.
    Karimi R.
    Ahmad I.
    Vigfusson Y.
    Zhang, Lei (lei.zhang@emory.edu), 1600, Association for Computing Machinery (48): : 85 - 86