Page Placement Strategies for GPUs within Heterogeneous Memory Systems

被引:0
|
作者
Agarwal, Neha [1 ]
Nellans, David [2 ]
Stephenson, Mark [2 ]
O'Connor, Mike [2 ]
Keckler, Stephen W. [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] NVIDIA, Westford, MA USA
关键词
Bandwidth; Page placement; Linux; Program annotation; DRAM CACHE; MANAGEMENT;
D O I
10.1145/2775054.2694381
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Systems from smartphones to supercomputers are increasingly heterogeneous, being composed of both CPUs and GPUs. To maximize cost and energy efficiency, these systems will increasingly use globally-addressable heterogeneous memory systems, making choices about memory page placement critical to performance. In this work we show that current page placement policies are not sufficient to maximize GPU performance in these heterogeneous memory systems. We propose two new page placement policies that improve GPU performance: one application agnostic and one using application profile information. Our application agnostic policy, bandwidth-aware (BW-AWARE) placement, maximizes GPU throughput by balancing page placement across the memories based on the aggregate memory bandwidth available in a system. Our simulation-based results show that BW-AWARE placement outperforms the existing Linux INTERLEAVE and LOCAL policies by 35% and 18% on average for GPU compute workloads. We build upon BW-AWARE placement by developing a compiler-based profiling mechanism that provides programmers with information about GPU application data structure access patterns. Combining this information with simple program-annotated hints about memory placement, our hint-based page placement approach performs within 90% of oracular page placement on average, largely mitigating the need for costly dynamic page tracking and migration.
引用
收藏
页码:607 / 618
页数:12
相关论文
共 50 条
  • [21] Intelligent Page Migration on Heterogeneous Memory by Using Transformer
    Pei, Songwen
    Qin, Wei
    Li, Jianan
    Tan, Junhao
    Tang, Jie
    Gaudiot, Jean-Luc
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (5-6) : 380 - 399
  • [22] THE ORGANIZATION OF THE PAGE MEMORY OF MICROPROCESSOR SYSTEMS
    POGORELOV, VS
    TOROSHANKO, YI
    KAUSTOV, VA
    TELECOMMUNICATIONS AND RADIO ENGINEERING, 1987, 41-2 (04) : 13 - 16
  • [23] PM-Migration: A Page Placement Mechanism for Real-Time Systems with Hybrid Memory Architecture
    Xu, Lidang
    Chen, Gengbin
    Li, Dingding
    Luo, Haoyu
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT V, 2024, 14491 : 313 - 324
  • [24] GAIA: An OS Page Cache for Heterogeneous Systems
    Brokhman, Tanya
    Lifshits, Pavel
    Silberstein, Mark
    PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE, 2019, : 661 - 674
  • [25] Energy Efficient Frequency Scaling on GPUs in Heterogeneous HPC Systems
    Kraljic, Karlo
    Kerger, Daniel
    Schulz, Martin
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2022, 2022, 13642 : 3 - 16
  • [26] Investigating Effectiveness of Sensor Placement Strategies in Contamination Detection within Water Distribution Systems
    Zheng, Feifei
    Du, Jiawen
    Diao, Kegong
    Zhang, Tuqiao
    Yu, Tingchao
    Shao, Yu
    JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2018, 144 (04)
  • [27] Memory Management Methodology for Application Data Structure Refinement and Placement on Heterogeneous DRAM/NVM Systems
    Katsaragakis, Manolis
    Papadopoulos, Lazaros
    Baloukas, Christos
    Soudris, Dimitrios
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 748 - 753
  • [28] On page blocks in distributed virtual memory systems
    Fellah, A.
    Abaza, M.
    IEEE Pacific RIM Conference on Communications, Computers, and Signal Processing - Proceedings, 1999, : 605 - 607
  • [29] Nimble Page Management for Tiered Memory Systems
    Yan, Zi
    Lustig, Daniel
    Nellans, David
    Bhattacharjee, Abhishek
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 331 - 345
  • [30] Devirtualizing Memory in Heterogeneous Systems
    Haria, Swapnil
    Hill, Mark D.
    Swift, Michael M.
    ACM SIGPLAN NOTICES, 2018, 53 (02) : 637 - 650