HyGain: High-performance, Energy-efficient Hybrid Gain Cell-based Cache Hierarchy

Authors
Singh, Sarabjeet [1]
Surana, Neelam [2]
Prasad, Kailash [3]
Jain, Pranjali [4]
Mekie, Joycee [3]
Awasthi, Manu [5]
Affiliations
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] NVIDIA Graph, Hyderabad, Telangana, India
[3] Indian Inst Technol, Dept Elect Engn, Gandhinagar, Gujarat, India
[4] Univ Calif Santa Barbara, Santa Barbara, CA USA
[5] Ashoka Univ, Hyderabad, Telangana, India
Keywords
Cache memory; emerging memories; Gain Cell; EMBEDDED DRAM; LOW-COST; STT-RAM; REFRESH; POWER; ARCHITECTURE; PREDICTION; SRAM;
DOI
10.1145/3572839
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this article, we propose a "full-stack" solution to designing high-capacity and low-latency on-chip cache hierarchies by starting at the circuit level of the hardware design stack. We propose a novel half-VDD precharge 2T Gain Cell (GC) design for the cache hierarchy. The GC has several desirable characteristics, including ~50% higher storage density and ~50% lower dynamic energy compared to the traditional 6T SRAM, even after accounting for peripheral circuit overheads. We also demonstrate a data retention time of 350 µs (~17.5x that of eDRAM) at 28 nm technology with VDD = 0.9 V and temperature = 27 °C that, combined with optimizations like staggered refresh, makes it an ideal candidate to architect all levels of on-chip caches. We show that, compared to 6T SRAM and for a given area budget, GC-based caches provide, on average, a 30% and 36% increase in IPC for single- and multi-programmed workloads, respectively, on contemporary workloads, including SPEC CPU 2017. We also observe dynamic energy savings of 42% and 34% for single- and multi-programmed workloads, respectively. Finally, in a quest to utilize the best of all worlds, we combine GC with STT-RAM to create hybrid hierarchies. We show that a hybrid hierarchy with GC caches at L1 and L2 and an LLC split between GC and STT-RAM provides a 46% benefit in energy-delay product (EDP) compared to an all-SRAM design, and 13% compared to an all-GC cache hierarchy, averaged across multi-programmed workloads.
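The headline figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, "benefit" is assumed to mean a fractional reduction in EDP, and the averaged percentages are assumed to compose multiplicatively, which per-workload averages need not do.

```python
def implied_baseline_retention_us(retention_us: float, ratio: float) -> float:
    """Retention time of the baseline cell implied by a reported ratio.

    Hypothetical helper: if the GC retains data for `retention_us` and that
    is `ratio` times the baseline, the baseline retains for retention_us / ratio.
    """
    return retention_us / ratio


def energy_delay_product(energy: float, delay: float) -> float:
    """EDP is the product of energy consumed and execution time."""
    return energy * delay


def relative_edp(reduction_vs_a: float, reduction_vs_b: float) -> float:
    """EDP of design B relative to design A.

    Assumes one design (here, the hybrid hierarchy) reduces EDP by
    `reduction_vs_a` against A and by `reduction_vs_b` against B, so
    hybrid = (1 - reduction_vs_a) * A = (1 - reduction_vs_b) * B.
    """
    return (1.0 - reduction_vs_a) / (1.0 - reduction_vs_b)


# Figures quoted in the abstract.
edram_retention_us = implied_baseline_retention_us(350.0, 17.5)
all_gc_vs_all_sram = relative_edp(0.46, 0.13)

print(f"Implied eDRAM retention: {edram_retention_us:.1f} us")
print(f"Implied all-GC EDP relative to all-SRAM: {all_gc_vs_all_sram:.3f}")
```

Under these assumptions, the reported 46% and 13% figures would put an all-GC hierarchy at roughly 0.62 of the all-SRAM EDP. This is a rough consistency check on the abstract, not a result stated by the authors.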
Pages: 20