HyGain: High-performance, Energy-efficient Hybrid Gain Cell-based Cache Hierarchy

被引:0
|
作者
Singh, Sarabjeet [1 ]
Surana, Neelam [2 ]
Prasad, Kailash [3 ]
Jain, Pranjali [4 ]
Mekie, Joycee [3 ]
Awasthi, Manu [5 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] NVIDIA Graph, Hyderabad, Telangana, India
[3] Indian Inst Technol, Dept Elect Engn, Gandhinagar, Gujarat, India
[4] Univ Calif Santa Barbara, Santa Barbara, CA USA
[5] Ashoka Univ, Hyderabad, Telangana, India
关键词
Cache memory; emerging memories; Gain Cell; EMBEDDED DRAM; LOW-COST; STT-RAM; REFRESH; POWER; ARCHITECTURE; PREDICTION; SRAM;
D O I
10.1145/3572839
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose a "full-stack" solution to designing high-apacity and low-latency on-chip cache hierarchies by starting at the circuit level of the hardware design stack. We propose a novel half VDD precharge 2T Gain Cell (GC) design for the cache hierarchy. The GC has several desirable characteristics, including similar to 50% higher storage density and similar to 50% lower dynamic energy as compared to the traditional 6T SRAM, even after accounting for peripheral circuit overheads. We also demonstrate data retention time of 350 us (similar to 17.5x of eDRAM) at 28 nm technology with V-DD = 0.9V and temperature = 27 degrees C that, combined with optimizations like staggered refresh, makes it an ideal candidate to architect all levels of on-chip caches. We show that compared to 6T SRAM, for a given area budget, GC-based caches, on average, provide 30% and 36% increase in IPC for single- and multi-programmed workloads, respectively, on contemporary workloads, including SPEC CPU 2017. We also observe dynamic energy savings of 42% and 34% for single- and multi-programmed workloads, respectively. Finally, in a quest to utilize the best of all worlds, we combine GC with STT-RAM to create hybrid hierarchies. We show that a hybrid hierarchy with GC caches at L1 and L2 and an LLC split between GC and STT-RAM is able to provide a 46% benefit in energy-delay product (EDP) as compared to an all-SRAM design, and 13% as compared to an all-GC cache hierarchy, averaged across multi-programmed workloads.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] MOLAR: A Cost-Efficient, High-Performance SSD-Based Hybrid Storage Cache
    Liu, Yi
    Ge, Xiongzi
    Huang, Xiaoxia
    Du, David H. C.
    COMPUTER JOURNAL, 2015, 58 (09): : 2061 - 2078
  • [22] A High-Performance and Energy-Efficient Ternary Multiplier Using CNTFETs
    Abbasian, Erfan
    Sofimowloodi, Sobhan
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (11) : 14365 - 14379
  • [23] Specialized dynamic optimizations for high-performance energy-efficient microarchitecture
    Almog, Y
    Rosner, R
    Schwartz, N
    Schmorak, A
    CGO 2004: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2004, : 137 - 148
  • [24] Energy-Efficient Pipeline Templates for High-Performance Asynchronous Circuits
    Sheikh, Basit Riaz
    Manohar, Rajit
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2011, 7 (04)
  • [25] Design techniques for high-performance, energy-efficient control logic
    Ko, U
    Hill, A
    Balsara, PT
    1996 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN - DIGEST OF TECHNICAL PAPERS, 1996, : 97 - 100
  • [26] VeSense: High-performance and energy-efficient vehicular sensing platform
    Ahnn, Jong Hoon
    Potkonjak, Miodrag
    PERVASIVE AND MOBILE COMPUTING, 2014, 12 : 112 - 122
  • [27] A High-Performance and Energy-Efficient Ternary Multiplier Using CNTFETs
    Erfan Abbasian
    Sobhan Sofimowloodi
    Arabian Journal for Science and Engineering, 2023, 48 : 14365 - 14379
  • [28] Energy-Efficient Encoding for High-Performance Buses with Staggered Repeaters
    Jayaprakash, Sharath
    Mahapatra, Nihar R.
    2009 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, 2009, : 252 - 257
  • [29] Graphicionado: A High-Performance and Energy-Efficient Accelerator for Graph Analytics
    Ham, Tae Jun
    Wu, Lisa
    Sundaram, Narayanan
    Satish, Nadathur
    Martonosi, Margaret
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [30] Kickstarting High-performance Energy-efficient Manycore Architectures with Epiphany
    Olofsson, Andreas
    Nordstrom, Tomas
    Ul-Abdin, Zain
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1719 - 1726