Performance and Energy-Efficient Design of STT-RAM Last-Level Cache

Cited by: 19
Authors
Hameed, Fazal [1, 2]
Khan, Asif Ali [1]
Castrillon, Jeronimo [1]
Affiliations
[1] Tech Univ Dresden, Chair Compiler Construct, D-01069 Dresden, Germany
[2] Inst Space Technol, Islamabad 44000, Pakistan
Keywords
Architecture; cache; embedded systems; memory; memory hierarchy; CHIP DRAM CACHE;
DOI
10.1109/TVLSI.2018.2804938
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Recent research has proposed having a die-stacked last-level cache (LLC) to overcome the memory wall. Lately, spin-transfer-torque random access memory (STT-RAM) caches have received attention, since they provide improved energy efficiency compared with DRAM caches. However, recently proposed STT-RAM cache architectures unnecessarily dissipate energy by fetching unneeded cache lines (CLs) into the row buffer (RB). In this paper, we propose a selective read policy for the STT-RAM which fetches those CLs into the RB that are likely to be reused. In addition, we propose a tags-update policy that reduces the number of STT-RAM writebacks. This reduces the number of reads/writes and thereby decreases the energy consumption. To reduce the latency penalty of our selective read policy, we propose the following performance optimizations: 1) an RB tags-bypass policy that reduces STT-RAM access latency; 2) an LLC data cache that stores the CLs that are likely to be used in the near future; 3) an address organization scheme that simultaneously reduces LLC access latency and miss rate; and 4) a tags-to-column mapping policy that improves access parallelism. For evaluation, we implement our proposed architecture in the Zesto simulator and run different combinations of SPEC2006 benchmarks on an eight-core system. We compare our approach with a recently proposed STT-RAM LLC with subarray parallelism support and show that our synergistic policies reduce the average LLC dynamic energy consumption by 75% and improve the system performance by 6.5%. Compared with the state-of-the-art DRAM LLC with subarray parallelism, our architecture reduces the LLC dynamic energy consumption by 82% and improves system performance by 6.8%.
Pages: 1059 - 1072
Number of pages: 14
Related papers
50 in total
  • [11] Energy-Efficient Runtime Adaptable L1 STT-RAM Cache Design
    Kuan, Kyle
    Adegbija, Tosiron
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (06) : 1328 - 1339
  • [12] Dynamically Reconfigurable Hybrid Cache: An Energy-Efficient Last-Level Cache Design
    Chen, Yu-Ting
    Cong, Jason
    Huang, Hui
    Liu, Bin
    Liu, Chunyue
    Potkonjak, Miodrag
    Reinman, Glenn
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 45 - 50
  • [13] Microarchitectural Exploration of STT-MRAM Last-level Cache Parameters for Energy-efficient Devices
    Marinelli, Tommaso
    Gomez Perez, J. Ignacio
    Tenllado, Christian
    Komalan, Manu
    Gupta, Mohit
    Catthoor, Francky
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (01)
  • [14] Preventing STT-RAM Last-Level Caches from Port Obstruction
    Wang, Jue
    Dong, Xiangyu
    Xie, Yuan
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (03) : 3 - 21
  • [15] AOS: Adaptive Overwrite Scheme for Energy-Efficient MLC STT-RAM Cache
    Chen, Xunchao
    Khoshavi, Navid
    Zhou, Jian
    Huang, Dan
    DeMara, Ronald F.
    Wang, Jun
    Wen, Wujie
    Chen, Yiran
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [16] A Novel Hybrid DRAM/STT-RAM Last-Level-Cache Architecture for Performance, Energy, and Endurance Enhancement
    Hameed, Fazal
    Castrillon, Jeronimo
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (10) : 2375 - 2386
  • [17] Design of Last-Level On-Chip Cache Using Spin-Torque Transfer RAM (STT RAM)
    Xu, Wei
    Sun, Hongbin
    Wang, Xiaobin
    Chen, Yiran
    Zhang, Tong
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (03) : 483 - 493
  • [18] Coding Last Level STT-RAM Cache For High Endurance And Low Power
    Yazdanshenas, Sadegh
    Pirbasti, Marzieh Ranjbar
    Fazeli, Mahdi
    Patooghy, Ahmad
    IEEE COMPUTER ARCHITECTURE LETTERS, 2014, 13 (02) : 73 - 76
  • [19] An Adjacent-Line-Merging Writeback Scheme for STT-RAM Last-Level Caches
    Sato, Masayuki
    Sakai, Zentaro
    Egawa, Ryusuke
    Kobayashi, Hiroaki
    2017 IEEE SYMPOSIUM IN LOW-POWER AND HIGH-SPEED CHIPS (COOL CHIPS), 2017,
  • [20] Periodic learning-based region selection for energy-efficient MLC STT-RAM cache
    Fanfan Shen
    Yanxiang He
    Jun Zhang
    Chao Xu
    The Journal of Supercomputing, 2019, 75 : 6220 - 6238