Performance and Energy-Efficient Design of STT-RAM Last-Level Cache

被引:19
|
作者
Hameed, Fazal [1 ,2 ]
Khan, Asif Ali [1 ]
Castrillon, Jeronimo [1 ]
机构
[1] Tech Univ Dresden, Chair Compiler Consruct, D-01069 Dresden, Germany
[2] Inst Space Technol, Islamabad 44000, Pakistan
关键词
Architecture; cache; embedded systems; memory; memory hierarchy; CHIP DRAM CACHE;
D O I
10.1109/TVLSI.2018.2804938
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent research has proposed having a die-stacked last-level cache (LLC) to overcome the memory wall. Lately, spin-transfer-torque random access memory (STT-RAM) caches have received attention, since they provide improved energy efficiency compared with DRAM caches. However, recently proposed STT-RAM cache architectures unnecessarily dissipate energy by fetching unneeded cache lines (CLs) into the row buffer (RB). In this paper, we propose a selective read policy for the STT-RAM which fetches those CLs into the RB that are likely to be reused. In addition, we propose a tags-update policy that reduces the number of STT-RAM writebacks. This reduces the number of reads/writes and thereby decreases the energy consumption. To reduce the latency penalty of our selective read policy, we propose the following performance optimizations: 1) an RB tags-bypass policy that reduces STT-RAM access latency; 2) an LLC data cache that stores the CLs that are likely to be used in the near future; 3) an address organization scheme that simultaneously reduces LLC access latency and miss rate; and 4) a tags-to-column mapping policy that improves access parallelism. For evaluation, we implement our proposed architecture in the Zesto simulator and run different combinations of SPEC2006 benchmarks on an eight-core system. We compare our approach with a recently proposed STT-RAM LLC with subarray parallelism support and show that our synergistic policies reduce the average LLC dynamic energy consumption by 75% and improve the system performance by 6.5%. Compared with the state-of-the-art DRAM LLC with subarray parallelism, our architecture reduces the LLC dynamic energy consumption by 82% and improves system performance by 6.8%.
引用
收藏
页码:1059 / 1072
页数:14
相关论文
共 50 条
  • [31] A Novel Hybrid Last Level Cache Based on Multi-retention STT-RAM Cells
    Zhang, Hongguang
    Zhang, Minxuan
    Zhao, Zhenyu
    Tian, Shuo
    ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 28 - 39
  • [32] Relaxing Non-Volatility for Fast and Energy-Efficient STT-RAM Caches
    Smullen, Clinton W., IV
    Mohan, Vidyabhushan
    Nigam, Anurag
    Gurumurthi, Sudhanva
    Stan, Mircea R.
    2011 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2011, : 50 - 61
  • [33] Exploration of trade-offs in the design of volatile STT-RAM cache
    Kim, Namhyung
    Choi, Kiyoung
    JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 71 : 23 - 31
  • [34] Using STT-RAM to Enable Energy-Efficient Near-Threshold Chip Multiprocessors
    Pan, Xiang
    Teodorescu, Radu
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 485 - 486
  • [35] TEEMO: Temperature Aware Energy Efficient Multi-Retention STT-RAM Cache Architecture
    Agarwal, Sukarn
    Chakraborty, Shounak
    Sjalander, Magnus
    PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 852 - 864
  • [36] Performance and Energy Assessment of Last-Level Cache Replacement Policies
    Peneau, Pierre-Yves
    Novo, David
    Bruguier, Florent
    Sassatelli, Gilles
    Gamatie, Abdoulaye
    PROCEEDINGS OF 2017 FIRST INTERNATIONAL CONFERENCE ON EMBEDDED & DISTRIBUTED SYSTEMS (EDIS 2017), 2017, : 149 - 154
  • [37] Energy minimization in the STT-RAM-based high-capacity last-level caches
    Khajekarimi, Elyas
    Jamshidi, Kamal
    Vafaei, Abbas
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (10): : 6831 - 6854
  • [38] An Efficient Protection Technique for Last Level STT-RAM Caches in Multi-Core Processors
    Azad, Zahra
    Farbeh, Hamed
    Monazzah, Amir Mahdi Hosseini
    Miremadi, Seyed Ghassem
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1564 - 1577
  • [39] Energy minimization in the STT-RAM-based high-capacity last-level caches
    Elyas Khajekarimi
    Kamal Jamshidi
    Abbas Vafaei
    The Journal of Supercomputing, 2019, 75 : 6831 - 6854
  • [40] STT-RAM Cache Hierarchy With Multiretention MTJ Designs
    Sun, Zhenyu
    Bi, Xiuyuan
    Li, Hai
    Wong, Weng-Fai
    Zhu, Xiaochun
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (06) : 1281 - 1293