Performance and Energy-Efficient Design of STT-RAM Last-Level Cache

被引：19

作者：

Hameed, Fazal ^{[1
,2
]}

Khan, Asif Ali ^{[1
]}

Castrillon, Jeronimo ^{[1
]}

机构：

[1] Tech Univ Dresden, Chair Compiler Consruct, D-01069 Dresden, Germany

[2] Inst Space Technol, Islamabad 44000, Pakistan

来源：

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS | 2018年 / 26卷 / 06期

关键词：

Architecture; cache; embedded systems; memory; memory hierarchy; CHIP DRAM CACHE;

D O I：

10.1109/TVLSI.2018.2804938

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent research has proposed having a die-stacked last-level cache (LLC) to overcome the memory wall. Lately, spin-transfer-torque random access memory (STT-RAM) caches have received attention, since they provide improved energy efficiency compared with DRAM caches. However, recently proposed STT-RAM cache architectures unnecessarily dissipate energy by fetching unneeded cache lines (CLs) into the row buffer (RB). In this paper, we propose a selective read policy for the STT-RAM which fetches those CLs into the RB that are likely to be reused. In addition, we propose a tags-update policy that reduces the number of STT-RAM writebacks. This reduces the number of reads/writes and thereby decreases the energy consumption. To reduce the latency penalty of our selective read policy, we propose the following performance optimizations: 1) an RB tags-bypass policy that reduces STT-RAM access latency; 2) an LLC data cache that stores the CLs that are likely to be used in the near future; 3) an address organization scheme that simultaneously reduces LLC access latency and miss rate; and 4) a tags-to-column mapping policy that improves access parallelism. For evaluation, we implement our proposed architecture in the Zesto simulator and run different combinations of SPEC2006 benchmarks on an eight-core system. We compare our approach with a recently proposed STT-RAM LLC with subarray parallelism support and show that our synergistic policies reduce the average LLC dynamic energy consumption by 75% and improve the system performance by 6.5%. Compared with the state-of-the-art DRAM LLC with subarray parallelism, our architecture reduces the LLC dynamic energy consumption by 82% and improves system performance by 6.8%.

引用

页码：1059 / 1072

页数：14

共 50 条

[31] A Novel Hybrid Last Level Cache Based on Multi-retention STT-RAM Cells
Zhang, Hongguang
Zhang, Minxuan
Zhao, Zhenyu
Tian, Shuo
ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 28 - 39
[32] Relaxing Non-Volatility for Fast and Energy-Efficient STT-RAM Caches
Smullen, Clinton W., IV
Mohan, Vidyabhushan
Nigam, Anurag
Gurumurthi, Sudhanva
Stan, Mircea R.
2011 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2011, : 50 - 61
[33] Exploration of trade-offs in the design of volatile STT-RAM cache
Kim, Namhyung
Choi, Kiyoung
JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 71 : 23 - 31
[34] Using STT-RAM to Enable Energy-Efficient Near-Threshold Chip Multiprocessors
Pan, Xiang
Teodorescu, Radu
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 485 - 486
[35] TEEMO: Temperature Aware Energy Efficient Multi-Retention STT-RAM Cache Architecture
Agarwal, Sukarn
Chakraborty, Shounak
Sjalander, Magnus
PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 852 - 864
[36] Performance and Energy Assessment of Last-Level Cache Replacement Policies
Peneau, Pierre-Yves
Novo, David
Bruguier, Florent
Sassatelli, Gilles
Gamatie, Abdoulaye
PROCEEDINGS OF 2017 FIRST INTERNATIONAL CONFERENCE ON EMBEDDED & DISTRIBUTED SYSTEMS (EDIS 2017), 2017, : 149 - 154
[37] Energy minimization in the STT-RAM-based high-capacity last-level caches
Khajekarimi, Elyas
Jamshidi, Kamal
Vafaei, Abbas
JOURNAL OF SUPERCOMPUTING, 2019, 75 (10): : 6831 - 6854
[38] An Efficient Protection Technique for Last Level STT-RAM Caches in Multi-Core Processors
Azad, Zahra
Farbeh, Hamed
Monazzah, Amir Mahdi Hosseini
Miremadi, Seyed Ghassem
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1564 - 1577
[39] Energy minimization in the STT-RAM-based high-capacity last-level caches
Elyas Khajekarimi
Kamal Jamshidi
Abbas Vafaei
The Journal of Supercomputing, 2019, 75 : 6831 - 6854
[40] STT-RAM Cache Hierarchy With Multiretention MTJ Designs
Sun, Zhenyu
Bi, Xiuyuan
Li, Hai
Wong, Weng-Fai
Zhu, Xiaochun
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (06) : 1281 - 1293

← 1 2 3 4 5 →