Performance and Energy-Efficient Design of STT-RAM Last-Level Cache

被引:19
|
作者
Hameed, Fazal [1 ,2 ]
Khan, Asif Ali [1 ]
Castrillon, Jeronimo [1 ]
机构
[1] Tech Univ Dresden, Chair Compiler Consruct, D-01069 Dresden, Germany
[2] Inst Space Technol, Islamabad 44000, Pakistan
关键词
Architecture; cache; embedded systems; memory; memory hierarchy; CHIP DRAM CACHE;
D O I
10.1109/TVLSI.2018.2804938
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent research has proposed having a die-stacked last-level cache (LLC) to overcome the memory wall. Lately, spin-transfer-torque random access memory (STT-RAM) caches have received attention, since they provide improved energy efficiency compared with DRAM caches. However, recently proposed STT-RAM cache architectures unnecessarily dissipate energy by fetching unneeded cache lines (CLs) into the row buffer (RB). In this paper, we propose a selective read policy for the STT-RAM which fetches those CLs into the RB that are likely to be reused. In addition, we propose a tags-update policy that reduces the number of STT-RAM writebacks. This reduces the number of reads/writes and thereby decreases the energy consumption. To reduce the latency penalty of our selective read policy, we propose the following performance optimizations: 1) an RB tags-bypass policy that reduces STT-RAM access latency; 2) an LLC data cache that stores the CLs that are likely to be used in the near future; 3) an address organization scheme that simultaneously reduces LLC access latency and miss rate; and 4) a tags-to-column mapping policy that improves access parallelism. For evaluation, we implement our proposed architecture in the Zesto simulator and run different combinations of SPEC2006 benchmarks on an eight-core system. We compare our approach with a recently proposed STT-RAM LLC with subarray parallelism support and show that our synergistic policies reduce the average LLC dynamic energy consumption by 75% and improve the system performance by 6.5%. Compared with the state-of-the-art DRAM LLC with subarray parallelism, our architecture reduces the LLC dynamic energy consumption by 82% and improves system performance by 6.8%.
引用
收藏
页码:1059 / 1072
页数:14
相关论文
共 50 条
  • [1] Energy-Efficient Exclusive Last-Level Hybrid Caches Consisting of SRAM and STT-RAM
    Kim, Namhyung
    Ahn, Junwhan
    Seo, Woong
    Choi, Kiyoung
    2015 IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2015, : 183 - 188
  • [2] Architecting the Last-Level Cache for GPUs using STT-RAM Technology
    Samavatian, Mohammad Hossein
    Arjomand, Mohammad
    Bashizade, Ramin
    Sarbazi-Azad, Hamid
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2015, 20 (04)
  • [3] HALLS: An Energy-Efficient Highly Adaptable Last Level STT-RAM Cache for Multicore Systems
    Kuan, Kyle
    Adegbija, Tosiron
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (11) : 1635 - 1646
  • [4] Prefetching Techniques for STT-RAM based Last-level Cache in CMP Systems
    Mao, Mengjie
    Sun, Guangyu
    Li, Yong
    Jones, Alex K.
    Chen, Yiran
    2014 19TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2014, : 67 - 72
  • [5] An Efficient STT-RAM Last Level Cache Architecture for GPUs
    Samavatian, Mohammad Hossein
    Abbasitabar, Hamed
    Arjomand, Mohammad
    Sarbazi-Azad, Hamid
    2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
  • [6] Prediction Hybrid Cache: An Energy-Efficient STT-RAM Cache Architecture
    Ahn, Junwhan
    Yoo, Sungjoo
    Choi, Kiyoung
    IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (03) : 940 - 951
  • [7] TSE: Two-Step Elimination for MLC STT-RAM Last-Level Cache
    Hsieh, Jen-Wei
    Liu, Yi-Yu
    Lee, Hung-Tse
    Chang, Tai
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (09) : 1498 - 1510
  • [8] Design of an area and energy-efficient last-level cache memory using STT-MRAM
    Saha, Rajesh
    Pundir, Yogendra Pratap
    Pal, Pankaj Kumar
    JOURNAL OF MAGNETISM AND MAGNETIC MATERIALS, 2021, 529
  • [9] Efficient STT-RAM Last-Level-Cache Architecture to Replace DRAM Cache
    Hameed, Fazal
    Menard, Christian
    Castrillon, Jeronimo
    MEMSYS 2017: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2017, : 141 - 151
  • [10] OAP: An Obstruction-Aware Cache Management Policy for STT-RAM Last-Level Caches
    Wang, Jue
    Dong, Xiangyu
    Xie, Yuan
    DESIGN, AUTOMATION & TEST IN EUROPE, 2013, : 847 - 852