Performance and Energy-Efficient Design of STT-RAM Last-Level Cache

被引:19
|
作者
Hameed, Fazal [1 ,2 ]
Khan, Asif Ali [1 ]
Castrillon, Jeronimo [1 ]
机构
[1] Tech Univ Dresden, Chair Compiler Consruct, D-01069 Dresden, Germany
[2] Inst Space Technol, Islamabad 44000, Pakistan
关键词
Architecture; cache; embedded systems; memory; memory hierarchy; CHIP DRAM CACHE;
D O I
10.1109/TVLSI.2018.2804938
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent research has proposed having a die-stacked last-level cache (LLC) to overcome the memory wall. Lately, spin-transfer-torque random access memory (STT-RAM) caches have received attention, since they provide improved energy efficiency compared with DRAM caches. However, recently proposed STT-RAM cache architectures unnecessarily dissipate energy by fetching unneeded cache lines (CLs) into the row buffer (RB). In this paper, we propose a selective read policy for the STT-RAM which fetches those CLs into the RB that are likely to be reused. In addition, we propose a tags-update policy that reduces the number of STT-RAM writebacks. This reduces the number of reads/writes and thereby decreases the energy consumption. To reduce the latency penalty of our selective read policy, we propose the following performance optimizations: 1) an RB tags-bypass policy that reduces STT-RAM access latency; 2) an LLC data cache that stores the CLs that are likely to be used in the near future; 3) an address organization scheme that simultaneously reduces LLC access latency and miss rate; and 4) a tags-to-column mapping policy that improves access parallelism. For evaluation, we implement our proposed architecture in the Zesto simulator and run different combinations of SPEC2006 benchmarks on an eight-core system. We compare our approach with a recently proposed STT-RAM LLC with subarray parallelism support and show that our synergistic policies reduce the average LLC dynamic energy consumption by 75% and improve the system performance by 6.5%. Compared with the state-of-the-art DRAM LLC with subarray parallelism, our architecture reduces the LLC dynamic energy consumption by 82% and improves system performance by 6.8%.
引用
收藏
页码:1059 / 1072
页数:14
相关论文
共 50 条
  • [41] ABACa: Access Based Allocation on Set Wise Multi-Retention in STT-RAM Last Level Cache
    Agarwal, Sukarn
    Chakraborty, Shounak
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 171 - 174
  • [42] Taming Process Variations in CNFET for Efficient Last-Level Cache Design
    Xu, Dawen
    Feng, Zhuangyu
    Liu, Cheng
    Li, Li
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (04) : 418 - 431
  • [43] Multi Retention Level STT-RAM Cache Designs with a Dynamic Refresh Scheme
    Sun, Zhenyu
    Bi, Xiuyuan
    Li, Hai
    Wong, Weng-Fai
    Ong, Zhong-Liang
    Zhu, Xiaochun
    Wu, Wenqing
    PROCEEDINGS OF THE 2011 44TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 44), 2011, : 329 - 338
  • [44] Rethinking Last-level-cache Write-back Strategy for MLC STT-RAM Main Memory with Asymmetric Write Energy
    Liang, Yu-Pei
    Chen, Tseng-Yi
    Chang, Yuan-Hao
    Chen, Shuo-Han
    Chen, Pei-Yu
    Shih, Wei-Kuan
    2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [45] Low-Energy Volatile STT-RAM Cache Design Using Cache-Coherence-Enabled Adaptive Refresh
    Li, Jianhua
    Shi, Liang
    Li, Qingan
    Xue, Chun Jason
    Chen, Yiran
    Xu, Yinlong
    Wang, Wei
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2013, 19 (01)
  • [46] An Energy-efficent Scheme for STT-RAM L1 Cache
    Yao, Jun
    Ma, Jianliang
    Chen, Tianzhou
    Hu, Tongsen
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1345 - 1350
  • [47] Energy-Aware Adaptive Restore Schemes for MLC STT-RAM Cache
    Chen, Xunchao
    Khoshavi, Navid
    DeMara, Ronald F.
    Wang, Jun
    Huang, Dan
    Wen, Wujie
    Chen, Yiran
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (05) : 786 - 798
  • [48] Dynamic Segmented Bus for Energy-Efficient Last-Level Cache in Advanced Interconnect-Dominant Nodes
    Mayahinia, Mahta
    Marinelli, Tommaso
    Pei, Zhenlin
    Liu, Hsiao-Hsuan
    Pan, Chenyun
    Tokei, Zsolt
    Catthoor, Francky
    Tahoori, Mehdi B.
    IEEE EMBEDDED SYSTEMS LETTERS, 2024, 16 (04) : 321 - 324
  • [49] A Cache Consolidation Design of MLC STT-RAM for Energy Efficiency Enhancement on Cyber-Physical Systems
    Liang, Yu-Pei
    Chen, Shuo-Han
    Chang, Yuan-Hao
    Liu, Yun-Fei
    Wei, Hsin-Wen
    Shih, Wei-Kuan
    APPLIED COMPUTING REVIEW, 2021, 21 (01): : 37 - 49
  • [50] EECache: Exploiting Design Choices in Energy-Efficient Last-Level Caches for Chip Multiprocessors
    Cheng, Hsiang-Yun
    Poremba, Matt
    Shahidi, Narges
    Stalev, Ivan
    Irwin, Mary Jane
    Kandemir, Mahmut
    Sampson, Jack
    Xie, Yuan
    PROCEEDINGS OF THE 2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2014, : 303 - 306