Improving Performance and Energy-efficiency of DNN Accelerators with STT-RAM Buffers

被引:0
|
作者
Byeon, Gwangeun [1 ]
Kim, Seongwook [1 ]
Hong, Seokin [1 ]
机构
[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
关键词
Deep Learning Accelerator; STT-MRAM;
D O I
10.1109/ISOCC59558.2023.10396473
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
DNN inference on mobile and edge devices is challenging due to high computational and storage demands. To accelerate the inference on these devices, various DNN accelerators have been proposed. In these accelerators, the on-chip buffers occupy a significant portion of the chip area because they need to be large enough to minimize the off-chip memory accesses and usually implemented with SRAM cells. This paper presents a DNN accelerator that utilizes Spin-Transfer Torque RAM (STT-RAM) to build large buffers with a low area budget. By exploiting the access patterns of activations and weights in DNN inference, we optimize the STT-RAM to have short write latency and low write power. Experimental results show that the buffers implemented with optimized STT-RAM significantly boost the performance and energy efficiency of the DNN accelerators.
引用
收藏
页码:207 / 208
页数:2
相关论文
共 50 条
  • [1] Reliable and Energy Efficient MLC STT-RAM Buffer for CNN Accelerators
    Jasemi, Masoomeh
    Hessabi, Shaahin
    Bagherzadeh, Nader
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
  • [2] Reuse Detector: Improving the Management of STT-RAM SLLCs
    Rodriguez-Rodriguez, R.
    Diaz, J.
    Castro, F.
    Ibanez, P.
    Chaver, D.
    Vinals, V.
    Saez, J. C.
    Prieto-Matias, M.
    Pinuel, L.
    Monreal, T.
    Llaberia, J. M.
    COMPUTER JOURNAL, 2018, 61 (06): : 856 - 880
  • [3] Read Performance: The Newest Barrier in Scaled STT-RAM
    Zhang, Yaojun
    Li, Yong
    Sun, Zhenyu
    Li, Hai
    Chen, Yiran
    Jones, Alex K.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2015, 23 (06) : 1170 - 1174
  • [4] Multiple Attempt Write Strategy for Low Energy STT-RAM
    Park, Jaeyoung
    Orshansky, Michael
    2016 INTERNATIONAL GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI), 2016, : 163 - 168
  • [5] TriZone: A Design of MLC STT-RAM Cache for Combined Performance, Energy, and Reliability Optimizations
    Liu, Zihao
    Mao, Mengjie
    Liu, Tao
    Wang, Xue
    Wen, Wujie
    Chen, Yiran
    Li, Hai
    Wang, Danghui
    Pei, Yukui
    Ge, Ning
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (10) : 1985 - 1998
  • [6] Evaluating the performance and energy of STT-RAM caches for real-world wearable workloads
    Gajaria, Dhruv
    Adegbija, Tosiron
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 136 : 231 - 240
  • [7] Performance and Energy-Efficient Design of STT-RAM Last-Level Cache
    Hameed, Fazal
    Khan, Asif Ali
    Castrillon, Jeronimo
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (06) : 1059 - 1072
  • [8] Exploring the Use of Volatile STT-RAM for Energy Efficient Video Processing
    Zhao, Hengyu
    Sun, Hongbin
    Yang, Qiang
    Min, Tai
    Zheng, Nanning
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN ISQED 2016, 2016, : 81 - 87
  • [9] A Cache Consolidation Design of MLC STT-RAM for Energy Efficiency Enhancement on Cyber-Physical Systems
    Liang, Yu-Pei
    Chen, Shuo-Han
    Chang, Yuan-Hao
    Liu, Yun-Fei
    Wei, Hsin-Wen
    Shih, Wei-Kuan
    APPLIED COMPUTING REVIEW, 2021, 21 (01): : 37 - 49
  • [10] Volatile STT-RAM Scratchpad Design and Data Allocation for Low Energy
    Rodriguez, Gabriel
    Tourino, Juan
    Kandemir, Mahmut T.
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (04)