Improving Performance and Energy-efficiency of DNN Accelerators with STT-RAM Buffers

Cited by: 0

Authors:

Byeon, Gwangeun [1]

Kim, Seongwook [1]

Hong, Seokin [1]

Affiliations:

[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea

Source:

2023 20th International SoC Design Conference (ISOCC) | 2023

Keywords:

Deep Learning Accelerator; STT-MRAM

DOI:

10.1109/ISOCC59558.2023.10396473

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

DNN inference on mobile and edge devices is challenging because of its high computational and storage demands. Various DNN accelerators have been proposed to speed up inference on these devices. In these accelerators, the on-chip buffers occupy a significant portion of the chip area because they must be large enough to minimize off-chip memory accesses and are usually implemented with SRAM cells. This paper presents a DNN accelerator that uses Spin-Transfer Torque RAM (STT-RAM) to build large buffers within a small area budget. By exploiting the access patterns of activations and weights in DNN inference, we optimize the STT-RAM for short write latency and low write power. Experimental results show that buffers implemented with the optimized STT-RAM significantly improve the performance and energy efficiency of DNN accelerators.


Pages: 207-208

Page count: 2

