Live Demonstration for Input-Sparsity-Aware RRAM Processing-in-Memory Chip

被引:0
|
作者
Wang, Junjie [1 ]
Liu, Shuang [1 ]
Pan, Ruicheng [1 ]
Yan, Shiqin [1 ]
Liu, Yihe [1 ]
Liu, Yang [1 ]
机构
[1] Univ Elect Sci & Technol China, State Key Lab Elect Thin Films & Integrated Devic, Chengdu, Peoples R China
关键词
Computing-in-memory; Sparsity-aware readout; RRAM; Quantization-aware training;
D O I
10.1109/ISCAS58744.2024.10558412
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a live demonstration of an RRAM processing-in-memory (PIM) chip in which the input sparsity is exploited to reduce power consumption and increase the throughput of the PIM chip. An offline quantization-aware training (QAT) is employed to fine-tune models to be suitable for the 4-bit PIM chip. Post-QAT, the model exhibited accuracy of 90.08% on the test dataset. Interestingly, we found that the input sparsity of input activation is always over 90%. This high level of sparsity proves advantageous, contributing substantially to both throughput and energy efficiency of the PIM chip. This design yields a throughput of 410 Gops, which is 9 times higher than the design without input sparsity awareness.
引用
收藏
页数:2
相关论文
共 39 条
  • [31] Parasitic-Aware Modeling and Neural Network Training Scheme for Energy-Efficient Processing-in-Memory With Resistive Crossbar Array
    Cao, Tiancheng
    Liu, Chen
    Gao, Yuan
    Goh, Wang Ling
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 436 - 444
  • [32] POLAR: Performance-aware On-device Learning Capable Programmable Processing-in-Memory Architecture for Low-Power ML Applications
    Bavikadi, Sathwika
    Sutradhar, Purab Ranjan
    Indovina, Mark A.
    Ganguly, Amlan
    Dinakarrao, Sai Manoj Pudukotai
    2022 25TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2022, : 889 - 898
  • [33] AERIS: Area/Energy-Efficient 1T2R ReRAM Based Processing-in-Memory Neural Network System-on-a-Chip
    Yue, Jinshan
    Liu, Yongpan
    Su, Fang
    Li, Shuangchen
    Yuan, Zhe
    Wang, Zhibo
    Sun, Wenyu
    Li, Xueqing
    Yang, Huazhong
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 146 - 151
  • [34] Live Demonstration: Real-Time Image Classification on a Neuromorphic Computing System with Zero Off-chip Memory Access
    Shin, Taehwan
    Kang, Yongshin
    Yang, Seungho
    Kim, Seban
    Chung, Jaeyong
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 449 - 449
  • [35] A 40-nm MLC-RRAM Compute-in-Memory Macro With Sparsity Control, On-Chip Write-Verify, and Temperature-Independent ADC References
    Li, Wantong
    Sun, Xiaoyu
    Huang, Shanshi
    Jiang, Hongwu
    Yu, Shimeng
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2022, 57 (09) : 2868 - 2877
  • [36] An Area- and Energy-Efficient Spiking Neural Network With Spike-Time-Dependent Plasticity Realized With SRAM Processing-in-Memory Macro and On-Chip Unsupervised Learning
    Liu, Shuang
    Wang, J. J.
    Zhou, J. T.
    Hu, S. G.
    Yu, Q.
    Chen, T. P.
    Liu, Y.
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2023, 17 (01) : 92 - 104
  • [37] A 2 x 30k-Spin Multi-Chip Scalable CMOS Annealing Processor Based on a Processing-in-Memory Approach for Solving Large-Scale Combinatorial Optimization Problems
    Takemoto, Takashi
    Hayashi, Masato
    Yoshimura, Chihiro
    Yamaoka, Masanao
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2020, 55 (01) : 145 - 156
  • [38] DPIM: A 19.36 TOPS/W 2T1C eDRAM Transformer-in-Memory Chip with Sparsity-Aware Quantization and Heterogeneous Dense-Sparse Core
    Kim, Donghyuk
    Kim, Jae Young
    Cho, Hyunjun
    Yoo, Seungjae
    Lee, Sukjin
    Yune, Sungwoong
    Jeong, Hoichang
    Park, Keonhee
    Lee, Ki-Soo
    Lee, Jongchan
    Han, Chanheum
    Koo, Gunmo
    Han, Yuli
    Kim, Jaejin
    Kim, Jaemin
    Lee, Kyuho
    Cha, Joo-Hyung
    Cho, Kunhee
    Kim, Joo-Young
    2024 50TH IEEE EUROPEAN SOLID-STATE ELECTRONICS RESEARCH CONFERENCE, ESSERC 2024, 2024, : 141 - 144
  • [39] An 8b-Precision 8-Mb STT-MRAM Near-Memory-Compute Macro Using Weight-Feature and Input-Sparsity Aware Schemes for Energy-Efficient Edge AI Devices
    You, De-Qi
    Chiu, Yen-Cheng
    Khwa, Win-San
    Li, Chung-Yuan
    Hsieh, Fang-Ling
    Chien, Yu-An
    Lo, Chung-Chuan
    Liu, Ren-Shuo
    Hsieh, Chi-Cheng
    Tang, Kea-Tiong
    Chih, Yu-Der
    Chang, Tsung-Yung Jonathan
    Chang, Meng-Fan
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 59 (01) : 219 - 230