Live Demonstration for Input-Sparsity-Aware RRAM Processing-in-Memory Chip

被引:0
|
作者
Wang, Junjie [1 ]
Liu, Shuang [1 ]
Pan, Ruicheng [1 ]
Yan, Shiqin [1 ]
Liu, Yihe [1 ]
Liu, Yang [1 ]
机构
[1] Univ Elect Sci & Technol China, State Key Lab Elect Thin Films & Integrated Devic, Chengdu, Peoples R China
关键词
Computing-in-memory; Sparsity-aware readout; RRAM; Quantization-aware training;
D O I
10.1109/ISCAS58744.2024.10558412
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a live demonstration of an RRAM processing-in-memory (PIM) chip in which the input sparsity is exploited to reduce power consumption and increase the throughput of the PIM chip. An offline quantization-aware training (QAT) is employed to fine-tune models to be suitable for the 4-bit PIM chip. Post-QAT, the model exhibited accuracy of 90.08% on the test dataset. Interestingly, we found that the input sparsity of input activation is always over 90%. This high level of sparsity proves advantageous, contributing substantially to both throughput and energy efficiency of the PIM chip. This design yields a throughput of 410 Gops, which is 9 times higher than the design without input sparsity awareness.
引用
收藏
页数:2
相关论文
共 39 条
  • [1] Design and Implementation of a Hybrid, ADC/DAC-Free, Input-Sparsity-Aware, Precision Reconfigurable RRAM Processing-in-Memory Chip
    Wang J.
    Zhang T.
    Liu S.
    Liu Y.
    Wu Y.
    Hu S.
    Chen T.
    Liu Y.
    Yang Y.
    Huang R.
    IEEE Journal of Solid-State Circuits, 2024, 59 (02) : 595 - 604
  • [2] MULTIFUNCTIONAL RRAM CHIP WITH CONFIGURABILITY FOR SPARSITY-AWARE IN-MEMORY ISNG MACHINE
    Yue, Wenshuo
    Jing, Zhaokun
    Yan, Bonan
    Tao, Yaoyu
    Zhang, Teng
    Huang, Ru
    Yang, Yuchao
    CONFERENCE OF SCIENCE & TECHNOLOGY FOR INTEGRATED CIRCUITS, 2024 CSTIC, 2024,
  • [3] GraphSAR: A Sparsity-Aware Processing-in-Memory Architecture for Large-scale Graph Processing on ReRAMs
    Dai, Guohao
    Huang, Tianhao
    Wang, Yu
    Yang, Huazhong
    Wawrzynek, John
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 120 - 126
  • [4] Janus: A Flexible Processing-in-Memory Graph Accelerator Toward Sparsity
    Li, Xing
    Song, Zhuoran
    Ausavarungnirun, Rachata
    Liu, Xiao
    Liu, Xueyuan
    Zhang, Xuan
    Wang, Xuhang
    Ling, Jiayao
    Li, Gang
    Jing, Naifeng
    Liang, Xiaoyao
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (12) : 4813 - 4826
  • [5] RRAM based processing-in-memory for efficient intelligent vision tasks at the edge
    Kumar, Ashwani
    Bezugam, Sai Sukruth
    Memories - Materials, Devices, Circuits and Systems, 2024, 8
  • [6] Thermal-aware processing-in-memory instruction offloading
    Nai, Lifeng
    Hadidi, Ramyad
    Xiao, He
    Kim, Hyojong
    Sim, Jaewoong
    Kim, Hyesoon
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 130 : 193 - 207
  • [7] Z-PIM: An Energy-Efficient Sparsity Aware Processing-In-Memory Architecture with Fully-Variable Weight Precision
    Kim, Ji-Hoon
    Lee, Juhyoung
    Lee, Jinsu
    Yoo, Hoi-Jun
    Kim, Joo-Young
    2020 IEEE SYMPOSIUM ON VLSI CIRCUITS, 2020,
  • [8] uPIM: Performance-aware Online Learning Capable Processing-in-Memory
    Bavikadi, Sathwika
    Sutradhar, Purab Ranjan
    Ganguly, Amlan
    Dinakarrao, Sai Manoj Pudukotai
    2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [9] Gzippo: Highly-compact Processing-In-Memory Graph Accelerator Alleviating Sparsity and Redundancy
    Li, Xing
    Ausavarungnirun, Rachata
    Liu, Xiao
    Liu, Xueyuan
    Zhang, Xuan
    Lu, Heng
    Song, Zhuoran
    Jing, Naifeng
    Liang, Xiaoyao
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [10] Reliability-Aware Training and Performance Modeling for Processing-In-Memory Systems
    Sun, Hanbo
    Zhu, Zhenhua
    Cai, Yi
    Zeng, Shulin
    Qiu, Kaizhong
    Wang, Yu
    Yang, Huazhong
    2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2021, : 847 - 852