Live Demonstration for Input-Sparsity-Aware RRAM Processing-in-Memory Chip

被引:0
|
作者
Wang, Junjie [1 ]
Liu, Shuang [1 ]
Pan, Ruicheng [1 ]
Yan, Shiqin [1 ]
Liu, Yihe [1 ]
Liu, Yang [1 ]
机构
[1] Univ Elect Sci & Technol China, State Key Lab Elect Thin Films & Integrated Devic, Chengdu, Peoples R China
关键词
Computing-in-memory; Sparsity-aware readout; RRAM; Quantization-aware training;
D O I
10.1109/ISCAS58744.2024.10558412
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a live demonstration of an RRAM processing-in-memory (PIM) chip in which the input sparsity is exploited to reduce power consumption and increase the throughput of the PIM chip. An offline quantization-aware training (QAT) is employed to fine-tune models to be suitable for the 4-bit PIM chip. Post-QAT, the model exhibited accuracy of 90.08% on the test dataset. Interestingly, we found that the input sparsity of input activation is always over 90%. This high level of sparsity proves advantageous, contributing substantially to both throughput and energy efficiency of the PIM chip. This design yields a throughput of 410 Gops, which is 9 times higher than the design without input sparsity awareness.
引用
收藏
页数:2
相关论文
共 39 条
  • [21] Z-PIM: A Sparsity-Aware Processing-in-Memory Architecture With Fully Variable Weight Bit-Precision for Energy-Efficient Deep Neural Networks
    Kim, Ji-Hoon
    Lee, Juhyoung
    Lee, Jinsu
    Heo, Jaehoon
    Kim, Joo-Young
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2021, 56 (04) : 1093 - 1104
  • [22] PIMCloud: QoS-Aware Resource Management of Latency-Critical Applications in Clouds with Processing-in-Memory
    Chen, Shuang
    Jiang, Yi
    Delimitrou, Christina
    Martinez, Jose F.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 1086 - 1099
  • [23] PIM-Enabled Instructions: A Low-Overhead, Locality-Aware Processing-in-Memory Architecture
    Ahn, Junwhan
    Yoo, Sungjoo
    Mutlu, Onur
    Choi, Kiyoung
    2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, : 336 - 348
  • [24] RADAR: A Skew-Resistant and Hotness-Aware Ordered Index Design for Processing-in-Memory Systems
    Hua, Yifan
    Zheng, Shengan
    Kong, Weihan
    Zhou, Cong
    Huang, Kaixin
    Ma, Ruoyan
    Huang, Linpeng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (09) : 1598 - 1614
  • [25] Task Parallelism-Aware Deep Neural Network Scheduling on Multiple Hybrid Memory Cube-Based Processing-in-Memory
    Lee, Young Sik
    Han, Tae Hee
    IEEE ACCESS, 2021, 9 : 68561 - 68572
  • [26] Implementation of a 32-bit RISC processor for the data-intensive architecture processing-in-memory chip
    Draper, J
    Sondeen, J
    Mediratta, S
    Kim, I
    IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, PROCEEDINGS, 2002, : 163 - 172
  • [27] Off-Chip Congestion Management for GPU-based Non-Uniform Processing-in-Memory Networks
    Punniyamurthy, Kishore
    Gerstlauer, Andreas
    2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020), 2020, : 282 - 289
  • [28] Input/Output Processing using On-Chip Memory for Cyber Physical System
    Hoang Nguyen
    Koc, Hakduran
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [29] A 462GOPs/J RRAM-Based Nonvolatile Intelligent Processor for Energy Harvesting IoE System Featuring Nonvolatile Logics and Processing-In-Memory
    Su, Fang
    Chen, Wei-Hao
    Xia, Lixue
    Lo, Chieh-Pu
    Tang, Tianqi
    Wang, Zhibo
    Hsu, Kuo-Hsiang
    Cheng, Ming
    Li, Jun-Yi
    Xie, Yuan
    Wang, Yu
    Chang, Meng-Fan
    Yang, Huazhong
    Liu, Yongpan
    2017 SYMPOSIUM ON VLSI CIRCUITS, 2017, : C260 - C261
  • [30] A 462GOPs/J RRAM-Based Nonvolatile Intelligent Processor for Energy Harvesting IoE System Featuring Nonvolatile Logics and Processing-In-Memory
    Su, Fang
    Chen, Wei-Hao
    Xia, Lixue
    Lo, Chieh-Pu
    Tang, Tianqi
    Wang, Zhibo
    Hsu, Kuo-Hsiang
    Cheng, Ming
    Li, Jun-Yi
    Xie, Yuan
    Wang, Yu
    Chang, Meng-Fan
    Yang, Huazhong
    Liu, Yongpan
    2017 SYMPOSIUM ON VLSI TECHNOLOGY, 2017, : C260 - C261