A 3D MCAM architecture based on flash memory enabling binary neural network computing for edge AI

被引:0
|
作者
Maoying BAI [1 ]
Shuhao WU [1 ]
Hai WANG [1 ]
Hua WANG [1 ]
Yang FENG [1 ]
Yueran QI [1 ]
Chengcheng WANG [1 ]
Zheng CHAI [2 ]
Tai MIN [2 ]
Jixuan WU [1 ]
Xuepeng ZHAN [1 ]
Jiezhi CHEN [1 ]
机构
[1] School of Information Science and Engineering, Shandong University
[2] Center for Spintronic and Quantum Systems, State Key Laboratory for Mechanical Behavior of Materials,School of Materials Science and Engineering, Xi'an Jiaotong
关键词
D O I
暂无
中图分类号
TP333 [存贮器]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The in-memory computing(IMC) architecture implemented by non-volatile memory units shows great possibilities to break the traditional von Neumann bottleneck. In this paper, a 3D IMC architecture is proposed whose unit is based on a multi-bit content-addressable memory(MCAM). The MCAM unit is comprised of two 65 nm flash memory and two transistors(2Flash2T), which is reconfigurable and multifunctional for both data write/search and XNOR logic operation. Moreover, the MCAM array can also support the population count(POPCOUNT) operation, which can be beneficial for the training and inference process in binary neural network(BNN) computing. Based on the well-known MNIST dataset, the proposed 3D MCAM architecture shows a 98.63% recognition accuracy and a 300% noise-tolerant performance without significant accuracy deterioration. Our findings can provide the potential for developing highly energy-efficient BNN computing for complex artificial intelligence(AI) tasks based on flash-based MCAM units.
引用
收藏
页码:302 / 310
页数:9
相关论文
共 50 条
  • [31] Memristor-based Deep Spiking Neural Network with a Computing-In-Memory Architecture
    Nowshin, Fabiha
    Yi, Yang
    PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 163 - 168
  • [32] Efficient binary 3D convolutional neural network and hardware accelerator
    Li, Guoqing
    Zhang, Meng
    Zhang, Qianru
    Lin, Zhijian
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (01) : 61 - 71
  • [33] Efficient binary 3D convolutional neural network and hardware accelerator
    Guoqing Li
    Meng Zhang
    Qianru Zhang
    Zhijian Lin
    Journal of Real-Time Image Processing, 2022, 19 : 61 - 71
  • [34] A Separate 3D Convolutional Neural Network Architecture for 3D Medical Image Semantic Segmentation
    Dong, Shidu
    Liu, Zhi
    Wang, Huaqiu
    Zhang, Yihao
    Cui, Shaoguo
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (08) : 1705 - 1716
  • [35] Unsupervised Learning in Winner-Takes-All Neural Network Based on 3D NAND Flash
    Zhou, Wen
    Jin, Lei
    Jia, Xinlei
    Wang, Tingze
    Xu, Pengyu
    Zhang, An
    Huo, Zongliang
    IEEE ELECTRON DEVICE LETTERS, 2022, 43 (03) : 374 - 377
  • [36] Enabling sub-blocks Erase management to boost the performance of 3D NAND flash memory
    Chen, Tseng-Yi
    Chang, Yuan-Hao
    Ho, Chien-Chung
    Chen, Shuo-Han
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [37] Technological Design of 3D NAND-Based Compute-in-Memory Architecture for GB-Scale Deep Neural Network
    Shim, Wonbo
    Yu, Shimeng
    IEEE ELECTRON DEVICE LETTERS, 2021, 42 (02) : 160 - 163
  • [38] Mitigation of Accuracy Degradation in 3D Flash Memory based Approximate Nearest Neighbor Search with Binary Tree Balanced Soft Clustering for Retrieval-augmented AI
    Sasaki, Shinichi
    Aiba, Yuta
    Komano, Yusuke
    Iizuka, Takahiko
    Fujimatsu, Motohiko
    Kawasumi, Atsushi
    Miyashita, Daisuke
    Deguchi, Jun
    Maeda, Takashi
    Miyano, Shinji
    Maruyama, Tooru
    2024 22ND IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS 2024, 2024, : 238 - 242
  • [39] Introduction of Non-Volatile Computing In Memory (nvCIM) by 3D NAND Flash for Inference Accelerator of Deep Neural Network (DNN) and the Read Disturb Reliability Evaluation
    Lue, Hang-Ting
    Hsu, Po-Kai
    Wang, Keh-Chung
    Lu, Chih-Yuan
    2020 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2020,
  • [40] In-Memory Computing Architecture for a Convolutional Neural Network Based on Spin Orbit Torque MRAM
    Huang, Jun-Ying
    Syu, Jing-Lin
    Tsou, Yao-Tung
    Kuo, Sy-Yen
    Chang, Ching-Ray
    ELECTRONICS, 2022, 11 (08)