A 3D MCAM architecture based on flash memory enabling binary neural network computing for edge AI

被引:0
|
作者
Maoying BAI [1 ]
Shuhao WU [1 ]
Hai WANG [1 ]
Hua WANG [1 ]
Yang FENG [1 ]
Yueran QI [1 ]
Chengcheng WANG [1 ]
Zheng CHAI [2 ]
Tai MIN [2 ]
Jixuan WU [1 ]
Xuepeng ZHAN [1 ]
Jiezhi CHEN [1 ]
机构
[1] School of Information Science and Engineering, Shandong University
[2] Center for Spintronic and Quantum Systems, State Key Laboratory for Mechanical Behavior of Materials,School of Materials Science and Engineering, Xi'an Jiaotong
关键词
D O I
暂无
中图分类号
TP333 [存贮器]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The in-memory computing(IMC) architecture implemented by non-volatile memory units shows great possibilities to break the traditional von Neumann bottleneck. In this paper, a 3D IMC architecture is proposed whose unit is based on a multi-bit content-addressable memory(MCAM). The MCAM unit is comprised of two 65 nm flash memory and two transistors(2Flash2T), which is reconfigurable and multifunctional for both data write/search and XNOR logic operation. Moreover, the MCAM array can also support the population count(POPCOUNT) operation, which can be beneficial for the training and inference process in binary neural network(BNN) computing. Based on the well-known MNIST dataset, the proposed 3D MCAM architecture shows a 98.63% recognition accuracy and a 300% noise-tolerant performance without significant accuracy deterioration. Our findings can provide the potential for developing highly energy-efficient BNN computing for complex artificial intelligence(AI) tasks based on flash-based MCAM units.
引用
收藏
页码:302 / 310
页数:9
相关论文
共 50 条
  • [41] An ADC-Less RRAM-Based Computing-in-Memory Macro With Binary CNN for Efficient Edge AI
    Li, Yi
    Chen, Jia
    Wang, Linfang
    Zhang, Woyu
    Guo, Zeyu
    Wang, Jun
    Han, Yongkang
    Li, Zhi
    Wang, Fei
    Dou, Chunmeng
    Xu, Xiaoxin
    Yang, Jianguo
    Wang, Zhongrui
    Shang, Dashan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (06) : 1871 - 1875
  • [42] 3D Model Classification Based on Neural Architecture Search
    Zhou, Peng
    Yang, Jun
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (05): : 722 - 733
  • [43] Hyperspectral Compute-In-Memory Architecture for 3D Opto-Electronic Computing
    Suh, Myoung-Gyun
    2024 IEEE PHOTONICS SOCIETY SUMMER TOPICALS MEETING SERIES, SUM 2024, 2024,
  • [44] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    OPERATING SYSTEMS REVIEW, 2017, 51 (02) : 751 - 764
  • [45] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    ACM SIGPLAN NOTICES, 2017, 52 (04) : 751 - 764
  • [46] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    TWENTY-SECOND INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXII), 2017, : 751 - 764
  • [47] TETRIS: Scalable and efficient neural network acceleration with 3D memory
    Gao M.
    Pu J.
    Yang X.
    Horowitz M.
    Kozyrakis C.
    1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (52): : 751 - 764
  • [48] Non-linear 3D rendering workload prediction based on a combined fuzzy-neural network architecture for grid computing applications
    Doulamis, N
    Doulamis, A
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 1069 - 1072
  • [49] Binocular 3D reconstruction based on neural network
    Lin, MX
    Zhao, YR
    Guan, ZG
    Ding, FH
    Xu, QX
    Wang, XH
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 765 - 771
  • [50] 3D reconstruction approach based on neural network
    Hu, Haifeng
    Yang, Zhi
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 630 - +