Architectural Design of 3D NAND Flash based Compute-in-Memory for Inference Engine

被引:5
|
作者
Shim, Wonbo [1 ]
Jiang, Hongwu [1 ]
Peng, Xiaochen [1 ]
Yu, Shimeng [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
Deep neural network; hardware accelerator; compute-in-memory; 3D NAND Flash; MACRO; CMOS;
D O I
10.1145/3422575.3422779
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
3D NAND Flash memory has been proposed as an attractive candidate of inference engine for deep neural network (DNN) owing to its ultra-high density and commercially matured fabrication technology. However, the peripheral circuits require to be modified to enable compute-in-memory (CIM) and the chip architectures need to be redesigned for an optimized dataflow. In this work, we present a design of 3D NAND-CIM accelerator based on the macro parameters from an industry-grade prototype chip. The DNN inference performance is evaluated using the DNN+ NeuroSim framework. To exploit the ultra-high density of 3D NAND Flash, both inputs and weights duplication strategies are introduced to improve the throughput. The benchmarking on a variety of VGG and ResNet networks was performed across technological candidates for CIM including SRAM, RRAM and 3D NAND. Compared to similar designs with SRAM or RRAM, the result shows that 3D NAND based CIM design can achieve not only 17-24% chip size but also 1.92.7 times more competitive energy efficiency for 8-bit precision inference.
引用
收藏
页码:77 / 85
页数:9
相关论文
共 50 条
  • [31] MAC-ECC: In-Situ Error Correction and Its Design Methodology for Reliable NVM-Based Compute-in-Memory Inference Engine
    Li, Wantong
    Read, James
    Jiang, Hongwu
    Yu, Shimeng
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (04) : 835 - 845
  • [32] A Runtime Reconfigurable Design of Compute-in-Memory based Hardware Accelerator
    Lu, Anni
    Peng, Xiaochen
    Luo, Yandong
    Huang, Shanshi
    Yu, Shimeng
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 932 - 937
  • [33] 3D-NAND flash memory based neuromorphic computing
    Chen Yang-Yang
    He Yu-Hui
    Miao Xiang-Shui
    Yang Duo-Hong
    ACTA PHYSICA SINICA, 2022, 71 (21)
  • [34] Monolithic 3D+-IC based Reconfigurable Compute-in-Memory SRAM Macro
    Srinivasa, Srivatsa
    Tu, Yung-Ning
    Si, Xin
    Xue, Cheng-Xin
    Lee, Chun-Ying
    Hsueh, Fu-Kuo
    Shen, Chane-Hone
    Shieh, Jia-Min
    Yeh, Wen-Kuan
    Ramanathan, Akshay Krishna
    Ho, Mon-Shu
    Sampson, Jack
    Chang, Meng-Fan
    Narayanan, Vijaykrishnan
    2019 SYMPOSIUM ON VLSI TECHNOLOGY, 2019, : T32 - T33
  • [35] Impact of etch angles on cell characteristics in 3D NAND flash memory
    Oh, Young-Taek
    Kim, Kyu-Beom
    Shin, Sang-Hoon
    Sim, Hahng
    Toan, Nguyen Van
    Ono, Takahito
    Song, Yun-Heub
    MICROELECTRONICS JOURNAL, 2018, 79 : 1 - 6
  • [36] Impact of plasma induced damage on the fabrication of 3D NAND flash memory
    Reiter, Tobias
    Klemenschits, Xaver
    Filipovic, Lado
    SOLID-STATE ELECTRONICS, 2022, 192
  • [37] Mechanical stress in a tapered channel hole of 3D NAND flash memory
    Yoon, DongGwan
    Sim, JaeMin
    Song, YunHeub
    MICROELECTRONICS RELIABILITY, 2023, 143
  • [38] Impact of plasma induced damage on the fabrication of 3D NAND flash memory
    Reiter, Tobias
    Klemenschits, Xaver
    Filipovic, Lado
    Solid-State Electronics, 2022, 192
  • [39] Characterization of Inter-Cell Interference in 3D NAND Flash Memory
    Park, Suk Kwang
    Moon, Jaekyun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2021, 68 (03) : 1183 - 1192
  • [40] Accelerating Sub-Block Erase in 3D NAND Flash Memory
    Gong, Hongbin
    Shen, Zhirong
    Shu, Jiwu
    2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 228 - 235