System-Technology Codesign of 3-D NAND Flash-Based Compute-in-Memory Inference Engine

被引:8
|
作者
Shim, Wonbo [1 ]
Yu, Shimeng [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
3-D NAND; compute-in-memory (CIM); deep neural network (DNN); hardware accelerator; MACRO;
D O I
10.1109/JXCDC.2021.3093772
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Due to its ultrahigh density and commercially matured fabrication technology, 3-D NAND flash memory has been proposed as an attractive candidate of inference engine for deep neural network (DNN) workloads. However, the peripheral circuits require to be modified with conventional 3-D NAND flash to enable compute-in-memory (CIM), and the chip architectures need to be redesigned for an optimized dataflow. In this work, we present a design of 3-D NAND-CIM accelerator based on the macro parameters from an industry-grade prototype chip. The DNN inference performance is evaluated using the DNN+NeuroSim framework. To exploit the ultrahigh density of 3-D NAND flash, both inputs and weights mapping strategies are introduced to improve the throughput. The benchmarking on the VGG network was performed across the technological candidates for CIM, including SRAM, resistive random access memory (RRAM), and 3-D NAND. Compared to the similar designs with SRAM or RRAM, the result shows that the 3-D NAND-based CIM design can achieve not only 17%-24% chip size but also 1.9-2.7 times more competitive energy efficiency for 8-bit precision inference. Inference accuracy drop induced by 3-D NAND string current drift and variation is also investigated. No accuracy degradation by current variation was observed with the proposed input mapping scheme, while accuracy drops sensitive to the current drift, which implies that some compensation schemes are needed to maintain the inference accuracy.
引用
收藏
页码:61 / 69
页数:9
相关论文
共 50 条
  • [21] Modeling of program Vth distribution for 3-D TLC NAND flash memory
    Wang, Kunliang
    Du, Gang
    Lun, Zhiyuan
    Chen, Wangyong
    Liu, Xiaoyan
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (04)
  • [22] Towards Improving Ionizing Radiation Tolerance of 3-D NAND Flash Memory
    Ray, Biswajit
    Buddhanoy, Matchima
    Kumar, Mondol Anik
    2023 IEEE INTERNATIONAL MEMORY WORKSHOP, IMW, 2023, : 109 - 112
  • [23] System-Technology Co-Optimization for Dense Edge Architectures Using 3-D Integration and Nonvolatile Memory
    Rocha, Leandro M. Giacomini
    Naeim, Mohamed
    Paim, Guilherme
    Brunion, Moritz
    Venugopal, Priya
    Milojevic, Dragomir
    Myers, James
    Badaroglu, Mustafa
    Verhelst, Marian
    Ryckaert, Julien
    Biswas, Dwaipayan
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2024, 10 : 125 - 134
  • [24] Interleaved LDPC Decoding Scheme Improves 3-D TLC NAND Flash Memory System Performance
    Yu, Xiaolei
    He, Jing
    Zhang, Bo
    Wang, Xianliang
    Li, Qianhui
    Wang, Qi
    Huo, Zongliang
    Ye, Tianchun
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (11) : 4191 - 4204
  • [25] Technological Design of 3D NAND-Based Compute-in-Memory Architecture for GB-Scale Deep Neural Network
    Shim, Wonbo
    Yu, Shimeng
    IEEE ELECTRON DEVICE LETTERS, 2021, 42 (02) : 160 - 163
  • [26] Advanced DC-SF Cell Technology for 3-D NAND Flash
    Aritome, Seiichi
    Noh, Yoohyun
    Yoo, Hyunseung
    Choi, Eun Seok
    Joo, Han Soo
    Ahn, Youngsoo
    Han, Byeongil
    Chung, Sungjae
    Shim, Keonsoo
    Lee, Keunwoo
    Kwak, Sanghyon
    Shin, Sungchul
    Choi, Iksoo
    Nam, Sanghyuk
    Cho, Gyuseog
    Sheen, Dongsun
    Pyi, Seungho
    Choi, Jongmoo
    Park, Sungkye
    Kim, Jinwoong
    Lee, Seokkiu
    Hong, Sungjoo
    Park, Sungwook
    Kikkawa, Takamaro
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2013, 60 (04) : 1327 - 1333
  • [27] Extraction of Mobility in 3-D NAND Flash Memory with Poly-Si based Macaroni Structure
    Jo, Hyungjun
    Kim, Juhyun
    Kim, Minsoo
    Shin, Hyungcheol
    2020 IEEE ELECTRON DEVICES TECHNOLOGY AND MANUFACTURING CONFERENCE (EDTM 2020), 2020,
  • [28] 3D-NAND flash memory based neuromorphic computing
    Chen Yang-Yang
    He Yu-Hui
    Miao Xiang-Shui
    Yang Duo-Hong
    ACTA PHYSICA SINICA, 2022, 71 (21)
  • [29] Data-Aware 3-D TLC NAND Flash Memory Reliability Optimization
    Salamin, Sami
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2024, 71 (10) : 5962 - 5974
  • [30] Monolithic 3D+-IC based Reconfigurable Compute-in-Memory SRAM Macro
    Srinivasa, Srivatsa
    Tu, Yung-Ning
    Si, Xin
    Xue, Cheng-Xin
    Lee, Chun-Ying
    Hsueh, Fu-Kuo
    Shen, Chane-Hone
    Shieh, Jia-Min
    Yeh, Wen-Kuan
    Ramanathan, Akshay Krishna
    Ho, Mon-Shu
    Sampson, Jack
    Chang, Meng-Fan
    Narayanan, Vijaykrishnan
    2019 SYMPOSIUM ON VLSI TECHNOLOGY, 2019, : T32 - T33