An In-Memory-Computing Binary Neural Network Architecture With In-Memory Batch Normalization

被引:0
|
作者
Rege, Prathamesh Prashant [1 ]
Yin, Ming [2 ]
Parihar, Sanjay [3 ]
Versaggi, Joseph [2 ]
Nemawarkar, Shashank [3 ]
机构
[1] Northeastern Univ, Boston, MA 80305 USA
[2] GLOBALFOUNDRIES, Malta, NY 12020 USA
[3] GLOBALFOUNDRIES, Austin, TX 78735 USA
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Neural networks; Batch normalization; Convolutional neural networks; Training; Data models; Voltage control; In-memory computing; SRAM chips; binary neural network; edge device; in-memory computing; process variation; SRAM;
D O I
10.1109/ACCESS.2024.3444481
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes an in-memory computing architecture that combines full-precision computation for the first and last layers of a neural network while employing binary weights and input activations for the intermediate layers. This unique approach presents an efficient and effective solution for optimizing neural-network computations, reducing complexity, and enhancing energy efficiency. Notably, multiple architecture-level optimization methods are developed to ensure the binary operations thereby eliminating the need for intricate "digital logic" components external to the memory units. One of the key contributions of this study is in-memory batch normalization, which is implemented to provide good accuracy for CIFAR10 classification applications. Despite the inherent challenges posed by the process variations, the proposed design demonstrated an accuracy of 78%. Furthermore, the SRAM layer in the architecture showed an energy efficiency of 1086 TOPS/W and throughput of 23 TOPS, all packed efficiently within an area of 60 TOPS/mm2. This novel in-memory computing architecture offers a promising solution for next-generation efficient and high-performance deep learning applications.
引用
收藏
页码:190889 / 190896
页数:8
相关论文
共 50 条
  • [1] In-Memory Batch-Normalization for Resistive Memory based Binary Neural Network Hardware
    Kim, Hyungjun
    Kim, Yulhwa
    Kim, Jae-Joon
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 645 - 650
  • [2] A Skyrmion Racetrack Memory based Computing In-memory Architecture for Binary Neural Convolutional Network
    Pan, Yu
    Ouyang, Peng
    Zhao, Yinglin
    Yin, Shouyi
    Zhang, Youguang
    Wei, Shaojun
    Zhao, Weisheng
    GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 271 - 274
  • [3] A Unified Memory Network Architecture for In-Memory Computing in Commodity Servers
    Zhan, Jia
    Akgun, Itir
    Zhao, Jishen
    Davis, Al
    Faraboschi, Paolo
    Wang, Yuangang
    Xie, Yuan
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [4] iMARS: An In-Memory-Computing Architecture for Recommendation Systems
    Li, Mengyuan
    Laguna, Ann Franchesca
    Reis, Dayane
    Yin, Xunzhao
    Niemier, Michael
    Hu, X. Sharon
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 463 - 468
  • [5] An In-Memory Computing SRAM Macro for Memory-Augmented Neural Network
    Kim, Sunghoon
    Lee, Wonjae
    Kim, Sundo
    Park, Sungjin
    Jeon, Dongsuk
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1687 - 1691
  • [6] MOL-Based In-Memory Computing of Binary Neural Networks
    Ali, Khaled Alhaj
    Baghdadi, Amer
    Dupraz, Elsa
    Leonardon, Mathieu
    Rizk, Mostafa
    Diguet, Jean-Philippe
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (07) : 869 - 880
  • [7] SIMBA: A Skyrmionic In-Memory Binary Neural Network Accelerator
    Miriyala, Venkata Pavan Kumar
    Vishwanath, Kale Rahul
    Fong, Xuanyao
    IEEE TRANSACTIONS ON MAGNETICS, 2020, 56 (11)
  • [8] Time-Domain-Based Non-volatile In-Memory Computing Architecture Using FeFETs for Binary Neural Network
    Sharma, Aditya
    Dixit, Vatsal
    Kushwaha, Dinesh
    Chauhan, Nitanshu
    Saxena, Vishal Kumar
    Dasgupta, Sudeb
    Bulusu, Anand
    2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
  • [9] NNPIM: A Processing In-Memory Architecture for Neural Network Acceleration
    Gupta, Saransh
    Imani, Mohsen
    Kaur, Harveen
    Rosing, Tajana Simunic
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (09) : 1325 - 1337
  • [10] In-Memory Computing Architecture for a Convolutional Neural Network Based on Spin Orbit Torque MRAM
    Huang, Jun-Ying
    Syu, Jing-Lin
    Tsou, Yao-Tung
    Kuo, Sy-Yen
    Chang, Ching-Ray
    ELECTRONICS, 2022, 11 (08)