An In-Memory-Computing Binary Neural Network Architecture With In-Memory Batch Normalization

被引：0

作者：

Rege, Prathamesh Prashant ^{[1
]}

Yin, Ming ^{[2
]}

Parihar, Sanjay ^{[3
]}

Versaggi, Joseph ^{[2
]}

Nemawarkar, Shashank ^{[3
]}

机构：

[1] Northeastern Univ, Boston, MA 80305 USA

[2] GLOBALFOUNDRIES, Malta, NY 12020 USA

[3] GLOBALFOUNDRIES, Austin, TX 78735 USA

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Accuracy; Neural networks; Batch normalization; Convolutional neural networks; Training; Data models; Voltage control; In-memory computing; SRAM chips; binary neural network; edge device; in-memory computing; process variation; SRAM;

D O I：

10.1109/ACCESS.2024.3444481

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper describes an in-memory computing architecture that combines full-precision computation for the first and last layers of a neural network while employing binary weights and input activations for the intermediate layers. This unique approach presents an efficient and effective solution for optimizing neural-network computations, reducing complexity, and enhancing energy efficiency. Notably, multiple architecture-level optimization methods are developed to ensure the binary operations thereby eliminating the need for intricate "digital logic" components external to the memory units. One of the key contributions of this study is in-memory batch normalization, which is implemented to provide good accuracy for CIFAR10 classification applications. Despite the inherent challenges posed by the process variations, the proposed design demonstrated an accuracy of 78%. Furthermore, the SRAM layer in the architecture showed an energy efficiency of 1086 TOPS/W and throughput of 23 TOPS, all packed efficiently within an area of 60 TOPS/mm2. This novel in-memory computing architecture offers a promising solution for next-generation efficient and high-performance deep learning applications.

引用

页码：190889 / 190896

页数：8

共 50 条

[1] In-Memory Batch-Normalization for Resistive Memory based Binary Neural Network Hardware
Kim, Hyungjun
Kim, Yulhwa
Kim, Jae-Joon
24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 645 - 650
[2] A Skyrmion Racetrack Memory based Computing In-memory Architecture for Binary Neural Convolutional Network
Pan, Yu
Ouyang, Peng
Zhao, Yinglin
Yin, Shouyi
Zhang, Youguang
Wei, Shaojun
Zhao, Weisheng
GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 271 - 274
[3] A Unified Memory Network Architecture for In-Memory Computing in Commodity Servers
Zhan, Jia
Akgun, Itir
Zhao, Jishen
Davis, Al
Faraboschi, Paolo
Wang, Yuangang
Xie, Yuan
2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
[4] iMARS: An In-Memory-Computing Architecture for Recommendation Systems
Li, Mengyuan
Laguna, Ann Franchesca
Reis, Dayane
Yin, Xunzhao
Niemier, Michael
Hu, X. Sharon
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 463 - 468
[5] An In-Memory Computing SRAM Macro for Memory-Augmented Neural Network
Kim, Sunghoon
Lee, Wonjae
Kim, Sundo
Park, Sungjin
Jeon, Dongsuk
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1687 - 1691
[6] MOL-Based In-Memory Computing of Binary Neural Networks
Ali, Khaled Alhaj
Baghdadi, Amer
Dupraz, Elsa
Leonardon, Mathieu
Rizk, Mostafa
Diguet, Jean-Philippe
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (07) : 869 - 880
[7] SIMBA: A Skyrmionic In-Memory Binary Neural Network Accelerator
Miriyala, Venkata Pavan Kumar
Vishwanath, Kale Rahul
Fong, Xuanyao
IEEE TRANSACTIONS ON MAGNETICS, 2020, 56 (11)
[8] Time-Domain-Based Non-volatile In-Memory Computing Architecture Using FeFETs for Binary Neural Network
Sharma, Aditya
Dixit, Vatsal
Kushwaha, Dinesh
Chauhan, Nitanshu
Saxena, Vishal Kumar
Dasgupta, Sudeb
Bulusu, Anand
2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
[9] NNPIM: A Processing In-Memory Architecture for Neural Network Acceleration
Gupta, Saransh
Imani, Mohsen
Kaur, Harveen
Rosing, Tajana Simunic
IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (09) : 1325 - 1337
[10] In-Memory Computing Architecture for a Convolutional Neural Network Based on Spin Orbit Torque MRAM
Huang, Jun-Ying
Syu, Jing-Lin
Tsou, Yao-Tung
Kuo, Sy-Yen
Chang, Ching-Ray
ELECTRONICS, 2022, 11 (08)

← 1 2 3 4 5 →