Rotated Binary Neural Network

被引：0

作者：

Lin, Mingbao ^{[1
]}

Ji, Rongrong ^{[1
,2
,3
]}

Xu, Zihan ^{[1
]}

Zhang, Baochang ^{[4
]}

Wang, Yan ^{[5
]}

Wu, Yongjian ^{[6
]}

Huang, Feiyue ^{[6
]}

Lin, Chia-Wen ^{[7
]}

机构：

[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China

[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China

[3] Peng Cheng Lab, Shenzhen, Peoples R China

[4] Beihang Univ, Beijing, Peoples R China

[5] Pinterest, San Francisco, CA USA

[6] Tencent Youtu Lab, Shanghai, Peoples R China

[7] Natl Tsing Hua Univ, Hsinchu, Taiwan

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Binary Neural Network (BNN) shows its predominance in reducing the complexity of deep neural networks. However, it suffers severe performance degradation. One of the major impediments is the large quantization error between the full-precision weight vector and its binary vector. Previous works focus on compensating for the norm gap while leaving the angular bias hardly touched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we propose to rotate the full-precision weight vector to its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape the potential local optimum. Our rotation leads to around 50% weight flips which maximize the information gain. Finally, we propose a training-aware approximation of the sign function for the gradient backward. Experiments on CIFAR-10 and ImageNet demonstrate the superiorities of RBNN over many state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/RBNN.

引用

页数：12

共 50 条

[31] An Artificial Deep Neural Network for the Binary Classification of Network Traffic
Abdullah, Shubair A.
Al-Ashoor, Ahmed
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 402 - 408
[32] Controlling information capacity of binary neural network
Ignatov, Dmitry
Ignatov, Andrey
PATTERN RECOGNITION LETTERS, 2020, 138 : 276 - 281
[33] Binary Neural Network for Video Action Recognition
Han, Hongfeng
Lu, Zhiwu
Wen, Ji-Rong
MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 95 - 106
[34] A morphological neural network for binary classification problems
Araujo, Ricardo de A.
Oliveira, Adriano L. I.
Meira, Silvio
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 12 - 28
[35] Towards Accurate Binary Convolutional Neural Network
Lin, Xiaofan
Zhao, Cong
Pan, Wei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[36] Binary Complex Neural Network Acceleration on FPGA
Peng, Hongwu
Zhou, Shanglin
Weitze, Scott
Li, Jiaxin
Islam, Sahidul
Geng, Tong
Li, Ang
Zhang, Wei
Song, Minghu
Xie, Mimi
Liu, Hang
Ding, Caiwen
2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 85 - 92
[37] Binary Neural Network for Multispectral Image Classification
Jing, Weipeng
Zhang, Xu
Wang, Jian
Di, Donglin
Chen, Guangsheng
Song, Houbing
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[38] Quantum Inspired Binary Neural Network Algorithm
Patel, Om Prakash
Tiwari, Aruna
2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT), 2014, : 270 - 274
[39] Bump formation in a binary attractor neural network
Koroutchev, K
Korutcheva, E
PHYSICAL REVIEW E, 2006, 73 (02):
[40] Enabling Binary Neural Network Training on the Edge
Wang, Erwei
Davis, James J.
Moro, Daniele
Zielinski, Piotr
Lim, Jia Jie
Coelho, Claudionor
Chatterjee, Satrajit
Cheung, Peter Y. K.
Constantinides, George A.
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (06)

← 1 2 3 4 5 →