Rotated Binary Neural Network

Cited by: 0

Authors:

Lin, Mingbao [1]

Ji, Rongrong [1,2,3]

Xu, Zihan [1]

Zhang, Baochang [4]

Wang, Yan [5]

Wu, Yongjian [6]

Huang, Feiyue [6]

Lin, Chia-Wen [7]

Affiliations:

[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China

[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China

[3] Peng Cheng Lab, Shenzhen, Peoples R China

[4] Beihang Univ, Beijing, Peoples R China

[5] Pinterest, San Francisco, CA USA

[6] Tencent Youtu Lab, Shanghai, Peoples R China

[7] Natl Tsing Hua Univ, Hsinchu, Taiwan

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33

Funding:

National Natural Science Foundation of China

DOI:

Not available

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Binary Neural Networks (BNNs) are highly effective at reducing the complexity of deep neural networks, but they suffer severe performance degradation. One major impediment is the large quantization error between a full-precision weight vector and its binarized counterpart. Previous works focus on compensating for the norm gap while leaving the angular bias largely untouched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we rotate the full-precision weight vector toward its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape potential local optima. Our rotation leads to around 50% weight flips, which maximizes the information gain. Finally, we propose a training-aware approximation of the sign function for gradient backpropagation. Experiments on CIFAR-10 and ImageNet demonstrate the superiority of RBNN over many state-of-the-art methods. Our source code, experimental settings, training logs, and binary models are available at https://github.com/lmbxmu/RBNN.
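The role of angular bias described in the abstract can be illustrated with a small numeric sketch. This is not the authors' implementation: the helper names (`angular_bias`, `quantization_error`, `rotate_2d`), the least-squares scaling factor, and the two-dimensional toy vector are all assumptions chosen for illustration, and the single planar rotation stands in for the learned bi-rotation that the paper proposes. The sketch only shows that, once the scale is chosen optimally, the remaining quantization error depends on the angle between a weight vector and its sign vector, and that rotating the weight vector onto its sign vector removes that error.

```python
import math

def sign(x):
    # Binarize a scalar to +1 or -1 (zero is mapped to +1 by assumption).
    return 1.0 if x >= 0 else -1.0

def angular_bias(w):
    """Angle in radians between w and its binarized version sign(w)."""
    b = [sign(x) for x in w]
    dot = sum(x * y for x, y in zip(w, b))  # equals the L1 norm of w
    norm_w = math.sqrt(sum(x * x for x in w))
    norm_b = math.sqrt(len(w))
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, dot / (norm_w * norm_b)))
    return math.acos(cos_theta)

def quantization_error(w):
    """Squared error ||w - alpha * sign(w)||^2 with the least-squares scale alpha."""
    b = [sign(x) for x in w]
    alpha = sum(abs(x) for x in w) / len(w)  # minimizer of the squared error
    return sum((x - alpha * y) ** 2 for x, y in zip(w, b))

def rotate_2d(w, phi):
    """Rotate a 2-D vector counter-clockwise by phi radians."""
    c, s = math.cos(phi), math.sin(phi)
    return [c * w[0] - s * w[1], s * w[0] + c * w[1]]

# Toy weight vector that points far away from its sign vector (1, 1).
w = [1.0, 0.1]
theta = angular_bias(w)

# With the optimal scale, the error reduces to ||w||^2 * sin^2(theta).
norm_sq = sum(x * x for x in w)
print(quantization_error(w), norm_sq * math.sin(theta) ** 2)

# Rotating w onto the direction of sign(w) removes the angular bias.
w_rot = rotate_2d(w, theta)
print(angular_bias(w_rot), quantization_error(w_rot))
```

In this toy case the norm is preserved by the rotation, so the drop in error comes entirely from the angle, which is the quantity the abstract says earlier norm-compensation methods leave unaddressed.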

Pages: 12