Rotated Binary Neural Network

被引:0
|
作者
Lin, Mingbao [1 ]
Ji, Rongrong [1 ,2 ,3 ]
Xu, Zihan [1 ]
Zhang, Baochang [4 ]
Wang, Yan [5 ]
Wu, Yongjian [6 ]
Huang, Feiyue [6 ]
Lin, Chia-Wen [7 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China
[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Beihang Univ, Beijing, Peoples R China
[5] Pinterest, San Francisco, CA USA
[6] Tencent Youtu Lab, Shanghai, Peoples R China
[7] Natl Tsing Hua Univ, Hsinchu, Taiwan
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Binary Neural Network (BNN) shows its predominance in reducing the complexity of deep neural networks. However, it suffers severe performance degradation. One of the major impediments is the large quantization error between the full-precision weight vector and its binary vector. Previous works focus on compensating for the norm gap while leaving the angular bias hardly touched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we propose to rotate the full-precision weight vector to its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape the potential local optimum. Our rotation leads to around 50% weight flips which maximize the information gain. Finally, we propose a training-aware approximation of the sign function for the gradient backward. Experiments on CIFAR-10 and ImageNet demonstrate the superiorities of RBNN over many state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/RBNN.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] An Artificial Deep Neural Network for the Binary Classification of Network Traffic
    Abdullah, Shubair A.
    Al-Ashoor, Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 402 - 408
  • [32] Controlling information capacity of binary neural network
    Ignatov, Dmitry
    Ignatov, Andrey
    PATTERN RECOGNITION LETTERS, 2020, 138 : 276 - 281
  • [33] Binary Neural Network for Video Action Recognition
    Han, Hongfeng
    Lu, Zhiwu
    Wen, Ji-Rong
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 95 - 106
  • [34] A morphological neural network for binary classification problems
    Araujo, Ricardo de A.
    Oliveira, Adriano L. I.
    Meira, Silvio
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 12 - 28
  • [35] Towards Accurate Binary Convolutional Neural Network
    Lin, Xiaofan
    Zhao, Cong
    Pan, Wei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [36] Binary Complex Neural Network Acceleration on FPGA
    Peng, Hongwu
    Zhou, Shanglin
    Weitze, Scott
    Li, Jiaxin
    Islam, Sahidul
    Geng, Tong
    Li, Ang
    Zhang, Wei
    Song, Minghu
    Xie, Mimi
    Liu, Hang
    Ding, Caiwen
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 85 - 92
  • [37] Binary Neural Network for Multispectral Image Classification
    Jing, Weipeng
    Zhang, Xu
    Wang, Jian
    Di, Donglin
    Chen, Guangsheng
    Song, Houbing
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [38] Quantum Inspired Binary Neural Network Algorithm
    Patel, Om Prakash
    Tiwari, Aruna
    2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT), 2014, : 270 - 274
  • [39] Bump formation in a binary attractor neural network
    Koroutchev, K
    Korutcheva, E
    PHYSICAL REVIEW E, 2006, 73 (02):
  • [40] Enabling Binary Neural Network Training on the Edge
    Wang, Erwei
    Davis, James J.
    Moro, Daniele
    Zielinski, Piotr
    Lim, Jia Jie
    Coelho, Claudionor
    Chatterjee, Satrajit
    Cheung, Peter Y. K.
    Constantinides, George A.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (06)