Rotated Binary Neural Network

Cited by: 0
Authors
Lin, Mingbao [1]
Ji, Rongrong [1, 2, 3]
Xu, Zihan [1]
Zhang, Baochang [4]
Wang, Yan [5]
Wu, Yongjian [6]
Huang, Feiyue [6]
Lin, Chia-Wen [7]
Affiliations
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China
[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Beihang Univ, Beijing, Peoples R China
[5] Pinterest, San Francisco, CA USA
[6] Tencent Youtu Lab, Shanghai, Peoples R China
[7] Natl Tsing Hua Univ, Hsinchu, Taiwan
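Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract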
Binary Neural Networks (BNNs) are highly effective at reducing the complexity of deep neural networks, but they suffer severe performance degradation. One major impediment is the large quantization error between a full-precision weight vector and its binarized version. Previous work focuses on compensating for the norm gap while leaving the angular bias largely untouched. In this paper, we explore, for the first time, the influence of angular bias on the quantization error and introduce the Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we rotate the full-precision weight vector toward its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. During training, we devise an adjustable rotated weight vector for binarization to escape potential local optima. Our rotation leads to around 50% weight flips, which maximizes the information gain. Finally, we propose a training-aware approximation of the sign function for gradient backpropagation. Experiments on CIFAR-10 and ImageNet demonstrate the superiority of RBNN over many state-of-the-art methods. Our source code, experimental settings, training logs, and binary models are available at https://github.com/lmbxmu/RBNN.
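The two quantities the abstract relies on, angular bias and bi-rotation, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (that is in the repository linked above). The function names `angular_bias`, `scaled_quantization_error`, and `random_rotation` are hypothetical, random rotations stand in for the learned ones, the least-squares scale is used as one common form of norm compensation, and the column-major flattening is an assumption made so that the Kronecker identity holds as written.

```python
import numpy as np

def binarize(w):
    """Sign binarization; zeros are mapped to +1 (a convention assumed here)."""
    return np.where(w >= 0, 1.0, -1.0)

def angular_bias(w):
    """Angle in radians between w and its binarized version."""
    b = binarize(w)
    cos = (w @ b) / (np.linalg.norm(w) * np.linalg.norm(b))
    return np.arccos(np.clip(cos, -1.0, 1.0))

def scaled_quantization_error(w):
    """Squared error ||w - alpha*b||^2 using the least-squares scale alpha."""
    b = binarize(w)
    alpha = np.abs(w).mean()  # minimizer of ||w - alpha*b||^2 over alpha
    return np.sum((w - alpha * b) ** 2)

def random_rotation(n, rng):
    """Random orthogonal matrix with determinant +1 (stand-in for a learned rotation)."""
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    if np.linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]
    return q

rng = np.random.default_rng(0)
n1, n2 = 4, 6
w = rng.standard_normal(n1 * n2)

# Even with the best scale, the residual error is ||w||^2 * sin^2(theta):
# compensating the norm gap leaves the angular bias in place.
theta = angular_bias(w)
err = scaled_quantization_error(w)
assert np.isclose(err, np.linalg.norm(w) ** 2 * np.sin(theta) ** 2)

# Bi-rotation: two small rotations applied to the reshaped weight equal one
# large Kronecker-structured rotation applied to the flat vector.
R1, R2 = random_rotation(n1, rng), random_rotation(n2, rng)
W = w.reshape(n1, n2, order="F")                 # column-major reshape
small = (R1.T @ W @ R2).reshape(-1, order="F")   # n1^2 + n2^2 parameters
big = np.kron(R2, R1).T @ w                      # (n1*n2)^2 parameters
assert np.allclose(small, big)

params_small = n1 * n1 + n2 * n2
params_big = (n1 * n2) ** 2
```

With `n1 = 4` and `n2 = 6`, the two small matrices hold 52 entries while the equivalent full rotation holds 576, which is the saving the bi-rotation formulation targets.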
Pages: 12
Related papers
50 in total
  • [1] Rotated general regression neural network
    Gholamrezaei, M.
    Ghorbanian, K.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1959 - 1964
  • [2] Diluted binary neural network
    Lin, Yuhan
    Niu, Lingfeng
    Xiao, Yang
    Zhou, Ruizhi
    PATTERN RECOGNITION, 2023, 140
  • [3] BINARY MORPHOLOGICAL NEURAL NETWORK
    Aouad, Theodore
    Talbot, Hugues
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3276 - 3280
  • [4] Hyperbolic Binary Neural Network
    Chen, Jun
    Xiang, Jingyang
    Huang, Tianxin
    Zhao, Xiangrui
    Liu, Yong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024
  • [5] Resilient Binary Neural Network
    Xu, Sheng
    Li, Yanjing
    Ma, Teli
    Lin, Mingbao
    Dong, Hao
    Zhang, Baochang
    Gao, Peng
    Lu, Jinhu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10620 - 10628
  • [6] SAR ATR WITH ROTATED REGION BASED ON CONVOLUTION NEURAL NETWORK
    Long, Yin
    Jiang, Xue
    Liu, Xingzhao
    Zhang, Yuzhe
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1184 - 1187
  • [7] Parameter-optimized rotated general regression neural network model
    Huang, Jin-Jie
    Xia, Jing
    Man, Chun-Tao
    Wang, Song-Tao
    Dianji yu Kongzhi Xuebao/Electric Machines and Control, 2009, 13 (03): : 442 - 447
  • [8] 3-LAYER NEURAL NETWORK MODELS FOR ROTATED PATTERNS RECOGNITION
    SUZAKI, K
    ARAYA, S
    NAKAMURA, R
    IFIP TRANSACTIONS A-COMPUTER SCIENCE AND TECHNOLOGY, 1992, 12 : 667 - 673
  • [9] Binary convolutional neural network: Review
    Ding W.
    Liu C.
    Li Y.
    Zhang B.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (06):
  • [10] Novel stochastic binary neural network
    Zhao, Jie-Yu
    Zhang, Quan-Fang
    Wang, Jia-Cai
    Zidonghua Xuebao/Acta Automatica Sinica, 2002, 28 (05): : 736 - 744