Rotated Binary Neural Network

被引:0
|
作者
Lin, Mingbao [1 ]
Ji, Rongrong [1 ,2 ,3 ]
Xu, Zihan [1 ]
Zhang, Baochang [4 ]
Wang, Yan [5 ]
Wu, Yongjian [6 ]
Huang, Feiyue [6 ]
Lin, Chia-Wen [7 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China
[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Beihang Univ, Beijing, Peoples R China
[5] Pinterest, San Francisco, CA USA
[6] Tencent Youtu Lab, Shanghai, Peoples R China
[7] Natl Tsing Hua Univ, Hsinchu, Taiwan
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Binary Neural Network (BNN) shows its predominance in reducing the complexity of deep neural networks. However, it suffers severe performance degradation. One of the major impediments is the large quantization error between the full-precision weight vector and its binary vector. Previous works focus on compensating for the norm gap while leaving the angular bias hardly touched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we propose to rotate the full-precision weight vector to its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape the potential local optimum. Our rotation leads to around 50% weight flips which maximize the information gain. Finally, we propose a training-aware approximation of the sign function for the gradient backward. Experiments on CIFAR-10 and ImageNet demonstrate the superiorities of RBNN over many state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/RBNN.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A comprehensive review of Binary Neural Network
    Yuan, Chunyu
    Agaian, Sos S.
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) : 12949 - 13013
  • [22] Dual Path Binary Neural Network
    Chen, Pei-Yin
    Tang, Chi-Huan
    Chen, Wei-Ting
    Yu, Hui-Liang
    2019 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2019, : 251 - 252
  • [23] Artificial Neural Network for Binary and Multiclassification of Network Attacks
    Omarov, Bauyrzhan
    Kostangeldinova, Alma
    Tukenova, Lyailya
    Mambetaliyeva, Gulsara
    Madiyarova, Almira
    Amirgaliyev, Beibut
    Kulambayev, Bakhytzhan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 729 - 736
  • [24] On the Aperiodic Autocorrelations of Rotated Binary Sequences
    Dimitrov, Miroslav
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (05) : 1427 - 1430
  • [25] A combined rotated general regression neural network method for river flow forecasting
    Yin, Sun
    Tang, Deshan
    Jin, Xin
    Chen, Weiwei
    Pu, Nannan
    HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2016, 61 (04): : 669 - 682
  • [26] Recurrent neural network decoding of rotated surface codes based on distributed strategy
    李帆
    李熬庆
    甘启迪
    马鸿洋
    Chinese Physics B, 2024, 33 (04) : 117 - 125
  • [27] Rotated Face Recognition by Manifold Learning with Auto-associative Neural Network
    Ito, Mizuki
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2015 21ST KOREA-JAPAN JOINT WORKSHOP ON FRONTIERS OF COMPUTER VISION, 2015,
  • [28] Recurrent neural network decoding of rotated surface codes based on distributed strategy
    Li, Fan
    Li, Ao-Qing
    Gan, Qi-Di
    Ma, Hong-Yang
    CHINESE PHYSICS B, 2024, 33 (04)
  • [29] A rotated kernel Probabilistic Neural Network (RKPNN) for multi-class classification
    Galleske, I
    Castellanos, J
    COMPUTATIONAL METHODS IN NEURAL MODELING, PT 1, 2003, 2686 : 152 - 157
  • [30] An artificial deep neural network for the binary classification of network traffic
    Abdullah S.A.
    Al-Ashoor A.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (01): : 402 - 408