Rotated Binary Neural Network

Cited by: 0
Authors
Lin, Mingbao [1]
Ji, Rongrong [1, 2, 3]
Xu, Zihan [1]
Zhang, Baochang [4]
Wang, Yan [5]
Wu, Yongjian [6]
Huang, Feiyue [6]
Lin, Chia-Wen [7]
Affiliations
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Media Analyt & Comp Lab, Xiamen, Peoples R China
[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Beihang Univ, Beijing, Peoples R China
[5] Pinterest, San Francisco, CA USA
[6] Tencent Youtu Lab, Shanghai, Peoples R China
[7] Natl Tsing Hua Univ, Hsinchu, Taiwan
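Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract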
Binary Neural Networks (BNNs) are highly effective at reducing the complexity of deep neural networks. However, they suffer from severe performance degradation. One of the major impediments is the large quantization error between the full-precision weight vector and its binary vector. Previous works focus on compensating for the norm gap while leaving the angular bias largely untouched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we propose to rotate the full-precision weight vector to its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape the potential local optimum. Our rotation leads to around 50% weight flips, which maximizes the information gain. Finally, we propose a training-aware approximation of the sign function for gradient backpropagation. Experiments on CIFAR-10 and ImageNet demonstrate the superiority of RBNN over many state-of-the-art methods. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/RBNN.
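To make the abstract's distinction between the norm gap and the angular bias concrete, the following minimal sketch measures both for a random weight vector. It is an illustration under stated assumptions, not the authors' implementation: the function names `angular_bias` and `quantization_error`, the Gaussian test vector, and the choice of a single least-squares scale factor `alpha` are all hypothetical choices made here. It relies only on the standard identity that, once the scale is chosen optimally, the remaining squared error equals ||w||^2 * sin^2(theta), where theta is the angle between w and sign(w). In other words, scaling alone cannot remove the error caused by the angle.

```python
import math
import random


def sign(x):
    # Map zero to +1 so every entry of the binary vector is +1 or -1.
    return 1.0 if x >= 0 else -1.0


def angular_bias(w):
    """Angle in radians between w and its binarized version sign(w)."""
    b = [sign(x) for x in w]
    dot = sum(x * y for x, y in zip(w, b))
    norm_w = math.sqrt(sum(x * x for x in w))
    norm_b = math.sqrt(len(w))  # every entry of b is +1 or -1
    # Clamp to [-1, 1] to guard against floating-point overshoot.
    cos_theta = max(-1.0, min(1.0, dot / (norm_w * norm_b)))
    return math.acos(cos_theta)


def quantization_error(w):
    """Squared error ||w - alpha * sign(w)||^2 with the least-squares scale."""
    b = [sign(x) for x in w]
    alpha = sum(abs(x) for x in w) / len(w)  # optimal scale: mean |w_i|
    return sum((x - alpha * y) ** 2 for x, y in zip(w, b))


# Hypothetical test vector: 256 Gaussian weights (an assumption for illustration).
random.seed(0)
w = [random.gauss(0.0, 1.0) for _ in range(256)]

theta = angular_bias(w)
err = quantization_error(w)
norm_sq = sum(x * x for x in w)

print("angular bias (degrees):", math.degrees(theta))
print("quantization error:", err)
print("||w||^2 * sin^2(theta):", norm_sq * math.sin(theta) ** 2)
```

The last two printed values agree up to floating-point rounding, which is why reducing the angle, for example by rotating the weight vector toward its binary vector as the abstract proposes, directly reduces the residual quantization error.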
Pages: 12
Related papers
50 in total
  • [41] Deep Learning Binary Neural Network on an FPGA
    Zhou, Yuteng
    Redkar, Shrutika
    Huang, Xinming
    2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017: 281-284
  • [42] A fuzzy binary neural network for interpretable classifications
    Meyer, Robert
    O'Keefe, Simon
    NEUROCOMPUTING, 2013, 121: 401-415
  • [43] Modulated Binary Clique Convolutional Neural Network
    Xia, Jinpeng
    Wu, Jiasong
    Wu, Fuzhi
    Kong, Youyong
    Zhang, Pinzheng
    Senhadji, Lotfi
    Shu, Huazhong
    2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019: 252-257
  • [44] LINEAR CLASSIFICATION WITH BINARY CONNECTION NEURAL NETWORK
    DECHAMBOST, E
    NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989: 733-741
  • [45] Quantum Based Learning with Binary Neural Network
    Patel, Om Prakash
    Tiwari, Aruna
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32: 473-482
  • [46] Lipschitz Continuity Retained Binary Neural Network
    Shang, Yuzhang
    Xu, Dan
    Duan, Bin
    Zong, Ziliang
    Nie, Liqiang
    Yan, Yan
    COMPUTER VISION, ECCV 2022, PT XI, 2022, 13671: 603-619
  • [47] Highly parallelized memristive binary neural network
    Chen, Jiadong
    Wen, Shiping
    Shi, Kaibo
    Yang, Yin
    NEURAL NETWORKS, 2021, 144: 565-572
  • [48] Adaptive Binarization Method for Binary Neural Network
    Liu, Zhongwei
    Zhang, Hesheng
    Su, Zhenhua
    Zhu, Xiaojin
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021: 8123-8127
  • [49] Learning to Hash with Binary Deep Neural Network
    Thanh-Toan Do
    Anh-Dzung Doan
    Cheung, Ngai-Man
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909: 219-234
  • [50] Spectral neural network potentials for binary alloys
    Zagaceta, David
    Yanxon, Howard
    Zhu, Qiang
    JOURNAL OF APPLIED PHYSICS, 2020, 128 (04)