Exact Neural Networks from Inexact Multipliers via Fibonacci Weight Encoding

被引:7
|
作者
Simon, William Andrew [1 ]
Ray, Valerian
Levisse, Alexandre [1 ]
Ansaloni, Giovanni [1 ]
Zapater, Marina [1 ,2 ]
Atienza, David [1 ]
机构
[1] Swiss Fed Inst Technol Lausanne EPFL, Embedded Syst Lab ESL, Lausanne, Switzerland
[2] Univ Appl Sci Western Switzerland HEIG VD HES SO, Delemont, Switzerland
关键词
neural networks; quantization; accelerators; approximate computing;
D O I
10.1109/DAC18074.2021.9586245
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge devices must support computationally demanding algorithms, such as neural networks, within tight area/energy budgets. While approximate computing may alleviate these constraints, limiting induced errors remains an open challenge. In this paper, we propose a hardware/software co-design solution via an inexact multiplier, reducing area/power-delay-product requirements by 73/43%, respectively, while still computing exact results when one input is a Fibonacci encoded value. We introduce a retraining strategy to quantize neural network weights to Fibonacci encoded values, ensuring exact computation during inference. We benchmark our strategy on Squeezenet 1.0, DenseNet-121, and ResNet-18, measuring accuracy degradations of only 0.4/1.1/1.7%.
引用
收藏
页码:805 / 810
页数:6
相关论文
共 50 条
  • [21] From Clustering to Cluster Explanations via Neural Networks
    Kauffmann, Jacob
    Esders, Malte
    Ruff, Lukas
    Montavon, Gregoire
    Samek, Wojciech
    Mueller, Klaus-Robert
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1926 - 1940
  • [22] Affective Concept-Based Encoding of Patient Narratives via Sentic Computing and Neural Networks
    Grissette, Hanane
    Nfaoui, El Habib
    COGNITIVE COMPUTATION, 2022, 14 (01) : 274 - 299
  • [23] Efficient Global Robustness Certification of Neural Networks via Interleaving Twin-Network Encoding
    Wang, Zhilu
    Huang, Chao
    Zhu, Qi
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 1087 - 1092
  • [24] Affective Concept-Based Encoding of Patient Narratives via Sentic Computing and Neural Networks
    Hanane Grissette
    El Habib Nfaoui
    Cognitive Computation, 2022, 14 : 274 - 299
  • [25] Convolving over Time via Recurrent Connections for Sequential Weight Sharing in Neural Networks
    Allred, Jason M.
    Roy, Kaushik
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4444 - 4450
  • [26] Weak and strong convergence analysis of Elman neural networks via weight decay regularization
    Zhou, Li
    Fan, Qinwei
    Huang, Xiaodi
    Liu, Yan
    OPTIMIZATION, 2023, 72 (09) : 2287 - 2309
  • [27] Detection and classification of MSTAR objects via morphological shared-weight neural networks
    Theera-Umpon, N
    Khabou, MA
    Gader, PD
    Keller, JM
    Shi, HC
    Li, HZ
    ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY V, 1998, 3370 : 530 - 540
  • [28] AutoShuffleNet: Learning Permutation Matrices via an Exact Lipschitz Continuous Penalty in Deep Convolutional Neural Networks
    Lyu, Jiancheng
    Zhang, Shuai
    Qi, Yingyong
    Xin, Jack
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 608 - 616
  • [29] Deep Convolutional Neural Networks for Fish Weight Prediction from Images
    Yang, Yunhan
    Xue, Bing
    Jesson, Linley
    Wylie, Matthew
    Zhang, Mengjie
    Wellenreuther, Maren
    PROCEEDINGS OF THE 2021 36TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2021,
  • [30] Time Series Forecasting via Derivative Spike Encoding and Bespoke Loss Functions for Spiking Neural Networks
    Manna, Davide Liberato
    Vicente-Sola, Alex
    Kirkland, Paul
    Bihl, Trevor Joseph
    Di Caterina, Gaetano
    COMPUTERS, 2024, 13 (08)