Exact Neural Networks from Inexact Multipliers via Fibonacci Weight Encoding

Cited by: 7
Authors
Simon, William Andrew [1]
Ray, Valerian
Levisse, Alexandre [1]
Ansaloni, Giovanni [1]
Zapater, Marina [1, 2]
Atienza, David [1]
Affiliations
[1] Swiss Fed Inst Technol Lausanne EPFL, Embedded Syst Lab ESL, Lausanne, Switzerland
[2] Univ Appl Sci Western Switzerland HEIG VD HES SO, Delemont, Switzerland
Keywords
neural networks; quantization; accelerators; approximate computing
DOI
10.1109/DAC18074.2021.9586245
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Edge devices must support computationally demanding algorithms, such as neural networks, within tight area and energy budgets. While approximate computing may alleviate these constraints, limiting the errors it induces remains an open challenge. In this paper, we propose a hardware/software co-design solution built on an inexact multiplier that reduces area and power-delay product by 73% and 43%, respectively, while still computing exact results when one input is a Fibonacci-encoded value. We introduce a retraining strategy that quantizes neural network weights to Fibonacci-encoded values, ensuring exact computation during inference. We benchmark our strategy on SqueezeNet 1.0, DenseNet-121, and ResNet-18, measuring accuracy degradations of only 0.4%, 1.1%, and 1.7%, respectively.
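
The abstract does not define "Fibonacci encoded value". The sketch below assumes it means a binary word with no two adjacent 1 bits (the number of such n-bit words is a Fibonacci number), and shows how an integer weight magnitude could be snapped to the nearest such value. The function names, the 8-bit default width, and the nearest-value rule are illustrative assumptions; the paper's retraining strategy is not reproduced here.

```python
def is_fibonacci_encoded(value: int) -> bool:
    """Assumed definition: a non-negative integer whose binary form has no two adjacent 1 bits."""
    return value >= 0 and (value & (value >> 1)) == 0


def fibonacci_codebook(bits: int) -> list[int]:
    """All values representable in `bits` bits that satisfy the assumed constraint."""
    return [v for v in range(1 << bits) if is_fibonacci_encoded(v)]


def quantize_magnitude(value: int, bits: int = 8) -> int:
    """Snap a non-negative integer to the nearest codebook value (ties go to the smaller one)."""
    codebook = fibonacci_codebook(bits)
    return min(codebook, key=lambda c: (abs(c - value), c))


if __name__ == "__main__":
    print(fibonacci_codebook(4))
    print([quantize_magnitude(w, bits=4) for w in (3, 6, 7, 15)])
```

Under this assumption, a 4-bit codebook keeps 8 of the 16 possible values, so quantization error comes only from rounding weights onto the codebook, not from the multiplier itself.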
Pages: 805 - 810
Page count: 6
Related papers
50 in total
  • [41] Efficient HD Map encoding via disentangled style-structure representation using graph neural networks
    Beche, Radu
    Nedevschi, Sergiu
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING, ICCP, 2022, : 119 - 128
  • [42] Damage Detection from Aerial Images via Convolutional Neural Networks
    Fujita, Aito
    Sakurada, Ken
    Imaizumi, Tomoyuki
    Ito, Riho
    Hikosaka, Shuhei
    Nakamura, Ryosuke
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 5 - 8
  • [43] Rule extraction from neural networks via decision tree induction
    Sato, M
    Tsukimoto, H
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1870 - 1875
  • [44] A DEXiRE for Extracting Propositional Rules from Neural Networks via Binarization
    Contreras, Victor
    Marini, Niccolo
    Fanda, Lora
    Manzo, Gaetano
    Mualla, Yazan
    Calbimonte, Jean-Paul
    Schumacher, Michael
    Calvaresi, Davide
    ELECTRONICS, 2022, 11 (24)
  • [45] Imaging from Temporal Data via Spiking Convolutional Neural Networks
    Kirkland, Paul
    Kapitany, Valentin
    Lyons, Ashley
    Soraghan, John
    Turpin, Alex
    Faccio, Daniele
    Di Caterina, Gaetano
    EMERGING IMAGING AND SENSING TECHNOLOGIES FOR SECURITY AND DEFENCE V; AND ADVANCED MANUFACTURING TECHNOLOGIES FOR MICRO- AND NANOSYSTEMS IN SECURITY AND DEFENCE III, 2020, 11540
  • [46] Retrieving real world clothing images via multi-weight deep convolutional neural networks
    Ruifan Li
    Fangxiang Feng
    Ibrar Ahmad
    Xiaojie Wang
    Cluster Computing, 2019, 22 : 7123 - 7134
  • [47] Retrieving real world clothing images via multi-weight deep convolutional neural networks
    Li, Ruifan
    Feng, Fangxiang
    Ahmad, Ibrar
    Wang, Xiaojie
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3) : S7123 - S7134
  • [48] Tracking Control of Unknown and Constrained Nonlinear Systems via Neural Networks With Implicit Weight and Activation Learning
    Cui, Qian
    Song, Yongduan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5427 - 5434
  • [49] Exact representation and efficient approximations of linear model predictive control laws via HardTanh type deep neural networks
    Lupu, Daniela
    Necoara, Ion
    SYSTEMS & CONTROL LETTERS, 2024, 186
  • [50] Evaluation of the unit weight of organic soils from a CPTM using an Artificial Neural Networks
    Straz, G.
    Borowiec, A.
    ARCHIVES OF CIVIL ENGINEERING, 2021, 67 (03) : 259 - 281