xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

Authors
Rutishauser, Georg [1]
Mihali, Joan [2]
Scherer, Moritz [1]
Benini, Luca [1, 2]
Affiliations
[1] Swiss Fed Inst Technol, Dept Informat Technol & Elektrotech, Zurich, Switzerland
[2] Univ Bologna, Dipartimento Ingn Energia Elettr & Informaz, Bologna, Italy
Keywords
MULTIPLICATION; ACCELERATOR;
DOI
10.1109/ASAP61560.2024.00049
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption increases only marginally, by 5.2%, resulting in an energy efficiency improvement of 57.1%. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISC-V-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.
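For readers unfamiliar with ternary arithmetic, the following is a minimal Python sketch of what a ternary dot product computes and why ternary values can be stored more densely than 2 bits each. It is an illustration only and does not reproduce the xTern instructions or kernels, which the abstract does not describe. The function names `ternary_dot`, `pack_trits`, and `unpack_trits` are hypothetical, and the base-3 packing shown (five values per byte, since 3^5 = 243 ≤ 256) is an assumption chosen for illustration.

```python
def ternary_dot(weights, activations):
    """Dot product of two vectors whose entries are all in {-1, 0, +1}."""
    acc = 0
    for w, a in zip(weights, activations):
        if w not in (-1, 0, 1) or a not in (-1, 0, 1):
            raise ValueError("entries must be -1, 0, or +1")
        # Each product is itself -1, 0, or +1, so the sum needs only adds/subtracts.
        acc += w * a
    return acc


def pack_trits(trits):
    """Pack exactly five ternary values into one byte using base-3 digits.

    Illustrative encoding (an assumption, not taken from the paper):
    -1 -> digit 0, 0 -> digit 1, +1 -> digit 2; trits[0] is the least
    significant digit. The result is in the range 0..242.
    """
    if len(trits) != 5:
        raise ValueError("expected exactly five ternary values")
    byte = 0
    for t in reversed(trits):
        byte = byte * 3 + (t + 1)
    return byte


def unpack_trits(byte):
    """Inverse of pack_trits: recover the five ternary values from a byte."""
    out = []
    for _ in range(5):
        out.append(byte % 3 - 1)
        byte //= 3
    return out
```

Under this illustrative encoding, five ternary values occupy 8 bits (1.6 bits per value) instead of the 10 bits a plain 2-bit representation would use.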
Pages: 206-213
Number of pages: 8