xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

被引:0
|
作者
Rutishauser, Georg [1 ]
Mihali, Joan [2 ]
Scherer, Moritz [1 ]
Benini, Luca [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Dept Informat Technol & Elektrotech, Zurich, Switzerland
[2] Univ Bologna, Dipartimento Ingn Energia Elettr & Informaz, Bologna, Italy
关键词
MULTIPLICATION; ACCELERATOR;
D O I
10.1109/ASAP61560.2024.00049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption is only marginally increased by 5.2 %, resulting in an energy efficiency improvement by 57.1 %. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISCV-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.
引用
收藏
页码:206 / 213
页数:8
相关论文
共 50 条
  • [31] Utilizing Dual-Port FeFETs for Energy-Efficient Binary Neural Network Inference Accelerators
    Rafiq, Musaib
    Chatterjee, Swetaki
    Kumar, Shubham
    Chauhan, Yogesh Singh
    Sahay, Shubham
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2024, 71 (07) : 4381 - 4388
  • [32] SparkXD: A Framework for Resilient and Energy-Efficient Spiking Neural Network Inference using Approximate DRAM
    Putra, Rachmad Vidya Wicaksana
    Hanif, Muhammad Abdullah
    Shafique, Muhammad
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 379 - 384
  • [33] Energy-Efficient Mapping for a Network of DNN Models at the Edge
    Ghasemi, Mehdi
    Heidari, Soroush
    Kim, Young Geun
    Lamb, Aaron
    Wu, Carole-Jean
    Vrudhula, Sarma
    2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 25 - 30
  • [34] Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
    Shahriyar Masud Rizvi
    Ab Al-Hadi Ab Rahman
    Usman Ullah Sheikh
    Kazi Ahmed Asif Fuad
    Hafiz Muhammad Faisal Shehzad
    Applied Intelligence, 2023, 53 : 4499 - 4523
  • [35] SONIC: A Sparse Neural Network Inference Accelerator with Silicon Photonics for Energy-Efficient Deep Learning
    Sunny, Febin
    Nikdast, Mandi
    Pasricha, Sudeep
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 214 - 219
  • [36] An Energy-Efficient FPGA-based Convolutional Neural Network Implementation
    Irmak, Hasan
    Alachiotis, Nikolaos
    Ziener, Daniel
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [37] A Proposal for Energy-Efficient Cellular Neural Network Based on Spintronic Devices
    Pan, Chenyun
    Naeemi, Azad
    IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2016, 15 (05) : 820 - 827
  • [38] An Energy-Efficient Accelerator for Rain Removal Based on Convolutional Neural Network
    Rao, Lei
    Zhang, Bin
    Zhao, Jizhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (08) : 2957 - 2961
  • [39] Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference
    Yang, Kai
    Shi, Yuanming
    Yu, Wei
    Ding, Zhi
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) : 9456 - 9470
  • [40] RISC-V-Based Evaluation and Strategy Exploration of MRAM Triple-Level Hybrid Cache Systems
    Han, Shaopu
    Jiang, Yanfeng
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (07) : 980 - 992