xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

被引：0

作者：

Rutishauser, Georg ^{[1
]}

Mihali, Joan ^{[2
]}

Scherer, Moritz ^{[1
]}

Benini, Luca ^{[1
,2
]}

机构：

[1] Swiss Fed Inst Technol, Dept Informat Technol & Elektrotech, Zurich, Switzerland

[2] Univ Bologna, Dipartimento Ingn Energia Elettr & Informaz, Bologna, Italy

来源：

2024 IEEE 35TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, ASAP 2024 | 2024年

关键词：

MULTIPLICATION; ACCELERATOR;

D O I：

10.1109/ASAP61560.2024.00049

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption is only marginally increased by 5.2 %, resulting in an energy efficiency improvement by 57.1 %. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISCV-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.

引用

页码：206 / 213

页数：8

共 50 条

[21] XpulpNN: Enabling Energy Efficient and Flexible Inference of Quantized Neural Networks on RISC-V based IoT End Nodes
Garofalo, Angelo
Tagliavini, Giuseppe
Conti, Francesco
Benini, Luca
Rossi, Davide
2021 IEEE 28TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH 2021), 2021, : 53 - 53
[22] Real-Time Resource Allocation in Passive Optical Network for Energy-Efficient Inference at GPU-Based Network Edge
Nakayama, Yu
Nguyen, Anh Hoang Ngoc
Hara-Azumi, Yuko
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (18): : 17348 - 17358
[23] An energy-efficient crypto-extension design for RISC-V
Wang, Weizhen
Han, Jun
Cheng, Xu
Zeng, Xiaoyang
MICROELECTRONICS JOURNAL, 2021, 115
[24] Competitive Hyperparameter Balancing on Spiking Neural Network for a Fast, Accurate and Energy-Efficient Inference
Kim, Jeongho
Kim, Dae-Shik
ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 : 44 - 53
[25] Energy-Efficient Inference on the Edge Exploiting TinyML Capabilities for UAVs
Raza, Wamiq
Osman, Anas
Ferrini, Francesco
Natale, Francesco De
DRONES, 2021, 5 (04)
[26] An Accurate, Error-Tolerant, and Energy-Efficient Neural Network Inference Engine Based on SONOS Analog Memory
Xiao, T. Patrick
Feinberg, Ben
Bennett, Christopher H.
Agrawal, Vineet
Saxena, Prashant
Prabhakar, Venkatraman
Ramkumar, Krishnaswamy
Medu, Harsha
Raghavan, Vijay
Chettuvetty, Ramesh
Agarwal, Sapan
Marinella, Matthew J.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (04) : 1480 - 1493
[27] Energy-Efficient Virtual Network Embedding Algorithm Based on Hopfield Neural Network
He, Mengyang
Zhuang, Lei
Yang, Sijin
Zhang, Jianhui
Meng, Huiping
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[28] An energy-efficient ternary interconnection link for asynchronous systems
Philippe, Jean-Marc
Kinvi-Boh, Ekue
Pillement, Sebastien
Sentieys, Olivier
2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 1011 - +
[29] Towards energy-efficient neural network calculations
Noskova, E. S.
Zakharov, I. E.
Shkandybin, Y. N.
Rykovanov, S. G.
COMPUTER OPTICS, 2022, 46 (01) : 160 - 166
[30] Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
Rizvi, Shahriyar Masud
Ab Rahman, Ab Al-Hadi
Sheikh, Usman Ullah
Fuad, Kazi Ahmed Asif
Shehzad, Hafiz Muhammad Faisal
APPLIED INTELLIGENCE, 2023, 53 (04) : 4499 - 4523

← 1 2 3 4 5 →