xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

Cited by: 0

Authors:

Rutishauser, Georg [1]

Mihali, Joan [2]

Scherer, Moritz [1]

Benini, Luca [1,2]

Affiliations:

[1] Swiss Fed Inst Technol, Dept Informat Technol & Elektrotech, Zurich, Switzerland

[2] Univ Bologna, Dipartimento Ingn Energia Elettr & Informaz, Bologna, Italy

Source:

2024 IEEE 35TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, ASAP 2024 | 2024

Keywords:

MULTIPLICATION; ACCELERATOR;

DOI:

10.1109/ASAP61560.2024.00049

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption increases only marginally, by 5.2%, resulting in an energy efficiency improvement of 57.1%. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISC-V-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.


Pages: 206-213

Page count: 8


