xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

被引:0
|
作者
Rutishauser, Georg [1 ]
Mihali, Joan [2 ]
Scherer, Moritz [1 ]
Benini, Luca [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Dept Informat Technol & Elektrotech, Zurich, Switzerland
[2] Univ Bologna, Dipartimento Ingn Energia Elettr & Informaz, Bologna, Italy
关键词
MULTIPLICATION; ACCELERATOR;
D O I
10.1109/ASAP61560.2024.00049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption is only marginally increased by 5.2 %, resulting in an energy efficiency improvement by 57.1 %. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISCV-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.
引用
收藏
页码:206 / 213
页数:8
相关论文
共 50 条
  • [41] SoftEdgeNet: SDN Based Energy-Efficient Distributed Network Architecture For Edge Computing
    Sharma, Pradip Kumar
    Rathore, Shailendra
    Jeong, Young-Sik
    Park, Jong Hyuk
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (12) : 104 - 111
  • [42] Collaborative communication and computational design for energy-efficient edge based learning network
    Dong, Erqiang
    Guo, Hengchuan
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2023, 2023 (01)
  • [43] PULP-NN: A Computing Library for Quantized Neural Network inference at the edge on RISC-V Based Parallel Ultra Low Power Clusters
    Garofalo, Angelo
    Rusci, Manuele
    Conti, Francesco
    Rossi, Davide
    Benini, Luca
    2019 26TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2019, : 33 - 36
  • [44] DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference
    Zhang, Ziyang
    Zhao, Yang
    Li, Huan
    Lin, Changyao
    Liu, Jie
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9042 - 9059
  • [45] REALISE-IoT: RISC-V-Based Efficient and Lightweight Public-Key System for IoT Applications
    Mao, Gaoyu
    Liu, Yao
    Dai, Wangchen
    Li, Guangyan
    Zhang, Zhewen
    Lam, Alan H. F.
    Cheung, Ray C. C.
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02): : 3044 - 3055
  • [46] Energy-efficient Spiking Neural Network Equalization for IM/DD Systems with Optimized Neural Encoding
    von Bank, Alexander
    Edelmann, Eike-Manuel
    Schmalen, Laurent
    2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,
  • [47] A Hot/Cold Task Partition for Energy-Efficient Neural Network Deployment on Heterogeneous Edge Device
    Wane, Jihe
    Zhao, Jiaxiang
    Wang, Danghui
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 563 - 574
  • [48] Energy-Efficient Exposed Datapath Architecture With a RISC-V Instruction Set Mode
    Hepola, Kari
    Multanen, Joonas
    Jaaskelainen, Pekka
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (02) : 560 - 573
  • [49] An Energy-Efficient Deep Neural Network Accelerator Design
    Jung, Jueun
    Lee, Kyuho Jason
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 272 - 276
  • [50] An Approach of Binary Neural Network Energy-Efficient Implementation
    Gao, Jiabao
    Liu, Qingliang
    Lai, Jinmei
    ELECTRONICS, 2021, 10 (15)