Energy Efficient LSTM Accelerators for Embedded FPGAs Through Parameterised Architecture Design

被引:4
|
作者
Qian, Chao [1 ]
Ling, Tianheng [1 ]
Schiele, Gregor [1 ]
机构
[1] Univ Duisburg, Embedded Syst Lab, Duisburg, Germany
关键词
LSTM; Energy Efficiency; Embedded FPGAs;
D O I
10.1007/978-3-031-42785-5_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long Short-term Memory Networks (LSTMs) are a vital Deep Learning technique suitable for performing on-device time series analysis on local sensor data streams of embedded devices. In this paper, we propose a new hardware accelerator design for LSTMs specially optimised for resource-scarce embedded Field Programmable Gate Arrays (FPGAs). Our design improves the execution speed and reduces energy consumption compared to related work. Moreover, it can be adapted to different situations using a number of optimisation parameters, such as the usage of DSPs or the implementation of activation functions. We present our key design decisions and evaluate the performance. Our accelerator achieves an energy efficiency of 11.89 GOP/s/W during a real-time inference with 32873 samples/s.
引用
收藏
页码:3 / 17
页数:15
相关论文
共 50 条
  • [31] NeuroPower: Designing Energy Efficient Convolutional Neural Network Architecture for Embedded Systems
    Loni, Mohammad
    Zoljodi, Ali
    Sinaei, Sima
    Daneshtalab, Masoud
    Sjodin, Mikael
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 208 - 222
  • [32] A Transprecision Floating-Point Architecture for Energy-Efficient Embedded Computing
    Mach, Stefan
    Rossi, Davide
    Tagliavini, Giuseppe
    Marongiu, Andrea
    Benini, Luca
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [33] Energy Efficient Outdoor Light Monitoring and Control Architecture Using Embedded System
    Kaleem, Zeeshan
    Yoon, Tae Min
    Lee, Chankil
    IEEE EMBEDDED SYSTEMS LETTERS, 2016, 8 (01) : 18 - 21
  • [34] Way Halted Prediction Cache : An Energy Efficient Cache Architecture for Embedded Processors
    Mallya, Neethu Bal
    Patil, Geeta
    Raveendran, Biju
    2015 28TH INTERNATIONAL CONFERENCE ON VLSI DESIGN (VLSID), 2015, : 65 - 70
  • [35] Design and implementation of novel FIR filter architecture for efficient signal boundary handling on Xilinx VIRTEX FPGAs
    Benkrid, A
    Benkrid, K
    Crookes, D
    VLSI 2004: IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS, 2004, : 222 - 225
  • [36] An Embedded Co-processor Architecture for Energy-efficient Stream Computing
    Panda, Amrit
    Chatha, Karam S.
    2014 IEEE 12TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2014, : 60 - 69
  • [37] Power-efficient FIR filter architecture design for wireless embedded system
    Lin, SF
    Huang, SC
    Yang, FS
    Ku, CW
    Chen, LG
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2004, 51 (01) : 21 - 25
  • [38] Energy-Efficient DNN Inference on Approximate Accelerators Through Formal Property Exploration
    Spantidi, Ourania
    Zervakis, Georgios
    Anagnostopoulos, Iraklis
    Henkel, Joerg
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 3838 - 3849
  • [39] Design and Modeling of Energy Efficient WSN Architecture for Tactical Applications
    Mohammad, Nazeeruddin
    Muhammad, Shahabuddin
    Bashar, Abul
    Khan, Majid Ali
    2017 MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS CONFERENCE (MILCIS), 2017,
  • [40] Design of an adaptive architecture for energy efficient wireless image communication
    Taylor, CN
    Panigrahi, D
    Dey, S
    EMBEDDED PROCESSOR DESIGN CHALLENGES: SYSTEMS, ARCHITECTURES, MODELLING, AND SIMULATION - SAMOS, 2002, 2268 : 260 - 273