FiC-RNN: A Multi-FPGA Acceleration Framework for Deep Recurrent Neural Networks

Cited by: 12
Authors
Sun, Yuxi [1]
Amano, Hideharu [1]
Affiliations
[1] Keio Univ, Dept Informat & Comp Sci, Yokohama, Kanagawa 2238522, Japan
Keywords
multi-FPGA; recurrent neural networks; LSTM
DOI
10.1587/transinf.2020PAP0003
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Recurrent neural networks (RNNs) have proven effective for sequence-based tasks thanks to their ability to process temporal information. In real-world systems, deep RNNs are more widely used to solve complicated tasks such as large-scale speech recognition and machine translation. However, implementing deep RNNs on traditional hardware platforms is inefficient because of the long-range temporal dependence and irregular computation patterns within RNNs. This inefficiency manifests itself as RNN inference latency that grows in proportion to the number of layers on CPUs and GPUs. Previous work has focused mostly on optimizing and accelerating individual RNN cells. To make deep RNN inference fast and efficient, we propose an accelerator based on a multi-FPGA platform called Flow-in-Cloud (FiC). In this work, we show that the parallelism provided by the multi-FPGA system can be exploited to scale up the inference of deep RNNs by partitioning a large model across several FPGAs, so that the latency stays close to constant as the number of RNN layers increases. For single-layer and four-layer RNNs, our implementation achieves 31x and 61x speedups, respectively, compared with an Intel CPU.
Pages: 2457-2462
Number of pages: 6