FiC-RNN: A Multi-FPGA Acceleration Framework for Deep Recurrent Neural Networks

Cited by: 12

Authors:

Sun, Yuxi [1]

Amano, Hideharu [1]

Affiliations:

[1] Keio Univ, Dept Informat & Comp Sci, Yokohama, Kanagawa 2238522, Japan

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2020 / Vol. E103D / No. 12

Keywords:

multi-FPGA; recurrent neural networks; LSTM;

DOI:

10.1587/transinf.2020PAP0003

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Recurrent neural networks (RNNs) have been proven effective for sequence-based tasks thanks to their capability to process temporal information. In real-world systems, deep RNNs are more widely used to solve complicated tasks such as large-scale speech recognition and machine translation. However, the implementation of deep RNNs on traditional hardware platforms is inefficient due to long-range temporal dependence and irregular computation patterns within RNNs. This inefficiency manifests itself in the proportional increase in the latency of RNN inference with respect to the number of layers of deep RNNs on CPUs and GPUs. Previous work has focused mostly on optimizing and accelerating individual RNN cells. To make deep RNN inference fast and efficient, we propose an accelerator based on a multi-FPGA platform called Flow-in-Cloud (FiC). In this work, we show that the parallelism provided by the multi-FPGA system can be taken advantage of to scale up the inference of deep RNNs, by partitioning a large model onto several FPGAs, so that the latency stays close to constant with respect to increasing number of RNN layers. For single-layer and four-layer RNNs, our implementation achieves 31x and 61x speedup compared with an Intel CPU.

Citation

Pages: 2457-2462

Page count: 6

