FPGA-based Acceleration of Time Series Similarity Prediction: From Cloud to Edge

Cited by: 2
Authors
Kalantar, Amin [1 ]
Zimmerman, Zachary [2 ]
Brisk, Philip [1 ]
Affiliations
[1] Univ Calif Riverside, 900 Univ Ave, Riverside, CA 92521 USA
[2] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
Funding
U.S. National Science Foundation;
关键词
Field-programmable gate array (FPGA); time series; Matrix Profile;
DOI
10.1145/3555810
CLC Number
TP3 [Computing Technology, Computer Technology];
Subject Classification Code
0812;
Abstract
With the proliferation of low-cost sensors and the Internet of Things, the rate of producing data far exceeds the compute and storage capabilities of today's infrastructure. Much of this data takes the form of time series, and in response, there has been increasing interest in the creation of time series archives in the past decade, along with the development and deployment of novel analysis methods to process the data. The general strategy has been to apply a plurality of similarity search mechanisms to various subsets and subsequences of time series data to identify repeated patterns and anomalies; however, the computational demands of these approaches render them incompatible with today's power-constrained embedded CPUs. To address this challenge, we present FA-LAMP, an FPGA-accelerated implementation of the Learned Approximate Matrix Profile (LAMP) algorithm, which predicts the correlation between streaming data sampled in real time and a representative time series dataset used for training. FA-LAMP lends itself to real-time time series analysis problems such as classification. We present the implementation of FA-LAMP on both edge- and cloud-based prototypes. On the edge devices, FA-LAMP integrates accelerated computation as close as possible to IoT sensors, thereby eliminating the need to transmit and store data in the cloud for later analysis. On the cloud-based accelerators, FA-LAMP can execute multiple LAMP models on the same board, allowing simultaneous processing of incoming data from multiple data sources across a network. LAMP employs a Convolutional Neural Network (CNN) for prediction. This work investigates the challenges and limitations of deploying CNNs on FPGAs using the Xilinx Deep Learning Processor Unit (DPU) and the Vitis AI development environment. We expose several technical limitations of the DPU, while providing a mechanism to overcome them by attaching custom IP block accelerators to the architecture.
We evaluate FA-LAMP using a low-cost Xilinx Ultra96-V2 FPGA as well as a cloud-based Xilinx Alveo U280 accelerator card and measure their performance against a prototypical LAMP deployment running on a Raspberry Pi 3, an Edge TPU, a GPU, a desktop CPU, and a server-class CPU. In the edge scenario, the Ultra96-V2 FPGA improved performance and energy consumption compared to the Raspberry Pi; in the cloud scenario, the server CPU and GPU outperformed the Alveo U280 accelerator card, while the desktop CPU achieved comparable performance; however, the Alveo card offered an order of magnitude lower energy consumption compared to the other four platforms. Our implementation is publicly available at https://github.com/aminiok1/lamp-alveo.
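To make the prediction target concrete, the quantity a LAMP model learns to approximate is the Matrix Profile-style correlation of each incoming length-m window against representative training subsequences. The sketch below computes that target exactly (rather than with the CNN the paper accelerates), using the standard identity that the Pearson correlation of two z-normalized length-m vectors is their dot product divided by m; the function and variable names are illustrative, not taken from the FA-LAMP codebase.

```python
import numpy as np

def znorm(x):
    """Z-normalize a vector; return zeros if it is constant."""
    x = np.asarray(x, dtype=float)
    s = x.std()
    return (x - x.mean()) / s if s > 0 else np.zeros_like(x)

def lamp_target(stream, train_subseqs, m):
    """For each length-m sliding window of `stream`, return the value a
    LAMP model is trained to predict: the maximum Pearson correlation
    against the representative (training) subsequences of length m."""
    refs = [znorm(t) for t in train_subseqs]
    out = []
    for i in range(len(stream) - m + 1):
        w = znorm(stream[i:i + m])
        # For z-normalized vectors, Pearson correlation = dot(w, r) / m
        out.append(max(float(np.dot(w, r)) / m for r in refs))
    return out
```

A CNN trained on (window, target) pairs from a representative dataset replaces this exact computation at inference time, which is what makes the streaming workload amenable to fixed-function FPGA acceleration.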
Pages: 27
Related Papers
50 results in total
  • [41] FPGA-based Cloudification of ECG Signal Diagnosis Acceleration
    Lee, Dongkyu
    Lee, SeungMin
    Park, Daejin
    12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2021), 2021, : 236 - 238
  • [42] FPGA-based Acceleration for Tracking Audio Effects in Movies
    Psarakis, Mihalis
    Pikrakis, Aggelos
    Dendrinos, Giannis
    2012 IEEE 20TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2012, : 85 - 92
  • [43] FPGA-based acceleration of detecting statistical epistasis in GWAS
    Wienbrandt, Lars
    Kaessens, Jan Christian
    Gonzalez-Dominguez, Jorge
    Schmidt, Bertil
    Ellinghaus, David
    Schimmler, Manfred
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 220 - 230
  • [44] FPGA-based Acceleration of FDAS Module Using OpenCL
    Wang, Haomiao
    Zhang, Ming
    Thiagaraj, Prabu
    Sinnen, Oliver
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 53 - 60
  • [45] Preconditioned Conjugate Gradient Acceleration on FPGA-Based Platforms
    Malakonakis, Pavlos
    Isotton, Giovanni
    Miliadis, Panagiotis
    Alverti, Chloe
    Theodoropoulos, Dimitris
    Pnevmatikatos, Dionisios
    Ioannou, Aggelos
    Harteros, Konstantinos
    Georgopoulos, Konstantinos
    Papaefstathiou, Ioannis
    Mavroidis, Iakovos
    ELECTRONICS, 2022, 11 (19)
  • [46] FPGA-Based Acceleration of FDTD Sound Field Rendering
    Tan, Yiyu
    Imamura, Toshiyuki
    Kondo, Masaaki
    AES: Journal of the Audio Engineering Society, 2021, 69 (7-8): : 542 - 556
  • [47] FPGA-based Learning Acceleration for LSTM Neural Network
    Dec, Grzegorz Rafal
    PARALLEL PROCESSING LETTERS, 2023, 33 (01N02)
  • [48] FPGA-Based Acceleration of FDTD Sound Field Rendering
    Tan, Yiyu
    Imamura, Toshiyuki
    Kondo, Masaaki
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2021, 69 (7-8): : 542 - 556
  • [49] FPGA-based acceleration architecture for Apache Spark operators
    Sun, Yuanwei
    Liu, Haikun
    Liao, Xiaofei
    Jin, Hai
    Zhang, Yu
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2024, 6 (02) : 192 - 205
  • [50] Pipelined Parallel Join and Its FPGA-Based Acceleration
    Yoshimi, Masato
    Oge, Yasin
    Yoshinaga, Tsutomu
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2017, 10 (04)