IMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators

被引：2

作者：

Schumacher, Tobias ^{[1
]}

Plessl, Christian ^{[1
]}

Platzner, Marco ^{[1
]}

机构：

[1] Univ Gesamthsch Paderborn, Paderborn Ctr Parallel Comp, D-33098 Paderborn, Germany

来源：

MICROPROCESSORS AND MICROSYSTEMS | 2012年 / 36卷 / 02期

关键词：

Reconfigurable computing; kth nearest neighbor technique; FPGA; FRAMEWORK;

D O I：

10.1016/j.micpro.2011.04.002

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The design, implementation and optimization of FPGA accelerators is a challenging task, especially when the accelerator comprises multiple compute cores distributed across CPU and FPGA resources and memories and exhibits data-dependent runtime behavior. In order to simplify the development of FPGA accelerators we propose IMORC, an infrastructure and architecture template that helps raising the level of abstraction. The IMORC development flow bases on a modeling technique for visualizing an application's communication demand and an architecture template that aids the developer in implementing the design. The architectural template consists of a versatile on-chip interconnect with asynchronous FIFOs and bitwidth conversion placed into the communication links, a performance monitoring infrastructure for collecting performance information during runtime and a set of generic infrastructure cores which are frequently needed in accelerator designs. We demonstrate the usefulness of the IMORC development flow by means of the case study of accelerating the kth nearest neighbor thinning problem, where IMORC greatly helps us in understanding the communication demand and in implementing the application. With the integrated performance monitoring infrastructure, we gain insights into the data-dependent behavior of the accelerator that helps us in identifying bottlenecks and optimizing the accelerator to achieve a speedup of 10x to 40x over an optimized CPU implementation. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：110 / 126

页数：17

共 50 条

[31] A High-Performance Reconfigurable Computing Architecture using a Magnetic Configuration Memory
Silva, Victor
Fernandes, Jorge R.
Vestias, Mario P.
Neto, Horacio C.
2012 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2012,
[32] SoC Reconfigurable Architecture for Implementing Software Trained Recurrent Neural Networks on FPGA
Wasef, Michael
Rafla, Nader
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2497 - 2510
[33] High-performance architecture for flow-table lookup in SDN on FPGA
Rashid Hatami
Hossein Bahramgiri
The Journal of Supercomputing, 2019, 75 : 384 - 399
[34] VTR 8: High-performance CAD and Customizable FPGA Architecture Modelling
Murray, Kevin E.
Petelin, Oleg
Zhong, Sheng
Wang, Jia Min
Eldafrawy, Mohamed
Legault, Jean-Philippe
Sha, Eugene
Graham, Aaron G.
Wu, Jean
Walker, Matthew J. P.
Zeng, Hanqing
Patros, Panagiotis
Luu, Jason
Kent, Kenneth B.
Betz, Vaughn
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2020, 13 (02)
[35] Designing a novel high-performance FPGA architecture for data intensive applications
Kostas Siozios
Dimitrios Soudris
Journal of Real-Time Image Processing, 2009, 4 : 155 - 166
[36] Low-Power High-Performance Multitransform Architecture Using Run-Time Reconfigurable Adder for FPGA and ASIC Implementation
Sivanandam, K.
Kumar, P.
SYSTEM AND ARCHITECTURE, CSI 2015, 2018, 732 : 63 - 72
[37] Designing a novel high-performance FPGA architecture for data intensive applications
Siozios, Kostas
Soudris, Dimitrios
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2009, 4 (02) : 155 - 166
[38] High-performance architecture for flow-table lookup in SDN on FPGA
Hatami, Rashid
Bahramgiri, Hossein
JOURNAL OF SUPERCOMPUTING, 2019, 75 (01): : 384 - 399
[39] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
Prost-Boucle, Adrien
Bourge, Alban
Petrot, Frederic
Alemdar, Hande
Caldwell, Nicholas
Leroy, Vincent
2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
[40] Python accelerators for high-performance computing
Ami Marowka
The Journal of Supercomputing, 2018, 74 : 1449 - 1460

← 1 2 3 4 5 →