IMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators

被引:2
|
作者
Schumacher, Tobias [1 ]
Plessl, Christian [1 ]
Platzner, Marco [1 ]
机构
[1] Univ Gesamthsch Paderborn, Paderborn Ctr Parallel Comp, D-33098 Paderborn, Germany
关键词
Reconfigurable computing; kth nearest neighbor technique; FPGA; FRAMEWORK;
D O I
10.1016/j.micpro.2011.04.002
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The design, implementation and optimization of FPGA accelerators is a challenging task, especially when the accelerator comprises multiple compute cores distributed across CPU and FPGA resources and memories and exhibits data-dependent runtime behavior. In order to simplify the development of FPGA accelerators we propose IMORC, an infrastructure and architecture template that helps raising the level of abstraction. The IMORC development flow bases on a modeling technique for visualizing an application's communication demand and an architecture template that aids the developer in implementing the design. The architectural template consists of a versatile on-chip interconnect with asynchronous FIFOs and bitwidth conversion placed into the communication links, a performance monitoring infrastructure for collecting performance information during runtime and a set of generic infrastructure cores which are frequently needed in accelerator designs. We demonstrate the usefulness of the IMORC development flow by means of the case study of accelerating the kth nearest neighbor thinning problem, where IMORC greatly helps us in understanding the communication demand and in implementing the application. With the integrated performance monitoring infrastructure, we gain insights into the data-dependent behavior of the accelerator that helps us in identifying bottlenecks and optimizing the accelerator to achieve a speedup of 10x to 40x over an optimized CPU implementation. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:110 / 126
页数:17
相关论文
共 50 条
  • [31] A High-Performance Reconfigurable Computing Architecture using a Magnetic Configuration Memory
    Silva, Victor
    Fernandes, Jorge R.
    Vestias, Mario P.
    Neto, Horacio C.
    2012 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2012,
  • [32] SoC Reconfigurable Architecture for Implementing Software Trained Recurrent Neural Networks on FPGA
    Wasef, Michael
    Rafla, Nader
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2497 - 2510
  • [33] High-performance architecture for flow-table lookup in SDN on FPGA
    Rashid Hatami
    Hossein Bahramgiri
    The Journal of Supercomputing, 2019, 75 : 384 - 399
  • [34] VTR 8: High-performance CAD and Customizable FPGA Architecture Modelling
    Murray, Kevin E.
    Petelin, Oleg
    Zhong, Sheng
    Wang, Jia Min
    Eldafrawy, Mohamed
    Legault, Jean-Philippe
    Sha, Eugene
    Graham, Aaron G.
    Wu, Jean
    Walker, Matthew J. P.
    Zeng, Hanqing
    Patros, Panagiotis
    Luu, Jason
    Kent, Kenneth B.
    Betz, Vaughn
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2020, 13 (02)
  • [35] Designing a novel high-performance FPGA architecture for data intensive applications
    Kostas Siozios
    Dimitrios Soudris
    Journal of Real-Time Image Processing, 2009, 4 : 155 - 166
  • [36] Low-Power High-Performance Multitransform Architecture Using Run-Time Reconfigurable Adder for FPGA and ASIC Implementation
    Sivanandam, K.
    Kumar, P.
    SYSTEM AND ARCHITECTURE, CSI 2015, 2018, 732 : 63 - 72
  • [37] Designing a novel high-performance FPGA architecture for data intensive applications
    Siozios, Kostas
    Soudris, Dimitrios
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2009, 4 (02) : 155 - 166
  • [38] High-performance architecture for flow-table lookup in SDN on FPGA
    Hatami, Rashid
    Bahramgiri, Hossein
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (01): : 384 - 399
  • [39] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
    Prost-Boucle, Adrien
    Bourge, Alban
    Petrot, Frederic
    Alemdar, Hande
    Caldwell, Nicholas
    Leroy, Vincent
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [40] Python accelerators for high-performance computing
    Ami Marowka
    The Journal of Supercomputing, 2018, 74 : 1449 - 1460