IMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators

Cited by: 2
Authors
Schumacher, Tobias [1]
Plessl, Christian [1]
Platzner, Marco [1]
Affiliations
[1] Univ Gesamthsch Paderborn, Paderborn Ctr Parallel Comp, D-33098 Paderborn, Germany
Keywords
Reconfigurable computing; kth nearest neighbor technique; FPGA; Framework
DOI
10.1016/j.micpro.2011.04.002
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The design, implementation and optimization of FPGA accelerators is a challenging task, especially when the accelerator comprises multiple compute cores distributed across CPU and FPGA resources and memories and exhibits data-dependent runtime behavior. In order to simplify the development of FPGA accelerators we propose IMORC, an infrastructure and architecture template that helps raise the level of abstraction. The IMORC development flow is based on a modeling technique for visualizing an application's communication demand and an architecture template that aids the developer in implementing the design. The architecture template consists of a versatile on-chip interconnect with asynchronous FIFOs and bitwidth conversion placed in the communication links, a performance monitoring infrastructure for collecting performance information at runtime, and a set of generic infrastructure cores that are frequently needed in accelerator designs. We demonstrate the usefulness of the IMORC development flow with a case study that accelerates the kth nearest neighbor thinning problem, where IMORC greatly helps us understand the communication demand and implement the application. With the integrated performance monitoring infrastructure, we gain insights into the data-dependent behavior of the accelerator that help us identify bottlenecks and optimize the accelerator to achieve a speedup of 10x to 40x over an optimized CPU implementation. © 2011 Elsevier B.V. All rights reserved.
Pages: 110-126
Number of pages: 17