Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation

被引:9
|
作者
Jain, Abhishek Kumar [1 ]
Maskell, Douglas L. [2 ]
Fahmy, Suhaib A. [3 ]
机构
[1] Xilinx Inc, San Jose, CA 95124 USA
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] King Abdullah Univ Sci & Technol, Thuwal 23955, Saudi Arabia
关键词
Field programmable gate arrays; Kernel; Computer architecture; Hardware; Runtime; Performance evaluation; Throughput; parallel processing; hardware accelerators; HIGH-LEVEL SYNTHESIS; OPENCL; PERFORMANCE; PARALLEL; SCALE;
D O I
10.1109/TPDS.2021.3116859
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Coarse-grained FPGA overlays built around the runtime programmable DSP blocks in modern FPGAs can achieve high throughput and improved scalability compared to traditional overlays built without detailed consideration of FPGA architecture. These overlays can be mapped to using higher level compilers, achieving fast compilation, software-like programmability and run-time management, and high-level design abstraction. OpenCL allows programs running on a host computer to launch accelerator kernels which can be compiled at run-time for a specific architecture, thus enabling portability. However, prohibitive hardware compilation times in traditional design flows mean that the tools cannot effectively use just-in-time (JIT) compilation or runtime performance scaling on FPGAs. We present a methodology for runtime compilation of dataflow graphs expressed as OpenCL kernels onto coarse-grained overlays. The methodology benefits from the high level of abstraction afforded by using the OpenCL programming model, while the mapping to the overlay significantly reduces compilation and load times. Key characteristics of this work include highly performant DSP-optimized functional units that scale to large overlays on modern devices and the ability to perform automatic resource-aware kernel replication up to the size of the overlay. We demonstrate place and route times orders of magnitude better than traditional HLS flows, even when running on an embedded processor in the Xilinx Zynq.
引用
收藏
页码:1478 / 1490
页数:13
相关论文
共 50 条
  • [1] Dynamic FPGA routing for just-in-time FPGA compilation
    Lysecky, R
    Vahid, F
    Tan, SXD
    41ST DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2004, 2004, : 954 - 959
  • [2] A study of the scalability of on-chip routing for just-in-time FPGA compilation
    Lysecky, R
    Vahid, F
    Tan, SXD
    FCCM 2005: 13TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2005, : 57 - 62
  • [3] Just-in-Time Compilation for Verilog A New Technique for Improving the FPGA Programming Experience
    Schkufza, Eric
    Wei, Michael
    Rossbach, Christopher J.
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 271 - 286
  • [4] Quantum simulation with just-in-time compilation
    Efthymiou, Stavros
    Lazzarin, Marco
    Pasquale, Andrea
    Carrazza, Stefano
    QUANTUM, 2022, 6
  • [5] A Just-In-Time Modulo Scheduling for Virtual Coarse-Grained Reconfigurable Architectures
    Ferreira, Ricardo
    Duarte, Vinicius
    Meireles, Waldir
    Pereira, Monica
    Carro, Luigi
    Wong, Stephan
    2013 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION (IC-SAMOS), 2013, : 188 - 195
  • [6] Efficient Java']JavaVM just-in-time compilation
    Krall, A
    1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 205 - 212
  • [7] Just-in-time compilation for SQL query processing
    Viglas, Stratis D.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11): : 1190 - 1191
  • [8] Just-in-time compilation for SQL query processing
    Viglas, Stratis D.
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 1298 - 1301
  • [9] Remote Just-in-Time Compilation for Dynamic Languages
    Pecimuth, Andrej
    COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 1 - 3
  • [10] On Supporting Efficient Partial Reconfiguration with Just-In-Time Compilation
    Sidiropoulos, Harry
    Siozios, Kostas
    Figuli, Peter
    Soudris, Dimitrios
    Hubner, Michael
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 328 - 335