In-FPGA Instrumentation Framework for OpenCL-Based Designs

被引:2
|
作者
Bensalem, Hachem [1 ]
Blaquiere, Yves [1 ]
Savaria, Yvon [2 ]
机构
[1] Ecole Technol Super, Dept Elect Engn, Montreal, PQ H3C 1K3, Canada
[2] Polytech Montreal, Dept Elect Engn, Montreal, PQ H3T 1J4, Canada
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
加拿大自然科学与工程研究理事会;
关键词
Field programmable gate arrays; Instruments; Tools; Kernel; Debugging; Hardware; Benchmark testing; OpenCL; FPGA; instrumentation; high-performance reconfigurable computing; HLS; timing performance;
D O I
10.1109/ACCESS.2020.3040081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The productivity achieved when developing applications on high-performance reconfigurable heterogeneous computing (HPRHC) systems is increased by using the Open Computing Language (OpenCL). However, the hardware produced by OpenCL compilers in field-programmable gate arrays (FPGAs) can result in severe performance bottlenecks that are challenging to solve. The problem is compounded by the fact that the generated netlist details are disorganized, making them mostly unreadable and only partially visible to designers. This paper proposes an in-FPGA instrumentation method and a new framework for extracting the FPGA-cycle-accurate timing performances of OpenCL-based designs. The results clearly show that the chosen execution model for OpenCL-based designs strongly affects the timing performance when it is not properly implemented. Our framework is implemented on an HPRHC platform that contains a CPU and two Arria10 FPGAs, and it is evaluated with a wide variety of benchmarks with different complexities. After testing on the reported benchmarks, the average logic overhead for one inserted instrument is 0.2 % of the total amount of adaptive look-up tables (ALUTs) and 0.1 % of the total registers in an FPGA. This resource utilization is between 1.5 and six times lower than those reported in the best previously published works. The scalability of the framework is also evaluated by inserting up to 50 instruments. The experimental results show that the average logic utilization per instrument is 0.19 % of the ALUTs and 0.17 % of the registers in the FPGA when 50 instruments are inserted.
引用
收藏
页码:212979 / 212994
页数:16
相关论文
共 50 条
  • [41] Optimizing OpenCL-Based CNN Design on FPGA with Comprehensive Design Space Exploration and Collaborative Performance Modeling
    Mu, Jiandong
    Zhang, Wei
    Liang, Hao
    Sinha, Sharad
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2020, 13 (03)
  • [42] Efficient OpenCL-based concurrent tasks offloading on accelerators
    Lazaro-Munoz, A. J.
    Gonzalez-Linares, J. M.
    Gomez-Luna, J.
    Guil, N.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2353 - 2357
  • [43] RPkNN: An OpenCL-Based FPGA Implementation of the Dimensionality-Reduced kNN Algorithm Using Random Projection
    Tavakoli, Erfan Bank
    Beygi, Amir
    Yao, Xuebin
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (04) : 549 - 552
  • [44] Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks
    Suda, Naveen
    Chandra, Vikas
    Dasika, Ganesh
    Mohanty, Abinash
    Ma, Yufei
    Vrudhula, Sarma
    Seo, Jae-Sun
    Cao, Yu
    PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 16 - 25
  • [45] Multikernel Data Partitioning With Channel on OpenCL-Based FPGAs
    Wang, Zeke
    Paul, Johns
    He, Bingsheng
    Zhang, Wei
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (06) : 1906 - 1918
  • [46] Improving Data Partitioning Performance on OpenCL-based FPGAs
    Wang, Zeke
    He, Bingsheng
    Zhang, Wei
    2015 IEEE 23RD ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2015, : 34 - 34
  • [47] OpenCL-based acceleration of the FDTD method in computational electromagnetics
    Stefanski, Tomasz
    Benkler, Stefan
    Chavannes, Nicolas
    Kuster, Niels
    INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2013, 26 (04) : 355 - 365
  • [48] Comprehensive Evaluation of OpenCL-Based CNN Implementations for FPGAs
    Tapiador-Morales, Ricardo
    Rios-Navarro, Antonio
    Linares-Barranco, Alejandro
    Kim, Minkyu
    Kadetotad, Deepak
    Seo, Jae-sun
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT II, 2017, 10306 : 271 - 282
  • [49] Query Processing on OpenCL-based FPGAs: Challenges and Opportunities
    Paul, Johns
    He, Bingsheng
    Lau, Chiew Tong
    2018 IEEE 24TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2018), 2018, : 937 - 945
  • [50] Improved OpenCL-Based Implementation of Social Field Pedestrian Model
    Yu, Bin
    Zhu, Ke
    Wu, Kaiteng
    Zhang, Michael
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (07) : 2828 - 2839