An OpenCL-Based FPGA Accelerator for Faster R-CNN

被引:5
|
作者
An, Jianjing [1 ,2 ]
Zhang, Dezheng [1 ,2 ]
Xu, Ke [1 ,2 ]
Wang, Dong [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
基金
北京市自然科学基金;
关键词
convolutional neural network; Faster R-CNN; FPGA; hardware accelerator;
D O I
10.3390/e24101346
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In recent years, convolutional neural network (CNN)-based object detection algorithms have made breakthroughs, and much of the research corresponds to hardware accelerator designs. Although many previous works have proposed efficient FPGA designs for one-stage detectors such as Yolo, there are still few accelerator designs for faster regions with CNN features (Faster R-CNN) algorithms. Moreover, CNN's inherently high computational complexity and high memory complexity bring challenges to the design of efficient accelerators. This paper proposes a software-hardware co-design scheme based on OpenCL to implement a Faster R-CNN object detection algorithm on FPGA. First, we design an efficient, deep pipelined FPGA hardware accelerator that can implement Faster R-CNN algorithms for different backbone networks. Then, an optimized hardware-aware software algorithm was proposed, including fixed-point quantization, layer fusion, and a multi-batch Regions of interest (RoIs) detector. Finally, we present an end-to-end design space exploration scheme to comprehensively evaluate the performance and resource utilization of the proposed accelerator. Experimental results show that the proposed design achieves a peak throughput of 846.9 GOP/s at the working frequency of 172 MHz. Compared with the state-of-the-art Faster R-CNN accelerator and the one-stage YOLO accelerator, our method achieves 10 x and 2.1 x inference throughput improvements, respectively.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Face Detection with the Faster R-CNN
    Jiang, Huaizu
    Learned-Miller, Erik
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 650 - 657
  • [42] Geometric Proposals for Faster R-CNN
    Amin, Sikandar
    Galasso, Fabio
    2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017,
  • [43] Domain Generalised Faster R-CNN
    Seemakurthy, Karthik
    Fox, Charles
    Aptoula, Erchan
    Bosilj, Petra
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2180 - 2190
  • [44] Optimizing OpenCL-Based CNN Design on FPGA with Comprehensive Design Space Exploration and Collaborative Performance Modeling
    Mu, Jiandong
    Zhang, Wei
    Liang, Hao
    Sinha, Sharad
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2020, 13 (03)
  • [45] Ganster R-CNN: Occluded Object Detection Network Based on Generative Adversarial Nets and Faster R-CNN
    Sun, Kelei
    Wen, Qiufen
    Zhou, Huaping
    IEEE ACCESS, 2022, 10 : 105022 - 105030
  • [46] Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks
    Suda, Naveen
    Chandra, Vikas
    Dasika, Ganesh
    Mohanty, Abinash
    Ma, Yufei
    Vrudhula, Sarma
    Seo, Jae-Sun
    Cao, Yu
    PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 16 - 25
  • [47] Face Detection With Different Scales Based on Faster R-CNN
    Wu, Wenqi
    Yin, Yingjie
    Wang, Xingang
    Xu, De
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (11) : 4017 - 4028
  • [48] Aerial Target Detection Based on Improved Faster R-CNN
    Feng Xiaoyu
    Mei Wei
    Hu Dashuai
    ACTA OPTICA SINICA, 2018, 38 (06)
  • [49] Faster R-CNN Based Autonomous Navigation for Vehicles in Warehouse
    Sun, Yiyou
    Su, Tonghua
    Tu, Zhiying
    2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2017, : 1639 - 1644
  • [50] Research on railway fastener positioning based on Faster R-CNN
    Bai, Tangbo
    Yang, Jianwei
    Xu, Guiyang
    Qiu, Shi
    Qiu, Shi (sheldon.qiu@csu.edu.cn), 1600, Central South University Press (18): : 502 - 508