A High-Performance and Flexible FPGA Inference Accelerator for Decision Forests Based on Prior Feature Space Partitioning

Authors
Chu, Thiem Van [1]
Kitajima, Ryuichi [1]
Kawamura, Kazushi [1]
Yu, Jaehoon [1]
Motomura, Masato [1]
Affiliations
[1] Tokyo Inst Technol, Tokyo, Japan
DOI
10.1109/ICFPT52863.2021.9609699
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Recent studies have demonstrated the potential of FPGAs for accelerating the inference computation of decision forests (DFs). However, designing a high-performance architecture that is flexible enough to fit a wide range of FPGA resource budgets remains a challenge. To address this, we propose a DF inference method that replaces tree traversal with feature-space traversal. Specifically, as a preprocessing step, we partition each feature space into multiple regions based on thresholds. The inference task for an input data point is then conducted by (1) determining which region in each feature space the data point belongs to and (2) combining the inference information in these regions. The regularity of this computation allows us to design a DF inference architecture, called FT-DFP (Feature-space Traversing Decision Forest Processor), that can be flexibly configured for different performance and FPGA resource usage requirements. We prototype FT-DFP on a low-end FPGA (Artix-7) board and evaluate it using four real-world datasets. The evaluation results show that (1) the flexibility of FT-DFP allows us to fit a wide variety of DF models into low-end FPGA devices with limited resources; (2) FT-DFP's performance is comparable to the best of the existing accelerators implemented on high-end FPGA devices and 3.04x higher than that of Hummingbird, a state-of-the-art GPU-optimized implementation, running on a high-end GPU; and (3) FT-DFP is 130.96x more energy-efficient than Hummingbird.
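The two-step inference described in the abstract can be illustrated with a small software sketch. This is not the FT-DFP design: the abstract does not say how the per-region information is stored or combined, so the sketch assumes one common choice, a per-region bitmask of leaves that remain reachable, combined across features with a bitwise AND. The tree encoding, the `x[f] <= threshold` split convention, and all function names are hypothetical.

```python
from bisect import bisect_left

# Hypothetical tree encoding (an assumption, not taken from the paper).
def leaf(value):
    return ("leaf", value)

def node(feature, threshold, left, right):
    # Assumed convention: go left when x[feature] <= threshold.
    return ("node", feature, threshold, left, right)

def predict_by_traversal(forest, x):
    """Reference result: walk every tree and sum the reached leaf values."""
    total = 0.0
    for tree in forest:
        while tree[0] == "node":
            _, f, thr, left, right = tree
            tree = left if x[f] <= thr else right
        total += tree[1]
    return total

def build_region_tables(forest, num_features):
    """Preprocessing: partition each feature axis at the thresholds used on it
    and record, per region, a bitmask of the leaves compatible with it."""
    found = [set() for _ in range(num_features)]

    def collect(tree):
        if tree[0] == "node":
            found[tree[1]].add(tree[2])
            collect(tree[3])
            collect(tree[4])

    for tree in forest:
        collect(tree)
    thresholds = [sorted(s) for s in found]

    leaf_values, leaf_bounds = [], []

    def walk(tree, bounds):
        # bounds[f] = (lo, hi): inclusive range of region indices of feature f
        # that are consistent with the path from the root to this point.
        if tree[0] == "leaf":
            leaf_values.append(tree[1])
            leaf_bounds.append(list(bounds))
            return
        _, f, thr, left, right = tree
        k = bisect_left(thresholds[f], thr)  # region k ends exactly at thr
        lo, hi = bounds[f]
        bounds[f] = (lo, min(hi, k))         # x[f] <= thr  <=>  region <= k
        walk(left, bounds)
        bounds[f] = (max(lo, k + 1), hi)     # x[f] >  thr  <=>  region >  k
        walk(right, bounds)
        bounds[f] = (lo, hi)

    for tree in forest:
        walk(tree, [(0, len(t)) for t in thresholds])

    masks = [[0] * (len(t) + 1) for t in thresholds]
    for i, bounds in enumerate(leaf_bounds):
        for f, (lo, hi) in enumerate(bounds):
            for r in range(lo, hi + 1):
                masks[f][r] |= 1 << i
    return thresholds, masks, leaf_values

def predict_by_regions(tables, x):
    """Inference: (1) locate the region of x in each feature space,
    (2) combine the per-region leaf masks and sum the surviving leaves."""
    thresholds, masks, leaf_values = tables
    alive = (1 << len(leaf_values)) - 1
    for f in range(len(thresholds)):
        region = bisect_left(thresholds[f], x[f])
        alive &= masks[f][region]
    return sum(v for i, v in enumerate(leaf_values) if (alive >> i) & 1)

# A tiny two-tree forest over two features, used only for illustration.
example_forest = [
    node(0, 2.0, leaf(1.0), node(1, 5.0, leaf(2.0), leaf(3.0))),
    node(1, 3.0, node(0, 4.0, leaf(10.0), leaf(20.0)), leaf(30.0)),
]
```

In this sketch the per-sample work is one region lookup per feature followed by a fixed sequence of AND operations, independent of tree shape. That regularity is the property the abstract credits for the configurable architecture, although the actual hardware data layout may differ.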
Pages: 29-38
Number of pages: 10