A High-Performance and Flexible FPGA Inference Accelerator for Decision Forests Based on Prior Feature Space Partitioning

被引:0
|
作者
Chu, Thiem Van [1 ]
Kitajima, Ryuichi [1 ]
Kawamura, Kazushi [1 ]
Yu, Jaehoon [1 ]
Motomura, Masato [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
关键词
D O I
10.1109/ICFPT52863.2021.9609699
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recent studies have demonstrated the potential of FPGAs for accelerating the inference computation of decision forests (DFs). However, designing a high-performance architecture that is flexible enough to be adopted in various scenarios of FPGA resource requirements remains a challenge. To address this, we propose a DF inference method that makes a transformation from traversing trees into traversing feature spaces. Specifically, as a preprocessing step, we partition each feature space into multiple regions based on thresholds. The inference task for an input data point is then conducted by (1) determining which region in each feature space the data point belongs to and (2) combining the inference information in these regions. The regularity of the computation allows us to design a DF inference architecture, called FT-DFP (Feature-space Traversing Decision Forest Processor), that can be flexibly configured for different performance and FPGA resource usage requirements. We prototype FT-DFP on a low-end FPGA (Artix-7) board and evaluate it using four real-world datasets. The evaluation results show that (1) the flexibility of FT-DFP allows us to fit a wide variety of DF models into low-end FPGA devices with limited resources; (2) FT-DFP's performance is comparable to the best of existing accelerators implemented on high-end FPGA devices and 3.04x higher than Hummingbird, a state-of-the-art GPU-optimized implementation, running on a high-end GPU; and (3) FT-DFP is 130.96 x more energy-efficient than Hummingbird.
引用
收藏
页码:29 / 38
页数:10
相关论文
共 50 条
  • [31] High-Performance SIFT Hardware Accelerator for Real-Time Image Feature Extraction
    Huang, Feng-Cheng
    Huang, Shi-Yu
    Ker, Ji-Wei
    Chen, Yung-Chang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (03) : 340 - 351
  • [32] SPRINT: A High-Performance, Energy-Efficient, and Scalable Chiplet-Based Accelerator With Photonic Interconnects for CNN Inference
    Li, Yuan
    Louri, Ahmed
    Karanth, Avinash
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (10) : 2332 - 2345
  • [33] Generating High-Performance FPGA Accelerator Designs for Big Data Analytics with Fletcher and Apache Arrow
    Johan Peltenburg
    Jeroen van Straten
    Matthijs Brobbel
    Zaid Al-Ars
    H. Peter Hofstee
    Journal of Signal Processing Systems, 2021, 93 : 565 - 586
  • [34] Generating High-Performance FPGA Accelerator Designs for Big Data Analytics with Fletcher and Apache Arrow
    Peltenburg, Johan
    van Straten, Jeroen
    Brobbel, Matthijs
    Al-Ars, Zaid
    Hofstee, H. Peter
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (05): : 565 - 586
  • [35] A High Performance FPGA-Based Sorting Accelerator with a Data Compression Mechanism
    Kobayashi, Ryohei
    Kise, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (05) : 1003 - 1015
  • [36] ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA Chip
    Li, Kexin
    Xu, Shaoxian
    Shao, Zhiyuan
    Zheng, Ran
    Liao, Xiaofei
    Jin, Hai
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (02)
  • [37] Survey and design of paleozoic: a high-performance compiler tool chain for deep learning inference accelerator
    Liu, Zihan
    Leng, Jingwen
    Lu, Guandong
    Wang, Chenhui
    Chen, Quan
    Guo, Minyi
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2020, 2 (04) : 332 - 347
  • [38] Survey and design of paleozoic: a high-performance compiler tool chain for deep learning inference accelerator
    Zihan Liu
    Jingwen Leng
    Guandong Lu
    Chenhui Wang
    Quan Chen
    Minyi Guo
    CCF Transactions on High Performance Computing, 2020, 2 : 332 - 347
  • [39] A high-performance progressive radiosity method based on scene partitioning
    Guerra, AP
    Amor, M
    Padrón, EJ
    Doallo, R
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 537 - 548
  • [40] An FPGA-based high-performance wireless vibration analyzer
    Shahzad, Khurram
    Oelmann, Bengt
    2013 NORCHIP, 2013,