Accelerating a random forest classifier: multi-core, GP-GPU, or FPGA?

被引:102
|
作者
Van Essen, Brian [1 ]
Macaraeg, Chris [1 ]
Gokhale, Maya [1 ]
Prenger, Ryan [1 ]
机构
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
关键词
FPGA; GP-GPU; OpenMP; Machine learning;
D O I
10.1109/FCCM.2012.47
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Random forest classification is a well known machine learning technique that generates classifiers in the form of an ensemble ("forest") of decision trees. The classification of an input sample is determined by the majority classification by the ensemble. Traditional random forest classifiers can be highly effective, but classification using a random forest is memory bound and not typically suitable for acceleration using FPGAs or GP-GPUs due to the need to traverse large, possibly irregular decision trees. Recent work at Lawrence Livermore National Laboratory has developed several variants of random forest classifiers, including the Compact Random Forest (CRF), that can generate decision trees more suitable for acceleration than traditional decision trees. Our paper compares and contrasts the effectiveness of FPGAs, GP-GPUs, and multi-core CPUs for accelerating classification using models generated by compact random forest machine learning classifiers. Taking advantage of training algorithms that can produce compact random forests composed of many, small trees rather than fewer, deep trees, we are able to regularize the forest such that the classification of any sample takes a deterministic amount of time. This optimization then allows us to execute the classifier in a pipelined or single-instruction multiple thread (SIMT) fashion. We show that FPGAs provide the highest performance solution, but require a multi-chip / multi-board system to execute even modest sized forests. GP-GPUs offer a more flexible solution with reasonably high performance that scales with forest size. Finally, multi-threading via OpenMP on a shared memory system was the simplest solution and provided near linear performance that scaled with core count, but was still significantly slower than the GP-GPU and FPGA.
引用
收藏
页码:232 / 239
页数:8
相关论文
共 50 条
  • [41] Design and implementation of FPGA verification platform for multi-core processor
    Chen, C. (hmioycc@gmail.com), 1600, Science Press (51):
  • [42] Multi-Core Architecture on FPGA for Large Dictionary String Matching
    Wang, Qingbo
    Prasanna, Viktor K.
    PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 96 - 103
  • [43] Price-Performance Aspects of Accelerating the FDTD Method Using the Vector Processing Programming Paradigm on GPU and Multi-Core Clusters
    Ilgner, Robert G.
    Davidson, David B.
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2014, 29 (05): : 351 - 360
  • [44] Performance Evaluation of gcForest inferencing on multi-core CPU and FPGA
    Manavar, Piyush
    Mukhekar, Sharyu
    Nambiar, Manoj
    SECOND INTERNATIONAL CONFERENCE ON AIML SYSTEMS 2022, 2022,
  • [45] Heterogeneous Multi-Core System, synchronized by a Petri Processor on FPGA
    Pereyra, M.
    Gallia, N.
    Alasia, M.
    Micolini, O.
    IEEE LATIN AMERICA TRANSACTIONS, 2013, 11 (01) : 218 - 223
  • [46] An FPGA-Based Experiment Platform for Multi-Core System
    Xing, Jianguo
    Zhao, Wenmin
    Hu, Hua
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 2567 - 2571
  • [47] PARALLEL FPGA TECHNOLOGY MAPPING USING MULTI-CORE ARCHITECTURES
    Kennings, Andrew
    Ravishankar, Chirag
    2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 274 - 279
  • [48] GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
    Jayaraj, P. B.
    Ajay, Mathias K.
    Nufail, M.
    Gopakumar, G.
    Jaleel, U. C. A.
    JOURNAL OF CHEMINFORMATICS, 2016, 8
  • [49] GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
    P. B. Jayaraj
    Mathias K. Ajay
    M. Nufail
    G. Gopakumar
    U. C. A. Jaleel
    Journal of Cheminformatics, 8
  • [50] A study of scheduling problems with preemptions on multi-core computers with GPU accelerators
    Blazewicz, Jacek
    Kedad-Sidhoum, Safia
    Monna, Florence
    Mounie, Gregory
    Trystram, Denis
    DISCRETE APPLIED MATHEMATICS, 2015, 196 : 72 - 82