Darknet on OpenCL: A multiplatform tool for object detection and classification

被引:4
|
作者
Sowa, Piotr [1 ]
Izydorczyk, Jacek [2 ]
机构
[1] Self Employed Ltd Co, iSowaio Piotr Sowa, Wieliczka, Poland
[2] Silesian Tech Univ, Dept Automat Control Elect & Comp Sci, Gliwice, Poland
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2022年 / 34卷 / 15期
关键词
computational efficiency; neural networks; programming; DEVICES; LIMITS;
D O I
10.1002/cpe.6936
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The goal of this article is to overview the challenges and problems on the way from the state-of-the-art CUDA accelerated neural network code to multi-GPU code. For this purpose, the authors describe the journey of porting that existing in GitHub, a fully featured CUDA-accelerated Darknet engine, to OpenCL. This article presents the lessons learned and the techniques that were put in place for this porting. There are few other implementations on GitHub that leverage the OpenCL standard, and a few have tried to port Darknet as well. Darknet is a well-known convolutional neural network (CNN) framework. The authors of this article investigated all aspects of porting and achieved a fully featured Darknet engine on OpenCL. The effort was focused not only on classification using YOLO1, YOLO2, YOLO3, and YOLO4 CNN models. Other aspects were also covered, such as training neural networks and benchmarks to identify weak points in the implementation. Compared with the standard CPU version, the GPU computing code substantially improves the Darknet computing time by using underutilized hardware in existing systems. If the system is OpenCL-based, it is practically hardware-independent. The authors also improved the CUDA version as Darknet-vNext.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Heterogeneous System Implementation of Deep Learning Neural Network for Object Detection in OpenCL Framework
    Li, Shuai
    Luo, Yukui
    Sun, Kuangyuan
    Choi, Ken
    2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 456 - 459
  • [22] ATTENTIVE LAYER SEPARATION FOR OBJECT CLASSIFICATION AND OBJECT LOCALIZATION IN OBJECT DETECTION
    Kim, Jung Uk
    Ro, Yong Man
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3995 - 3999
  • [23] Video Index Point Detection and Extraction Framework Using Custom YoloV4 Darknet Object Detection Model
    Mahrishi, Mehul
    Morwal, Sudha
    Muzaffar, Abdul Wahab
    Bhatia, Surbhi
    Dadheech, Pankaj
    Rahmani, Mohammad Khalid Imam
    IEEE ACCESS, 2021, 9 : 143378 - 143391
  • [24] A Neuromorphic System for Object Detection and Classification
    Khosla, Deepak
    Chen, Yang
    Kim, Kyungnam
    Cheng, Shinko Y.
    Honda, Alexander L.
    Zhang, Lei
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745
  • [25] Object detection for hierarchical image classification
    Khan, L
    Wang, L
    MINING MULTIMEDIA AND COMPLEX DATA, 2003, 2797 : 36 - 49
  • [26] Hidden Object Detection for Classification of Threat
    Gautam, K. S.
    Thangavel, Senthil Kumar
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2017,
  • [27] Object Detection and Classification in Surveillance System
    Varma, Soumya
    Sreeraj, M.
    2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, : 299 - 303
  • [28] A review on image object classification and detection
    Huang, Kai-Qi
    Ren, Wei-Qiang
    Tan, Tie-Niu
    Huang, K.-Q. (kqhuang@nlpr.ia.ac.cn), 1600, Science Press (37): : 1225 - 1240
  • [29] Object Detection and Classification on Heterogeneous Datasets
    Brosch, Tobias
    Elshaarany, Ahmed
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 307 - 312
  • [30] Object Detection and Classification for Domestic Robots
    Vincze, Markus
    Wohlkinger, Walter
    Olufs, Sven
    Einramhof, Peter
    Schwarz, Robert
    Varadarajan, Karthik
    LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION, AND VALIDATION, 2012, 336 : 106 - 120