Darknet on OpenCL: A multiplatform tool for object detection and classification

被引:4
|
作者
Sowa, Piotr [1 ]
Izydorczyk, Jacek [2 ]
机构
[1] Self Employed Ltd Co, iSowaio Piotr Sowa, Wieliczka, Poland
[2] Silesian Tech Univ, Dept Automat Control Elect & Comp Sci, Gliwice, Poland
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2022年 / 34卷 / 15期
关键词
computational efficiency; neural networks; programming; DEVICES; LIMITS;
D O I
10.1002/cpe.6936
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The goal of this article is to overview the challenges and problems on the way from the state-of-the-art CUDA accelerated neural network code to multi-GPU code. For this purpose, the authors describe the journey of porting that existing in GitHub, a fully featured CUDA-accelerated Darknet engine, to OpenCL. This article presents the lessons learned and the techniques that were put in place for this porting. There are few other implementations on GitHub that leverage the OpenCL standard, and a few have tried to port Darknet as well. Darknet is a well-known convolutional neural network (CNN) framework. The authors of this article investigated all aspects of porting and achieved a fully featured Darknet engine on OpenCL. The effort was focused not only on classification using YOLO1, YOLO2, YOLO3, and YOLO4 CNN models. Other aspects were also covered, such as training neural networks and benchmarks to identify weak points in the implementation. Compared with the standard CPU version, the GPU computing code substantially improves the Darknet computing time by using underutilized hardware in existing systems. If the system is OpenCL-based, it is practically hardware-independent. The authors also improved the CUDA version as Darknet-vNext.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Quadratic filters for object classification and detection
    Weber, D
    Casasent, D
    OPTICAL PATTERN RECOGNITION VIII, 1997, 3073 : 2 - 13
  • [32] Object Classification from Shape Detection
    Nagpal, Pragya
    Mittal, Ankush
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 2, 2020, 1057 : 581 - 596
  • [33] Active-Darknet: An Iterative Learning Approach for Darknet Traffic Detection and Categorization
    Abbas, Sidra
    Bouazzi, Imen
    Sampedro, Gabriel Avelino
    Alsubai, Shtwai
    Almadhor, Ahmad S.
    Al Hejaili, Abdullah
    Kryvinska, Natalia
    IEEE ACCESS, 2024, 12 : 151987 - 151997
  • [34] ULTRASONIC SENSOR SYSTEMS FOR INDUSTRIAL OBJECT DETECTION AND OBJECT CLASSIFICATION
    BIEHL, KE
    MULLERGRONAU, W
    TECHNISCHES MESSEN, 1988, 55 (10): : 367 - 376
  • [35] Object Detection Framework for Multiclass Food Object Localization and Classification
    Razali, Mohd Norhisham
    Manshor, Noridayu
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1357 - 1361
  • [36] Ultrasonic sensor systems for industrial object detection and object classification
    Biehl, Karl-Ernst
    Mueller-Gronau, Wolfhardt
    TM. Technisches Messen, 1988, 55 (10) : 367 - 376
  • [37] Edge-Based Detection and Classification of Malicious Contents in Tor Darknet Using Machine Learning
    Li, Runchuan
    Chen, Shuhong
    Yang, Jiawei
    Luo, Entao
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [38] OCLoptimizer: an iterative optimization tool for OpenCL
    Fabeiro, Jorge F.
    Andrade, Diego
    Fraguela, Basilio B.
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1322 - 1331
  • [39] DetectionEvaluationJ: A Tool to Evaluate Object Detection Algorithms
    Dominguez, C.
    Garcia, M.
    Heras, J.
    Ines, A.
    Mata, E.
    Pascual, V.
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2017, PT II, 2018, 10672 : 273 - 280
  • [40] Development tool goes multisite, multiplatform, and multiprocessor
    Cravotta, R
    EDN, 2001, 46 (15) : 28 - 28