Parallel Algorithm for Connected-Component Analysis Using CUDA

Cited by: 3
Authors
Windisch, Dominic [1]
Kaever, Christian [2]
Juckeland, Guido [2]
Bieberle, Andre [2]
Affiliations
[1] Tech Univ Dresden, Inst Power Engn, D-01062 Dresden, Germany
[2] Helmholtz Zent Dresden Rossendorf, Bautzner Landstr 400, D-01328 Dresden, Germany
Keywords
connected-component analysis; image stream processing; parallel computing; CUDA; RAY COMPUTED-TOMOGRAPHY
DOI
10.3390/a16020080
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, we introduce a parallel algorithm for connected-component analysis (CCA) on GPUs which drastically reduces the volume of data transferred from the GPU to the host. CCA algorithms targeting GPUs typically store the extracted features in arrays large enough to hold the maximum possible number of objects for the given image size. Transferring these large arrays to the host takes up a large portion of the overall execution time. Therefore, we propose an algorithm which uses a CUDA kernel to merge trees of connected-component feature structs. During the tree merging, various connected-component properties, such as total area, centroid and bounding box, are extracted and accumulated. The tree structure then enables us to transfer only the features of valid objects to the host for further processing or storage. Our benchmarks show that this implementation significantly reduces the memory transfer volume for processing results on the host whilst maintaining performance similar to state-of-the-art CCA algorithms.
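The tree-merging idea in the abstract can be illustrated with a small sequential sketch. This is not the authors' CUDA kernel: it is a host-side C++ illustration under stated assumptions, namely a row-major binary image, 4-connectivity, and a hypothetical `Feature` struct whose field names are invented here. Each foreground pixel starts as a one-node tree; merging two trees folds the area, coordinate sums (for the centroid) and bounding box of one root into the other, and only the surviving roots are collected at the end, which mirrors transferring only valid objects to the host.

```cpp
#include <algorithm>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical feature record; names are illustrative, not from the paper.
struct Feature {
    int parent;                  // parent index; a root points to itself, background is -1
    std::uint32_t area;          // number of pixels accumulated in this tree
    std::uint64_t sum_x, sum_y;  // coordinate sums; centroid = sum / area
    int min_x, min_y, max_x, max_y;  // bounding box
};

// Follow parent links until the root of the tree containing node i is reached.
int find_root(const std::vector<Feature>& f, int i) {
    while (f[i].parent != i) i = f[i].parent;
    return i;
}

// Merge the trees containing a and b; the root with the smaller index survives
// and accumulates the features of the other root.
void merge(std::vector<Feature>& f, int a, int b) {
    int ra = find_root(f, a);
    int rb = find_root(f, b);
    if (ra == rb) return;
    if (rb < ra) std::swap(ra, rb);
    Feature& dst = f[ra];
    Feature& src = f[rb];
    dst.area += src.area;
    dst.sum_x += src.sum_x;
    dst.sum_y += src.sum_y;
    dst.min_x = std::min(dst.min_x, src.min_x);
    dst.min_y = std::min(dst.min_y, src.min_y);
    dst.max_x = std::max(dst.max_x, src.max_x);
    dst.max_y = std::max(dst.max_y, src.max_y);
    src.parent = ra;
}

// Build one-pixel trees, merge along left and upper neighbours (4-connectivity),
// and return only the root records, i.e. one Feature per connected component.
std::vector<Feature> extract_components(const std::vector<std::uint8_t>& img,
                                        int width, int height) {
    std::vector<Feature> f(img.size());
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            const int i = y * width + x;
            Feature& p = f[i];
            p.parent = img[i] ? i : -1;
            p.area = img[i] ? 1u : 0u;
            p.sum_x = static_cast<std::uint64_t>(x);
            p.sum_y = static_cast<std::uint64_t>(y);
            p.min_x = p.max_x = x;
            p.min_y = p.max_y = y;
        }
    }
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            const int i = y * width + x;
            if (!img[i]) continue;
            if (x > 0 && img[i - 1]) merge(f, i, i - 1);
            if (y > 0 && img[i - width]) merge(f, i, i - width);
        }
    }
    std::vector<Feature> roots;  // compact result: only valid objects
    for (int i = 0; i < static_cast<int>(f.size()); ++i) {
        if (f[i].parent == i) roots.push_back(f[i]);
    }
    return roots;
}
```

Calling `extract_components` on an image yields one record per component, so the output size scales with the number of objects found rather than with the image size. A GPU version would need atomic operations or a comparable scheme to make the merge step safe under concurrency, which this sketch does not attempt.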
Pages: 11