Efficient GPU-accelerated parallel cross-correlation

被引:0
|
作者
Madera, Karel [1 ]
Smelko, Adam [1 ]
Krulis, Martin [1 ]
机构
[1] Charles Univ Prague, Dept Distributed & Dependable Syst, Prague, Czech Republic
关键词
Cross-correlation; GPU; CUDA; Parallel; Algorithm; Caching; Optimizations;
D O I
10.1016/j.jpdc.2025.105054
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cross-correlation is a data analysis method widely employed in various signal processing and similarity-search applications. Our objective is to design a highly optimized GPU-accelerated implementation that will speed up the applications and also improve energy efficiency since GPUs are more efficient than CPUs in data-parallel tasks. There are two rudimentary ways to compute cross-correlation - a definition-based algorithm that tries all possible overlaps and an algorithm based on the Fourier transform, which is much more complex but has better asymptotical time complexity. We have focused mainly on the definition-based approach which is better suited for smaller input data and we have implemented multiple CUDA-enabled algorithms with multiple optimization options. The algorithms were evaluated on various scenarios, including the most typical types of multi-signal correlations, and we provide empirically verified optimal solutions for each of the studied scenarios.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] GPU-accelerated MART and concurrent cross-correlation for tomographic PIV
    Xin Zeng
    Chuangxin He
    Yingzheng Liu
    Experiments in Fluids, 2022, 63
  • [2] GPU-accelerated MART and concurrent cross-correlation for tomographic PIV
    Zeng, Xin
    He, Chuangxin
    Liu, Yingzheng
    EXPERIMENTS IN FLUIDS, 2022, 63 (05)
  • [3] GPU-accelerated parallel optimization for sparse regularization
    Wang, Xingran
    Liu, Tianyi
    Minh Trinh-Hoang
    Pesavento, Marius
    2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
  • [4] GPU-accelerated parallel algorithms for linear rankSVM
    Jing Jin
    Xianggao Cai
    Guoming Lai
    Xiaola Lin
    The Journal of Supercomputing, 2015, 71 : 4141 - 4171
  • [5] GPU-accelerated parallel algorithms for linear rankSVM
    Jin, Jing
    Cai, Xianggao
    Lai, Guoming
    Lin, Xiaola
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (11): : 4141 - 4171
  • [6] An efficient fine-grained parallel genetic algorithm based on GPU-accelerated
    Li, Jian-Ming
    Wang, Xiao-Jing
    He, Rong-Sheng
    Chi, Zhong-Xian
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 855 - +
  • [7] Efficient Intranode Communication in GPU-Accelerated Systems
    Ji, Feng
    Aji, Ashwin M.
    Dinan, James
    Buntinas, Darius
    Balaji, Pavan
    Feng, Wu-chun
    Ma, Xiaosong
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1838 - 1847
  • [8] GPU-Accelerated Parallel FDTD on Distributed Heterogeneous Platform
    Jiang, Ronglin
    Jiang, Shugang
    Zhang, Yu
    Xu, Ying
    Xu, Lei
    Zhang, Dandan
    INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2014, 2014
  • [9] A GPU-accelerated parallel K-means algorithm
    Cuomo, S.
    De Angelis, V.
    Farina, G.
    Marcellino, L.
    Toraldo, G.
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 75 : 262 - 274
  • [10] GPU-Accelerated Microdosimetry
    Decunha, J.
    Mohan, R.
    MEDICAL PHYSICS, 2022, 49 (06) : E467 - E468