A High Performance Parallel and Heterogeneous Approach to Narrowband Beamforming

Cited by: 1
Authors
Sarofeen, Christian [1]
Gillett, Philip [2]
Affiliations
[1] Naval Surface Warfare Ctr, Computat Anal & Design, Carderock Div, West Bethesda, MD 20817 USA
[2] Naval Surface Warfare Ctr, Hydroacoust & Propulsor Dev, Carderock Div, West Bethesda, MD 20817 USA
Keywords
Beamforming; delay-sum beamforming; distributed computing; heterogeneous computing; hybrid parallel programming; ALGORITHM; FPGA; CUDA
DOI
10.1109/TPDS.2015.2494038
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper describes a high-performing, hybrid parallel, and heterogeneous algorithmic approach to narrowband Delay-Sum Beamforming (DSB) in the frequency domain using a Just-In-Time Asynchronous Data Method (JIT-ADM) parallel pattern. JIT-ADM is a novel asynchronous parallel programming pattern that unifies the various levels of asynchronous concurrency available in distributed heterogeneous computing. The computational performance of this DSB algorithm was analyzed on a 50-node Cray XC30 with a single 10-core Intel Xeon E5-2670 v2 and an NVIDIA Tesla K20X general-purpose Graphics Processing Unit (GPU) on each node. The algorithm exhibits well-behaved weak scalability, with 92.7 percent parallel efficiency at 50 nodes relative to the maximum performance observed. It is also shown that the algorithm efficiently utilizes a large portion of the available hardware. During beamforming, the GPU is utilized at 51.8 percent of its maximum double-precision floating-point throughput, whereas a comparable Central Processing Unit (CPU) version utilizes 60.0 percent of its maximum expected floating-point throughput. Across the weak scalability study, processing on GPUs yields a 2-5x performance gain over processing on CPUs. A brief derivation and validation of the implemented DSB is also presented.
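For readers unfamiliar with the technique named in the abstract, the following is a minimal single-threaded sketch of narrowband delay-sum beamforming in the frequency domain. It is not the paper's JIT-ADM implementation and involves no GPU or distributed execution. The uniform line array, the 1500 m/s sound speed, the 1 kHz frequency bin, the phase sign convention, and every function and variable name are assumptions chosen for illustration.

```python
import cmath
import math


def delays(positions_m, theta_rad, c_mps):
    """Plane-wave arrival delay (seconds) at each sensor of a line array.

    theta_rad is measured from broadside; positions lie along one axis.
    """
    return [x * math.sin(theta_rad) / c_mps for x in positions_m]


def simulate_snapshot(positions_m, theta_rad, f_hz, c_mps):
    """Noise-free values of one frequency bin for a unit-amplitude plane wave."""
    return [cmath.exp(-2j * math.pi * f_hz * tau)
            for tau in delays(positions_m, theta_rad, c_mps)]


def dsb_power(snapshot, positions_m, theta_rad, f_hz, c_mps):
    """Delay-sum beam power for one steering angle at one frequency.

    In the frequency domain a time delay becomes a phase shift, so each
    sensor value is phase-compensated for the steering angle, then averaged.
    """
    taus = delays(positions_m, theta_rad, c_mps)
    total = sum(x * cmath.exp(2j * math.pi * f_hz * tau)
                for x, tau in zip(snapshot, taus))
    return abs(total / len(snapshot)) ** 2


def scan(snapshot, positions_m, angles_deg, f_hz, c_mps):
    """Beam power for every steering angle in angles_deg."""
    return [dsb_power(snapshot, positions_m, math.radians(a), f_hz, c_mps)
            for a in angles_deg]


# Hypothetical setup: 16 sensors at half-wavelength spacing.
C = 1500.0                    # assumed sound speed in water, m/s
F = 1000.0                    # assumed narrowband frequency, Hz
M = 16                        # assumed number of sensors
D = C / F / 2.0               # half-wavelength spacing avoids grating lobes
POSITIONS = [m * D for m in range(M)]
ANGLES = list(range(-90, 91))  # steering angles in whole degrees

SNAP = simulate_snapshot(POSITIONS, math.radians(20.0), F, C)
POWERS = scan(SNAP, POSITIONS, ANGLES, F, C)
PEAK_ANGLE = ANGLES[POWERS.index(max(POWERS))]
print(PEAK_ANGLE)
```

Each steering angle and each frequency bin is computed independently of the others, which is the property that makes this kind of beamforming a natural fit for GPU and multi-node parallelism.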
Pages: 2196-2207
Page count: 12