A High Performance Parallel and Heterogeneous Approach to Narrowband Beamforming

被引:1
|
作者
Sarofeen, Christian [1 ]
Gillett, Philip [2 ]
机构
[1] Naval Surface Warfare Ctr, Computat Anal & Design, Carderock Div, West Bethesda, MD 20817 USA
[2] Naval Surface Warfare Ctr, Hydroacoust & Propulsor Dev, Carderock Div, West Bethesda, MD 20817 USA
关键词
Beamforming; delay-sum beamforming; distributed computing; heterogeneous computing; hybrid parallel programming; ALGORITHM; FPGA; CUDA;
D O I
10.1109/TPDS.2015.2494038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a high performing, hybrid parallel, and heterogeneous algorithmic approach to narrowband Delay-Sum Beamforming (DSB) in the frequency domain using a Just-In-Time Asynchronous Data Method (JIT-ADM) parallel pattern. JIT-ADM is a novel asynchronous parallel programming pattern that unifies various levels of asynchronous concurrency available with distributed heterogeneous computing. The computational performance of this DSB algorithm was analyzed on a 50 node Cray XC30 with a single 10-core Intel Xeon E5-2670 v2 and NVIDIA Tesla K20X general purpose Graphics Processing Unit (GPU) on each node. The algorithm exhibits well behaved weak scalability with 92.7 percent parallel efficiency at 50 nodes compared to maximum performance observed. It is also shown that the algorithm efficiently utilizes a large portion of the available hardware. During beamforming the GPU is utilized at 51.8 percent of its maximum double precision floating point throughput whereas a comparable Central Processing Unit (CPU) version utilizes 60.0 percent of its maximum expected floating point throughput. Across the weak scalability study, utilizing GPUs for processing, a 2-5x performance gain is achieved compared to using CPUs. A brief derivation and validation of the implemented DSB is also presented.
引用
收藏
页码:2196 / 2207
页数:12
相关论文
共 50 条
  • [21] High-performance and energy-efficient heterogeneous subword parallel instructions
    Kim, J
    Wills, DS
    SIPS 2003: IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS: DESIGN AND IMPLEMENTATION, 2003, : 75 - 80
  • [22] A language and programming environment for high-performance parallel computing on heterogeneous networks
    A. L. Lastovetsky
    A. Ya. Kalinov
    I. N. Ledovskikh
    D. M. Arapov
    M. A. Posypkin
    Programming and Computer Software, 2000, 26 : 216 - 236
  • [23] The RECIPE approach to challenges in deeply heterogeneous high performance systems
    Agosta, Giovanni
    Fornaciari, William
    Atienza, David
    Canal, Ramon
    Cilardo, Alessandro
    Flich Cardo, Jose
    Hernandez Luz, Carles
    Kulczewski, Michal
    Massari, Giuseppe
    Tornero Gavila, Rafael
    Zapater, Marina
    MICROPROCESSORS AND MICROSYSTEMS, 2020, 77
  • [24] High performance inorganic filterless narrowband photodetectors
    Fan, Xinye
    Chen, Yiren
    Zhang, Zhiwei
    Miao, Guoqing
    Jiang, Hong
    Song, Hang
    MATERIALS LETTERS, 2022, 328
  • [25] Performance analysis for 5G beamforming heterogeneous networks
    Xie, Yi
    Li, Bo
    Zuo, Xiaoya
    Yan, Zhongjiang
    Yang, Mao
    WIRELESS NETWORKS, 2020, 26 (01) : 463 - 477
  • [26] Robustness of adaptive narrowband beamforming with respect to bandwidth
    Oudin, Marc
    Delmas, Jean Pierre
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (04) : 1532 - 1538
  • [27] Spectral Bias in Adaptive Beamforming With Narrowband Interference
    Jeffs, Brian D.
    Warnick, Karl F.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (04) : 1373 - 1382
  • [28] Robustness of adaptive narrowband beamforming with respect to bandwidth
    Département CITI, GET/ Institut National des Télécommunications , Institut National des Télécommunications, 91011 Evry Cedex, France
    IEEE Trans Signal Process, 1600, 4 (1532-1538):
  • [29] Performance analysis for 5G beamforming heterogeneous networks
    Yi Xie
    Bo Li
    Xiaoya Zuo
    Zhongjiang Yan
    Mao Yang
    Wireless Networks, 2020, 26 : 463 - 477
  • [30] A high-performance narrowband high temperature superconducting filter
    CUI Bin
    Science Bulletin, 2010, (14) : 1367 - 1371