Optimized Real-Time MUSIC Algorithm With CPU-GPU Architecture

被引:2
|
作者
Huang, Qinghua [1 ]
Lu, Naida [1 ]
机构
[1] Shanghai Univ, Key Lab Specialty Fiber Opt & Opt Access Networks, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiple signal classification; Signal processing algorithms; Graphics processing units; Sensors; Sensor arrays; Computer architecture; Estimation; Direction-of-arrival (DOA) estimation; uniform planar arrays (UPA); high-resolution; real-time; CPU-GPU architecture; DOA ESTIMATION; ESPRIT;
D O I
10.1109/ACCESS.2021.3070980
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Direction-of-arrival (DOA) estimation algorithm for uniform planar arrays has been applied in many fields. The multiple signal classification (MUSIC) algorithm has obvious advantage in high-resolution signal source estimation scenarios. However, the MUSIC algorithm has high computational costs, therefore it is hard to be used in real-time scenes. Many studies are dedicated to accelerating MUSIC algorithm by parallel hardware, especially by Graphics Processing Units (GPU). MUSIC algorithm based on Central Processing Unit (CPU) -GPU architecture acceleration is rarely investigated in previous literatures, and how well MUSIC Algorithm with CPU-GPU architecture could perform remains unknown. In this paper, we present and evaluate a model of search parallel MUSIC algorithm with CPU-GPU architecture. In the proposed model, the steering vector of each candidate incident signal and the corresponding value of 2D spatial pseudo-spectrum (SPS) function are sequentially calculated in a single core of the GPU, and the subsequent calculation of each elevation or azimuth is parallel in batches. Furthermore, in order to improve the peak search speed, we propose a new Coarse and Fine Traversal (CFT) peak search algorithm via CPU and a new parallel peak search algorithm based on GPU acceleration. Across strategy comparison, utilizing CPU-GPU architecture for processing, a 150-160x performance gain is achieved compared to using CPU only. Besides, the resolution of uniform planar arrays is also analyzed.
引用
收藏
页码:54067 / 54077
页数:11
相关论文
共 50 条
  • [41] Parabolic Radon transform parallel algorithm for CPU-GPU heterogeneous platform
    Zhang Q.
    Lin B.
    Yang B.
    Peng B.
    Zhang W.
    Tu R.
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (06): : 1263 - 1270
  • [42] An improved smith-waterman algorithm on heterogeneous CPU-GPU Systems
    Yin, Meng Jia
    Xu, Xianbin
    Xiong, Zenggang
    Zhang, Tao
    Zheng, Fang
    International Journal of Applied Mathematics and Statistics, 2013, 50 (20): : 499 - 507
  • [43] Parallel Implementation of Sieving Algorithm on Heterogeneous CPU-GPU Computing Architectures
    Wu, Mengsi
    Li, Pei
    Chen, Jiageng
    Yao, Shixiong
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2024, 2025, 15053 : 258 - 272
  • [44] A Peta-scalable CPU-GPU Algorithm for Global Atmospheric Simulations
    Yang, Chao
    Xue, Wei
    Fu, Haohuan
    Gan, Lin
    Li, Linfeng
    Xu, Yangtong
    Lu, Yutong
    Sun, Jiachang
    Yang, Guangwen
    Zheng, Weimin
    ACM SIGPLAN NOTICES, 2013, 48 (08) : 1 - 11
  • [45] Architecture for Fast Object Detection Supporting CPU-GPU Hybrid and Distributed Computing
    Bae, Yuseok
    Park, Jongyoul
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
  • [46] Column-Stored System Join Optimization on Coupled CPU-GPU Architecture
    Ding, Xiangwu
    Li, Zitong
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 184 - 191
  • [47] Using the CPU and GPU for Real-Time Video Enhancement on a Mobile Computer
    Bachoo, Asheer
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 405 - 408
  • [48] Real-Time ROS Extension on Transparent CPU/GPU Coordination Mechanism
    Suzuki, Yuhei
    Azumi, Takuya
    Kato, Shinpei
    Nishio, Nobuhiko
    2018 IEEE 21ST INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2018), 2018, : 184 - 192
  • [49] Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture
    He, Jiong
    Lu, Mian
    He, Bingsheng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (10): : 889 - 900
  • [50] Performance Optimization by Dynamically Altering Cache Replacement Algorithm in CPU-GPU Heterogeneous Multi-Core Architecture
    Fang, Juan
    Fan, Qingwen
    Hao, Xiaoting
    Cheng, Yanjin
    Sun, Lijun
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 723 - +