Optimized Real-Time MUSIC Algorithm With CPU-GPU Architecture

Cited by: 2
Authors
Huang, Qinghua [1]
Lu, Naida [1]
Affiliations
[1] Shanghai Univ, Key Lab Specialty Fiber Opt & Opt Access Networks, Shanghai 200444, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Multiple signal classification; Signal processing algorithms; Graphics processing units; Sensors; Sensor arrays; Computer architecture; Estimation; Direction-of-arrival (DOA) estimation; uniform planar arrays (UPA); high-resolution; real-time; CPU-GPU architecture; DOA ESTIMATION; ESPRIT;
DOI
10.1109/ACCESS.2021.3070980
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Direction-of-arrival (DOA) estimation algorithms for uniform planar arrays (UPA) have been applied in many fields. The multiple signal classification (MUSIC) algorithm has a clear advantage in high-resolution signal source estimation. However, its high computational cost makes it hard to use in real-time scenarios. Many studies have been dedicated to accelerating the MUSIC algorithm with parallel hardware, especially Graphics Processing Units (GPUs). Acceleration of the MUSIC algorithm on a combined Central Processing Unit (CPU)-GPU architecture has rarely been investigated in the literature, and how well the MUSIC algorithm could perform on a CPU-GPU architecture remains unknown. In this paper, we present and evaluate a search-parallel model of the MUSIC algorithm with a CPU-GPU architecture. In the proposed model, the steering vector of each candidate incident signal and the corresponding value of the 2D spatial pseudo-spectrum (SPS) function are calculated sequentially in a single GPU core, and the subsequent calculation for each elevation or azimuth is parallelized in batches. Furthermore, to improve the peak search speed, we propose a new Coarse and Fine Traversal (CFT) peak search algorithm on the CPU and a new parallel peak search algorithm based on GPU acceleration. In a comparison of strategies, processing with the CPU-GPU architecture achieves a 150-160x performance gain over using the CPU only. In addition, the resolution of uniform planar arrays is analyzed.
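The abstract names a Coarse and Fine Traversal (CFT) peak search but does not spell out its steps. The Python sketch below shows one generic way a coarse-then-fine grid search over a 2D pseudo-spectrum could work; it is an illustration under stated assumptions, not the authors' algorithm. The function names `coarse_fine_peak` and `toy_spectrum`, the step sizes, and the single-peak toy spectrum are all hypothetical and chosen only for the example.

```python
def coarse_fine_peak(f, az_range, el_range, coarse_step, fine_step):
    """Find the largest sample of f(azimuth, elevation) in two passes.

    Pass 1 scans the whole search range on a coarse grid. Pass 2 rescans,
    on a fine grid, only the neighbourhood of the coarse maximum.
    This locates a single peak; multiple sources would need one fine
    pass per coarse local maximum (not shown here).
    """
    def scan(az_lo, az_hi, el_lo, el_hi, step):
        # Exhaustive grid scan of one rectangular region.
        n_az = int(round((az_hi - az_lo) / step))
        n_el = int(round((el_hi - el_lo) / step))
        best = (az_lo, el_lo, f(az_lo, el_lo))
        for i in range(n_az + 1):
            az = az_lo + i * step
            for j in range(n_el + 1):
                el = el_lo + j * step
                value = f(az, el)
                if value > best[2]:
                    best = (az, el, value)
        return best

    # Pass 1: coarse traversal of the full range.
    az0, el0, _ = scan(az_range[0], az_range[1],
                       el_range[0], el_range[1], coarse_step)

    # Pass 2: fine traversal of one coarse cell on each side of the
    # coarse maximum, clipped to the original search range.
    az_lo = max(az_range[0], az0 - coarse_step)
    az_hi = min(az_range[1], az0 + coarse_step)
    el_lo = max(el_range[0], el0 - coarse_step)
    el_hi = min(el_range[1], el0 + coarse_step)
    return scan(az_lo, az_hi, el_lo, el_hi, fine_step)


def toy_spectrum(az, el):
    # Hypothetical stand-in for a MUSIC pseudo-spectrum: a single smooth
    # peak at azimuth 40.3 degrees, elevation 25.7 degrees.
    return 1.0 / ((az - 40.3) ** 2 + (el - 25.7) ** 2 + 1e-3)


if __name__ == "__main__":
    az, el, value = coarse_fine_peak(
        toy_spectrum, (0.0, 360.0), (0.0, 90.0),
        coarse_step=5.0, fine_step=0.1)
    print(f"peak near azimuth={az:.1f}, elevation={el:.1f}")
```

With a 5 degree coarse step and a 0.1 degree fine step, this example evaluates the spectrum about 1,400 times in the coarse pass and about 10,200 times in the fine pass, versus roughly 3.2 million evaluations for a full 0.1 degree grid. The trade-off is that a coarse step wider than the peak can miss it, so the coarse step has to be matched to the array's resolution.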
Pages: 54067-54077
Page count: 11
Related papers
50 in total
  • [31] An Implementation of Block Conjugate Gradient Algorithm on CPU-GPU Processors
    Ji, Hao
    Sosonkina, Masha
    Li, Yaohang
    2014 HARDWARE-SOFTWARE CO-DESIGN FOR HIGH PERFORMANCE COMPUTING (CO-HPC), 2014, : 72 - 77
  • [32] Parallelization of Cipher Algorithm on CPU/GPU for Real-time Software-Defined Access Network
    Suzuki, Takahiro
    Kim, Sang-Yuep
    Kani, Jun-ichi
    Suzuki, Ken-Ichi
    Otaka, Akihiro
    Hanawa, Toshihiro
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 484 - 487
  • [33] DNN Model Architecture Fingerprinting Attack on CPU-GPU Edge Devices
    Patwari, Kartik
    Hafiz, Syed Mahbub
    Wang, Han
    Homayoun, Houman
    Shafiq, Zubair
    Chuah, Chen-Nee
    2022 IEEE 7TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2022), 2022, : 337 - 355
  • [34] Hybrid CPU-GPU Computation of Adjoint Derivatives in Time Domain
    Statz, Christoph
    Muetze, Marco
    Hegler, Sebastian
    Plettemeier, Dirk
    2013 COMPUTATIONAL ELECTROMAGNETICS WORKSHOP (CEM'13), 2013, : 32 - 33
  • [35] Exploration/exploitation of a hybrid-enhanced MPSO-GA algorithm on a fused CPU-GPU architecture
    Franz, Wayne
    Thulasiraman, Parimala
    Thulasiram, Ruppa K.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (04): : 973 - 993
  • [36] Exploring Time-Predictable and High-Performance Last-Level Caches for Hard Real-Time Integrated CPU-GPU Processors
    Wang X.
    Zhang W.
    Zhang, Wei (wei.zhang@louisville.edu), 2020, Korean Institute of Information Scientists and Engineers (14) : 89 - 101
  • [37] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
    Xu, Dandan
    Xiao, Zheng
    Li, Dapu
    Wu, Fan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
  • [38] An industrial defect detection algorithm based on CPU-GPU parallel call
    Li, Zhu
    Lin, Hong-wei
    Liu, Yuan-yuan
    Chen, Chong
    Xia, Yun-fei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44191 - 44207
  • [39] Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
    Ma, Haifeng
    NONLINEAR ENGINEERING - MODELING AND APPLICATION, 2022, 11 (01): : 215 - 222