Optimized Real-Time MUSIC Algorithm With CPU-GPU Architecture

被引：2

作者：

Huang, Qinghua ^{[1
]}

Lu, Naida ^{[1
]}

机构：

[1] Shanghai Univ, Key Lab Specialty Fiber Opt & Opt Access Networks, Shanghai 200444, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金;

关键词：

Multiple signal classification; Signal processing algorithms; Graphics processing units; Sensors; Sensor arrays; Computer architecture; Estimation; Direction-of-arrival (DOA) estimation; uniform planar arrays (UPA); high-resolution; real-time; CPU-GPU architecture; DOA ESTIMATION; ESPRIT;

D O I：

10.1109/ACCESS.2021.3070980

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Direction-of-arrival (DOA) estimation algorithm for uniform planar arrays has been applied in many fields. The multiple signal classification (MUSIC) algorithm has obvious advantage in high-resolution signal source estimation scenarios. However, the MUSIC algorithm has high computational costs, therefore it is hard to be used in real-time scenes. Many studies are dedicated to accelerating MUSIC algorithm by parallel hardware, especially by Graphics Processing Units (GPU). MUSIC algorithm based on Central Processing Unit (CPU) -GPU architecture acceleration is rarely investigated in previous literatures, and how well MUSIC Algorithm with CPU-GPU architecture could perform remains unknown. In this paper, we present and evaluate a model of search parallel MUSIC algorithm with CPU-GPU architecture. In the proposed model, the steering vector of each candidate incident signal and the corresponding value of 2D spatial pseudo-spectrum (SPS) function are sequentially calculated in a single core of the GPU, and the subsequent calculation of each elevation or azimuth is parallel in batches. Furthermore, in order to improve the peak search speed, we propose a new Coarse and Fine Traversal (CFT) peak search algorithm via CPU and a new parallel peak search algorithm based on GPU acceleration. Across strategy comparison, utilizing CPU-GPU architecture for processing, a 150-160x performance gain is achieved compared to using CPU only. Besides, the resolution of uniform planar arrays is also analyzed.

引用

页码：54067 / 54077

页数：11

共 50 条

[31] An Implementation of Block Conjugate Gradient Algorithm on CPU-GPU Processors
Ji, Hao
Sosonkina, Masha
Li, Yaohang
2014 HARDWARE-SOFTWARE CO-DESIGN FOR HIGH PERFORMANCE COMPUTING (CO-HPC), 2014, : 72 - 77
[32] Parallelization of Cipher Algorithm on CPU/GPU for Real-time Software-Defined Access Network
Suzuki, Takahiro
Kim, Sang-Yucp
Kani, Jun-ichi
Suzuki, Kcn-Ichi
Otaka, Akihiro
Hanawa, Toshihiro
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 484 - 487
[33] DNN Model Architecture Fingerprinting Attack on CPU-GPU Edge Devices
Patwari, Kartik
Hafiz, Syed Mahbub
Wang, Han
Homayoun, Houman
Shafiq, Zubair
Chuah, Chen-Nee
2022 IEEE 7TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2022), 2022, : 337 - 355
[34] Hybrid CPU-GPU Computation of Adjoint Derivatives in Time Domain
Statz, Christoph
Muetze, Marco
Hegler, Sebastian
Plettemeier, Dirk
2013 COMPUTATIONAL ELECTROMAGNETICS WORKSHOP (CEM'13), 2013, : 32 - 33
[35] Exploration/exploitation of a hybrid-enhanced MPSO-GA algorithm on a fused CPU-GPU architecture
Franz, Wayne
Thulasiraman, Parimala
Thulasiram, Ruppa K.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (04): : 973 - 993
[36] Exploring Time-Predictable and High-Performance Last-Level Caches for Hard Real-Time Integrated CPU-GPU Processors
Wang X.
Zhang W.
Zhang, Wei (wei.zhang@louisville.edu), 2020, Korean Institute of Information Scientists and Engineers (14) : 89 - 101
[37] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
Xu, Dandan
Xiao, Zheng
Li, Dapu
Wu, Fan
2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
[38] An industrial defect detection algorithm based on CPU-GPU parallel call
Li, Zhu
Lin, Hong-wei
Liu, Yuan-yuan
Chen, Chong
Xia, Yun-fei
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44191 - 44207
[39] Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
Ma, Haifeng
NONLINEAR ENGINEERING - MODELING AND APPLICATION, 2022, 11 (01): : 215 - 222
[40] An industrial defect detection algorithm based on CPU-GPU parallel call
Zhu Li
Hong-wei Lin
Yuan-yuan Liu
Chong Chen
Yun-fei Xia
Multimedia Tools and Applications, 2023, 82 : 44191 - 44207

← 1 2 3 4 5 →