Sound source localization based on residual network and channel attention module

Cited by: 4

Authors:

Hu, Fucai [1]

Song, Xiaohui [1]

He, Ruhan [2]

Yu, Yongsheng [3]

Affiliations:

[1] Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Hubei, Peoples R China

[2] Wuhan Text Univ, Sch Comp Sci & Artificial Intelligence, Wuhan 430200, Hubei, Peoples R China

[3] Wuhan Univ Technol, State Key Lab Silicate Mat Architectures, Wuhan 430070, Hubei, Peoples R China

Source:

SCIENTIFIC REPORTS | 2023 / Vol. 13 / Issue 01

DOI:

10.1038/s41598-023-32657-7

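Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];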

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

This paper presents a sound source localization (SSL) model based on a residual network and a channel attention mechanism. The method takes the combination of the log-Mel spectrogram and the generalized cross-correlation phase transform (GCC-PHAT) as input features, and extracts time-frequency information using the residual structure and the channel attention mechanism, thereby obtaining better localization performance. Residual blocks are introduced to extract deeper features: they allow more layers to be stacked for high-level features while avoiding vanishing or exploding gradients. The attention mechanism is applied in the feature extraction stage of the proposed SSL model so that it focuses on the most important information in the input features. We use signals collected by a microphone array to explore the performance of the model under different features and to find the most suitable input features for the proposed method. We compare our method with other models on a public dataset. Experimental results show a substantial improvement in sound source localization performance.
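For readers unfamiliar with GCC-PHAT, one of the two input features named in the abstract, the following is a minimal NumPy sketch of the standard technique for a single microphone pair. It is not the authors' implementation: the function name gcc_phat, its parameters, and the small stabilizing constant are assumptions made for illustration only.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay of `sig` relative to `ref` with GCC-PHAT.

    Hypothetical helper for illustration; returns (delay in seconds,
    cross-correlation values over the searched lags).
    """
    # Zero-pad to the combined length so the correlation is linear, not circular.
    n = sig.shape[0] + ref.shape[0]
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)

    # Cross-power spectrum, then the phase transform: keep phase, drop magnitude.
    R = SIG * np.conj(REF)
    R /= np.abs(R) + 1e-12  # small constant (an assumption) avoids division by zero

    cc = np.fft.irfft(R, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = np.argmax(np.abs(cc)) - max_shift
    return shift / float(fs), cc
```

In a setup like the one the abstract describes, the correlation vectors for each microphone pair would typically be stacked alongside the log-Mel spectrogram channels before being passed to the network; the exact arrangement used in the paper is not stated in this record.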

Pages: 9

