Sound source localization based on residual network and channel attention module

Cited by: 4
Authors
Hu, Fucai [1]
Song, Xiaohui [1]
He, Ruhan [2]
Yu, Yongsheng [3]
Affiliations
[1] Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Hubei, Peoples R China
[2] Wuhan Text Univ, Sch Comp Sci & Artificial Intelligence, Wuhan 430200, Hubei, Peoples R China
[3] Wuhan Univ Technol, State Key Lab Silicate Mat Architectures, Wuhan 430070, Hubei, Peoples R China
Keywords
DOI
10.1038/s41598-023-32657-7
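Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes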
07 ; 0710 ; 09 ;
Abstract
This paper presents a sound source localization (SSL) model based on a residual network and a channel attention mechanism. The method takes the combination of the log-Mel spectrogram and the generalized cross-correlation with phase transform (GCC-PHAT) as input features and extracts time-frequency information using the residual structure and the channel attention mechanism, thereby achieving better localization performance. Residual blocks are introduced to extract deeper features: they allow more layers to be stacked for high-level features while avoiding vanishing or exploding gradients. The attention mechanism is incorporated into the feature extraction stage of the proposed SSL model so that it can focus on the most important information in the input features. We use signals collected by a microphone array to explore the performance of the model under different features and to find the most suitable input features for the proposed method. We compare our method with other models on a public dataset. Experimental results show a substantial improvement in sound source localization performance.
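For readers unfamiliar with GCC-PHAT, one of the two input features named in the abstract, the following is a minimal NumPy sketch of the standard textbook computation for a single microphone pair. It is not the authors' implementation: the function name `gcc_phat`, its parameters (`fs`, `max_tau`), and the small regularization constant are assumptions made for illustration only.

```python
import numpy as np


def gcc_phat(sig, ref, fs=1.0, max_tau=None):
    """Estimate the delay of `sig` relative to `ref` with GCC-PHAT.

    Hypothetical helper for illustration; returns (delay in seconds,
    cross-correlation over lags -max_shift..+max_shift).
    """
    # Zero-pad to the combined length so the correlation is linear,
    # not circular.
    n = sig.shape[0] + ref.shape[0]

    # Cross-power spectrum of the two channels.
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    R = SIG * np.conj(REF)

    # PHAT weighting: keep only the phase by normalizing the magnitude.
    # The 1e-12 term is an assumed guard against division by zero.
    R = R / (np.abs(R) + 1e-12)

    # Back to the lag domain.
    cc = np.fft.irfft(R, n=n)

    # Restrict the search to physically plausible lags if requested.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))

    # The peak position gives the delay in samples.
    shift = int(np.argmax(cc)) - max_shift
    return shift / float(fs), cc
```

In a feature pipeline of the kind the abstract describes, the returned correlation vector (rather than only the peak) would typically be computed per frame and per microphone pair and then stacked alongside the log-Mel spectrogram; how the paper arranges these features is not stated in this record.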
Pages: 9
Related papers
50 in total
  • [21] Fault Arc Detection Based on Channel Attention Mechanism and Lightweight Residual Network
    Gao, Xiang
    Zhou, Gan
    Zhang, Jian
    Zeng, Ying
    Feng, Yanjun
    Liu, Yuyuan
    ENERGIES, 2023, 16 (13)
  • [22] Facial Expression Recognition Based on Multi-Channel Attention Residual Network
    Shen, Tongping
    Xu, Huanqing
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 135 (01): : 539 - 560
  • [23] An Image Fusion Method Based on Special Residual Network and Efficient Channel Attention
    Li, Yang
    Yang, Haitao
    Wang, Jinyu
    Zhang, Changgong
    Liu, Zhengjun
    Chen, Hang
    ELECTRONICS, 2022, 11 (19)
  • [24] Sound-Source Localization System Based on Neural Network for Mobile Robots
    Geng, Yang
    Jung, Jongdae
    Seol, Donggug
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3126 - 3130
  • [25] Convolutional Neural Network Based Indoor Microphone Array Sound Source Localization
    Chen, Jiao
    Tao, Zhang
    Sun, Jianhong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (08)
  • [26] Robust offline trained neural network for TDOA based sound source localization
    Chetupalli, Srikanth Raj
    Ram, Ashwin
    Thippur, Sreenivas, V
    2018 TWENTY FOURTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2018,
  • [27] An Illegal Image Classification System Based on Deep Residual Network and Convolutional Block Attention Module
    Cai, Zengyu
    Hu, Xinhua
    Geng, Zhi
    Zhang, Jianwei
    Feng, Yuan
    International Journal of Network Security, 2023, 25 (02) : 351 - 359
  • [28] Indoor Sound Source Localization With Probabilistic Neural Network
    Sun, Yingxiang
    Chen, Jiajia
    Yuen, Chau
    Rahardja, Susanto
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (08) : 6403 - 6413
  • [29] RSHAN: Image super-resolution network based on residual separation hybrid attention module
    Shen, Ying
    Zheng, Weihuang
    Chen, Liqiong
    Huang, Feng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [30] Dual residual attention module network for single image super resolution
    Wang, Xiumei
    Gu, Yanan
    Gao, Xinbo
    Hui, Zheng
    NEUROCOMPUTING, 2019, 364 : 269 - 279