Sound source localization based on residual network and channel attention module

被引:4
|
作者
Hu, Fucai [1 ]
Song, Xiaohui [1 ]
He, Ruhan [2 ]
Yu, Yongsheng [3 ]
机构
[1] Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Hubei, Peoples R China
[2] Wuhan Text Univ, Sch Comp Sci & Artificial Intelligence, Wuhan 430200, Hubei, Peoples R China
[3] Wuhan Univ Technol, State Key Lab Silicate Mat Architectures, Wuhan 430070, Hubei, Peoples R China
关键词
D O I
10.1038/s41598-023-32657-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a sound source localization (SSL) model based on residual network and channel attention mechanism. The method takes the combination of log-Mel spectrogram and generalized cross-correlation phase transform (GCC-PHAT) as the input features, and extracts the time-frequency information by using the residual structure and channel attention mechanism, thus obtaining a better localizing performance. The residual blocks are introduced to extract deeper features, which can stack more layers for high-level features and avoid gradient vanishing or exploding at the same time. The attention mechanism is taken into account for the feature extraction stage in the proposed SSL model, which can focus on the most important information on the input features. We use the signals collected by microphone array to explore the performance of the model under different features, and find the most suitable input features of the proposed method. We compare our method with other models on public dataset. Experience results show a quite substantial improvement of sound source localizing performance.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Sound source localization based on residual network and channel attention module
    Fucai Hu
    Xiaohui Song
    Ruhan He
    Yongsheng Yu
    Scientific Reports, 13
  • [2] A generalized network based on multi-scale densely connection and residual attention for sound source localization and detection
    Hu, Ying
    Sun, Xinghao
    He, Liang
    Huang, Hao
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (03): : 1754 - 1768
  • [3] Sound source localization and detection based on densely connected network and attention mechanism
    Zhou, Bomao
    Tang, Jin
    APPLIED ACOUSTICS, 2025, 228
  • [4] IFAN: An Icosahedral Feature Attention Network for Sound Source Localization
    Zhu, Xin-Cheng
    Zhang, Hong
    Feng, Hui-Tao
    Zhao, Deng-Huang
    Zhang, Xiao-Jun
    Tao, Zhi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [5] HEART SOUND CLASSIFICATION USING RESIDUAL NEURAL NETWORK AND CONVOLUTION BLOCK ATTENTION MODULE
    Frimpong, Enoch Adjei
    Qin Zhiguang
    Kwadwo, Tenagyei Edwin
    Rutherford, Patamia Agbeshi
    Baagyere, Edward Y.
    Turkson, Regina Esi
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [6] Attention mechanism combined with residual recurrent neural network for sound event detection and localization
    Chaofeng Lan
    Lei Zhang
    Yuanyuan Zhang
    Lirong Fu
    Chao Sun
    Yulan Han
    Meng Zhang
    EURASIP Journal on Audio, Speech, and Music Processing, 2022
  • [7] Attention mechanism combined with residual recurrent neural network for sound event detection and localization
    Lan, Chaofeng
    Zhang, Lei
    Zhang, Yuanyuan
    Fu, Lirong
    Sun, Chao
    Han, Yulan
    Zhang, Meng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
  • [8] Face Recognition Based on Improved Residual Network and Channel Attention
    Zeng, Jingfang
    Li, Jieyu
    Feng, Linlang
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (05) : 383 - 392
  • [9] Face Recognition Based on Improved Residual Network and Channel Attention
    Jieyu Jingfang Zeng
    Linlang Li
    Automatic Control and Computer Sciences, 2022, 56 : 383 - 392
  • [10] Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization
    Zhou, Yuting
    Wan, Hongjie
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)