Robust DOA Estimation Using Multi-Scale Fusion Network with Attention Mask

被引:1
|
作者
Yan, Yuting [1 ]
Huang, Qinghua [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 11期
关键词
complex-valued neural network; direction-of-arrival; reverberant; multi-scale; attention; SPHERICAL MICROPHONE ARRAY; OF-ARRIVAL ESTIMATION; NEURAL-NETWORK; ACOUSTIC ANALYSIS; DIRECTION; LOCALIZATION; ALGORITHM; FRAMEWORK;
D O I
10.3390/app14114488
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
To overcome the limitations of traditional methods in reverberant and noisy environments, a robust multi-scale fusion neural network with attention mask is designed to improve direction-of-arrival (DOA) estimation accuracy for acoustic sources. It combines the benefits of deep learning and complex-valued operations to effectively deal with the interference of reverberation and noise in speech signals. The unique properties of complex-valued signals are exploited to fully capture inherent features and rich information is preserved in the complex field. An attention mask module is designed to generate distinct masks for selectively focusing and masking based on the input. After that, the multi-scale fusion block efficiently captures multi-scale spatial features by stacking complex-valued convolutional layers with small size kernels, and reduces the module complexity through special branching operations. Experimental results demonstrate that the model achieves significant improvements over other methods for speaker localization in reverberant and noisy environments. It provides a new solution for DOA estimation for acoustic sources in different scenarios, which has significant theoretical and practical implications.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] MsRAN: a multi-scale residual attention network for multi-model image fusion
    Wang, Jing
    Yu, Long
    Tian, Shengwei
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (12) : 3615 - 3634
  • [22] MsRAN: a multi-scale residual attention network for multi-model image fusion
    Jing Wang
    Long Yu
    Shengwei Tian
    Medical & Biological Engineering & Computing, 2022, 60 : 3615 - 3634
  • [23] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [24] Multi-scale feature fusion based DOA and range estimation for near-field sources
    Liu, Ke
    Fu, Yanyan
    Ma, Junda
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [25] MSGAFN: Multi-scale graph attention fusion network for machine fault diagnosis
    Ni, Peihao
    Zhang, Yuanyuan
    Xiong, Xiaoyun
    Wang, Jinlong
    Ji, Aiguo
    Dong, Liangcheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2024, 238 (15) : 7894 - 7907
  • [26] MSFFA: a multi-scale feature fusion and attention mechanism network for crowd counting
    Zhaoxin Li
    Shuhua Lu
    Yishan Dong
    Jingyuan Guo
    The Visual Computer, 2023, 39 : 1045 - 1056
  • [27] Multi-scale recurrent attention gated fusion network for single image dehazing
    Zhang, Xiangfen
    Yang, Shuo
    Zhang, Qingyi
    Yuan, Feiniu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 101
  • [28] Multi-Scale Feature Fusion Attention Network for Infrared Small Target Detection
    Zhang, Yidan
    Li, Chunlei
    Liu, Yundong
    Liu, Zhoufeng
    Yang, Ruimin
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [29] A Multi-Scale Progressive Collaborative Attention Network for Remote Sensing Fusion Classification
    Ma, Wenping
    Li, Yating
    Zhu, Hao
    Ma, Haoxiang
    Jiao, Licheng
    Shen, Jianchao
    Hou, Biao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 3897 - 3911
  • [30] Fusion of Geometric Attention and Multi-Scale Feature Network for Point Cloud Registration
    Du, Jiajin
    Bai, Zhengyao
    Liu, Xuheng
    Li, Zekai
    Xiao, Xiao
    You, Yilin
    Computer Engineering and Applications, 2024, 60 (12) : 234 - 244