Robust DOA Estimation Using Multi-Scale Fusion Network with Attention Mask

Cited by: 1
Authors
Yan, Yuting [1]
Huang, Qinghua [1]
Affiliation
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024, Vol. 14, Issue 11
Keywords
complex-valued neural network; direction-of-arrival; reverberant; multi-scale; attention; SPHERICAL MICROPHONE ARRAY; OF-ARRIVAL ESTIMATION; NEURAL-NETWORK; ACOUSTIC ANALYSIS; DIRECTION; LOCALIZATION; ALGORITHM; FRAMEWORK;
DOI
10.3390/app14114488
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
To overcome the limitations of traditional methods in reverberant and noisy environments, a robust multi-scale fusion neural network with an attention mask is designed to improve direction-of-arrival (DOA) estimation accuracy for acoustic sources. It combines deep learning with complex-valued operations to deal with the interference of reverberation and noise in speech signals. The properties of complex-valued signals are exploited to capture inherent features, and rich information is preserved in the complex domain. An attention mask module generates input-dependent masks for selective focusing and masking. The multi-scale fusion block then captures multi-scale spatial features by stacking complex-valued convolutional layers with small kernels, and reduces module complexity through special branching operations. Experimental results demonstrate that the model achieves significant improvements over other methods for speaker localization in reverberant and noisy environments. It provides a new solution for DOA estimation of acoustic sources in different scenarios, with both theoretical and practical implications.
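As a rough illustration of two ideas the abstract mentions, complex-valued convolution and stacking small kernels to cover a larger span, the following is a minimal NumPy sketch. It is not the authors' network: the function name `complex_conv1d`, the 1-D setting, the 3-tap kernel size, and the random data are all assumptions made for illustration. It uses true convolution (`np.convolve`), whereas deep learning layers typically compute cross-correlation, and it omits the nonlinearities a real network would place between layers.

```python
import numpy as np


def complex_conv1d(x, w):
    """Full complex convolution assembled from four real convolutions.

    Uses (a + ib) * (c + id) = (ac - bd) + i(ad + bc), which is how
    complex-valued layers are commonly built on real-valued primitives.
    (Hypothetical helper, not taken from the paper.)
    """
    xr, xi = x.real, x.imag
    wr, wi = w.real, w.imag
    real = np.convolve(xr, wr) - np.convolve(xi, wi)
    imag = np.convolve(xr, wi) + np.convolve(xi, wr)
    return real + 1j * imag


rng = np.random.default_rng(0)
x = rng.standard_normal(16) + 1j * rng.standard_normal(16)   # toy complex signal
w1 = rng.standard_normal(3) + 1j * rng.standard_normal(3)    # small 3-tap kernel
w2 = rng.standard_normal(3) + 1j * rng.standard_normal(3)    # second 3-tap kernel

# Two stacked 3-tap layers (no nonlinearity in between) ...
stacked = complex_conv1d(complex_conv1d(x, w1), w2)

# ... span the same 5 samples as a single 5-tap kernel.
w_eff = complex_conv1d(w1, w2)
single = complex_conv1d(x, w_eff)
```

In this linear toy case the stacked result matches the single 5-tap convolution exactly, using 6 complex weights instead of 5; the parameter saving from small kernels appears mainly in 2-D (for example, two 3x3 kernels versus one 5x5). With nonlinearities between layers the equivalence no longer holds, and only the receptive-field argument carries over.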
Pages: 15
Related papers
50 in total
  • [41] Enhancing EEG and sEMG Fusion Decoding Using a Multi-Scale Parallel Convolutional Network With Attention Mechanism
    Tang, Xianlun
    Qi, Yidan
    Zhang, Jing
    Liu, Ke
    Tian, Yin
    Gao, Xinbo
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32: 212-222
  • [42] Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation
    Chen, Qingjun
    Zheng, Hua
    Pan, Hao
    Liao, Xiaoqi
    Wang, Hongkai
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [43] MLANet: multi-level attention network with multi-scale feature fusion for crowd counting
    Xiong, Liyan
    Zeng, Yijuan
    Huang, Xiaohui
    Li, Zhida
    Huang, Peng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): 6591-6608
  • [44] MAPoseNet: Animal pose estimation network via multi-scale convolutional attention
    Liu, Sicong
    Fan, Qingcheng
    Li, Shuqin
    Zhao, Chunjiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [45] Monocular Image Depth Estimation Based on Multi-Scale Attention Oriented Network
    Liu J.
    Wen J.
    Liang Y.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (12): 52-62
  • [46] Single image deraining using multi-stage and multi-scale joint channel coordinate attention fusion network
    Yang, Yitong
    Zhang, Yongjun
    Cui, Zhongwei
    Li, Zhi
    Xu, Yujie
    Zhao, Haoliang
    Ou, Yangtin
    Yang, Heliang
    Wang, Xihe
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11): 9750-9773
  • [47] Multi-scale Refocusing Attention Siamese Network
    Liu, Guoqiang
    Chen, Zhe
    Shen, Guangze
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024: 42-46
  • [48] Multi-Scale Attention Network for Image Cropping
    Lian, Tianpei
    Xian, Ke
    Pan, Zhiyu
    Hong, Chaoyi
    Cao, Zhiguo
    Zhong, Weicai
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020: 2640-2645
  • [49] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [50] SSD with multi-scale feature fusion and attention mechanism
    Liu, Qiang
    Dong, Lijun
    Zeng, Zhigao
    Zhu, Wenqiu
    Zhu, Yanhui
    Meng, Chen
    SCIENTIFIC REPORTS, 2023, 13 (01)