Speech Enhancement Performance Based on the MANNER Network Using Feature Fusion

被引:0
|
作者
Wang, Shijie [1 ]
Li, Ji [2 ]
Shao, Lei [2 ]
Liu, Hongli [2 ]
Zhu, Lihua [2 ]
Zhu, Xiaochen [1 ]
机构
[1] Tianjin Univ Technol, Sch Elect Engn & Automat, Tianjin 300384, Peoples R China
[2] Tianjin Key Lab New Energy Power Convers Transmiss, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
speech enhancement; feature fusion; attention mechanisms; U-Net; MANNER;
D O I
10.3390/electronics12081768
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problems that the multi-view attention network for noise erasure (MANNER) cannot take into account are the local and global features in the speech enhancement of long sequences. An attention and feature fusion MANNER (AF-MANNER) network is proposed, which improves the multi-view attention (MA) module in MANNER and replaces the global and local attention in the module. AF-MANNER also designs the feature-weighted fusion module to fuse the features of flash attention and neighborhood attention to enhance the feature expression of the network. The final ablation studies show that this network exhibits a good performance in speech enhancement and that its structure is valuable for improving the intelligibility and perceptual quality of speech.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Underwater Image Enhancement Method Based on Dynamic Heterogeneous Feature Fusion Neural Network
    Zhou, Xuejun
    Zhang, Jiachen
    Zhou, Fangyuan
    IEEE ACCESS, 2022, 10 : 91816 - 91827
  • [22] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [23] Research on Speech Emotional Feature Extraction Based on Multidimensional Feature Fusion
    Zheng, Chunjun
    Wang, Chunli
    Sun, Wei
    Jia, Ning
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 535 - 547
  • [24] A Visual Enhancement Network with Feature Fusion for Image Aesthetic Assessment
    Zhang, Xin
    Jiang, Xinyu
    Song, Qing
    Zhang, Pengzhou
    ELECTRONICS, 2023, 12 (11)
  • [25] Enhancement-fusion feature pyramid network for object detection
    Dong, Shifeng
    Wang, Rujing
    Du, Jianming
    Jiao, Lin
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [26] Performance Enhancement of Target Recognition Using Feature Vector Fusion of Monostatic and Bistatic Radar
    Lee, Seung-Jae
    Choi, In-Sik
    Cho, Byung-Lae
    Rothwell, Edward J.
    Temme, Andrew K.
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2014, 144 : 291 - 302
  • [27] Speech Emotion Recognition based on Multiple Feature Fusion
    Jiang, Changjiang
    Mao, Rong
    Liu, Geng
    Wang, Mingyi
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 907 - 912
  • [28] Pedestrian Detection based on Deep Fusion Network using Feature Correlation
    Lee, Yongwoo
    Bui, Toan Duc
    Shin, Jitae
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 694 - 699
  • [29] MFFR-net: Multi-scale feature fusion and attentive recalibration network for deep neural speech enhancement
    Saleem, Nasir
    Bourouis, Sami
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [30] FESSD:SSD target detection based on feature fusion and feature enhancement
    Qian, Huaming
    Wang, Huilin
    Feng, Shuai
    Yan, Shuya
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)