PFRNet: Dual-Branch Progressive Fusion Rectification Network for Monaural Speech Enhancement

Cited by: 12
Authors
Yu, Runxiang [1, 2]
Zhao, Ziwei [1, 2]
Ye, Zhongfu [1, 2]
Affiliations
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
[2] Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230027, Anhui, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Transformers; Speech enhancement; Tensors; Convolution; Decoding; Time-frequency analysis; Fusion rectification block; interactive time-frequency improved transformer; monaural speech enhancement
DOI
10.1109/LSP.2022.3222045
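Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809
Abstract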
In recent years, the transformer-based dual-branch magnitude and complex spectrum estimation framework achieves state-of-the-art performance for monaural speech enhancement. However, the insufficient utilization of the interactive information in the middle layers makes each branch lack the ability of compensation and rectification. To address this problem, this letter proposes a novel dual-branch progressive fusion rectification network (PFRNet) for monaural speech enhancement. PFRNet is an encoder-decoder-based dual-branch structure with interactive improved real & complex transformers. In PFRNet, the fusion rectification block is proposed to convert the implicit relationship of the two branches into a fusion feature by the frequency-domain mutual attention mechanism. The fusion feature provides a platform for the interaction in the middle layers. The interactive time-frequency improved real & complex transformer can make better use of the long-term dependencies in the time-frequency domain. Experimental results show that the proposed PFRNet outperforms most advanced dual-branch speech enhancement approaches and previous advanced systems in terms of speech quality and intelligibility.
Pages: 2358-2362
Page count: 5