Multi-task Learning for End-to-end Noise-robust Bandwidth Extension

被引:11
|
作者
Hou, Nana [1 ]
Xu, Chenglin [1 ,4 ]
Zhou, Joey Tianyi [3 ]
Chng, Eng Siong [1 ,2 ]
Li, Haizhou [4 ,5 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Nanyang Technol Univ, Temasek Labs, Singapore, Singapore
[3] ASTAR, Inst High Performance Comp IHPC, Singapore, Singapore
[4] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[5] Univ Bremen, Machine Listening Lab, Bremen, Germany
来源
基金
新加坡国家研究基金会;
关键词
Noise-robust bandwidth extension; multi-task learning; time-domain masking; temporal convolutional network; NEURAL-NETWORK; SPEECH;
D O I
10.21437/Interspeech.2020-2022
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Bandwidth extension aims to reconstruct wideband speech signals from narrowband inputs to improve perceptual quality. Prior studies mostly perform bandwidth extension under the assumption that the narrowband signals are clean without noise. The use of such extension techniques is greatly limited in practice when signals are corrupted by noise. To alleviate such problem, we propose an end-to-end time-domain framework for noise-robust bandwidth extension, that jointly optimizes a mask-based speech enhancement and an ideal bandwidth extension module with multi-task learning. The proposed framework avoids decomposing the signals into magnitude and phase spectra, therefore, requires no phase estimation. Experimental results show that the proposed method achieves 14.3% and 15.8% relative improvements over the best baseline in terms of perceptual evaluation of speech quality (PESQ) and log-spectral distortion (LSD), respectively. Furthermore, our method is 3 times more compact than the best baseline in terms of the number of parameters.
引用
收藏
页码:4069 / 4073
页数:5
相关论文
共 50 条
  • [31] Multi-task and multi-view training for end-to-end relation extraction
    Zhang, Junchi
    Zhang, Yue
    Ji, Donghong
    Liu, Mengchi
    NEUROCOMPUTING, 2019, 364 : 245 - 253
  • [32] End-to-end Multi-task Learning Framework for Spatio-Temporal Grounding in Video Corpus
    Gao, Yingqi
    Luo, Zhiling
    Chen, Shiqian
    Zhou, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3958 - 3962
  • [33] An effective multi-task learning model for end-to-end emotion-cause pair extraction
    Chenbing Li
    Jie Hu
    Tianrui Li
    Shengdong Du
    Fei Teng
    Applied Intelligence, 2023, 53 : 3519 - 3529
  • [34] ATTENTION-AUGMENTED END-TO-END MULTI-TASK LEARNING FOR EMOTION PREDICTION FROM SPEECH
    Zhang, Zixing
    Wu, Bingwen
    Schuller, Bjoern
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6705 - 6709
  • [35] An end-to-end multi-task learning to link framework for emotion-cause pair extraction
    Song, Haolin
    Song, Dawei
    2021 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2021, 12076
  • [36] An effective multi-task learning model for end-to-end emotion-cause pair extraction
    Li, Chenbing
    Hu, Jie
    Li, Tianrui
    Du, Shengdong
    Teng, Fei
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3519 - 3529
  • [37] An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis
    He, Ruidan
    Lee, Wee Sun
    Ng, Hwee Tou
    Dahlmeier, Daniel
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 504 - 515
  • [38] Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis
    Bensoltane, Rajae
    Zaki, Taher
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [39] End-to-end Delay Analysis for Coded Computing with Multi-Task Arrival
    Ji, Zhongming
    Chen, Li
    Chen, Xiaohui
    Wei, Guo
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 398 - 403
  • [40] SDAPNet: End-to-End Multi-task Simultaneous Detection and Prediction Network
    Ye, Shanding
    Yao, Han
    Wang, Wenfu
    Fu, Yongjian
    Pan, Zhijie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,