Self-Supervised Learning and Multi-Task Pre-Training Based Single-Channel Acoustic Denoising

被引:1
|
作者
Li, Yi [1 ]
Sun, Yang [2 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Grp, Newcastle Upon Tyne NE1 7RU, England
[2] Univ Oxford, Big Data Inst, Oxford OX3 7LF, England
关键词
MONAURAL SOURCE SEPARATION; SPEECH; ENVIRONMENTS;
D O I
10.1109/MFI55806.2022.9913855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In self-supervised learning-based single-channel speech denoising problem, it is challenging to reduce the gap between the denoising performance on the estimated and target speech signals with existed pre-tasks. In this paper, we propose a multi-task pre-training method to improve the speech denoising performance within self-supervised learning. In the proposed pre-training autoencoder (PAE), only a very limited set of unpaired and unseen clean speech signals are required to learn speech latent representations. Meanwhile, to solve the limitation of existing single pre-task, the proposed masking module exploits the dereverberated mask and estimated ratio mask to denoise the mixture as the new pre-task. The downstream task autoencoder (DAE) utilizes unlabeled and unseen reverberant mixtures to generate the estimated mixtures. The DAE is trained to share a latent representation with the clean examples from the learned representation in the PAE. Experimental results on a benchmark dataset demonstrate that the proposed method outperforms the state-of-the-art approaches.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Self-Supervised Pre-Training for Deep Image Prior-Based Robust PET Image Denoising
    Onishi, Yuya
    Hashimoto, Fumio
    Ote, Kibo
    Matsubara, Keisuke
    Ibaraki, Masanobu
    IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES, 2024, 8 (04) : 348 - 356
  • [32] Self-supervised Pre-training and Semi-supervised Learning for Extractive Dialog Summarization
    Zhuang, Yingying
    Song, Jiecheng
    Sadagopan, Narayanan
    Beniwal, Anurag
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1069 - 1076
  • [33] Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
    Zhang, Bowen
    Cao, Songjun
    Zhang, Xiaoming
    Zhang, Yike
    Ma, Long
    Shinozaki, Takahiro
    INTERSPEECH 2022, 2022, : 2653 - 2657
  • [34] CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text
    Nandy, Abhilash
    Kapadnis, Manav Nitin
    Goyal, Pawan
    Ganguly, Niloy
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8793 - 8806
  • [35] Multi-task self-supervised learning based fusion representation for Multi-view clustering
    Guo, Tianlong
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    INFORMATION SCIENCES, 2025, 694
  • [36] Self-Supervised Pre-training for Time Series Classification
    Shi, Pengxiang
    Ye, Wenwen
    Qin, Zheng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [37] Self-supervised Multi-task Representation Learning for Sequential Medical Images
    Dong, Nanqing
    Kampffmeyer, Michael
    Voiculescu, Irina
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 779 - 794
  • [38] Learning Representations for Bipartite Graphs Using Multi-task Self-supervised Learning
    Sethi, Akshay
    Gupta, Sonia
    Malhotra, Aakarsh
    Asthana, Siddhartha
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 19 - 35
  • [39] MULTI-TASK VOICE ACTIVATED FRAMEWORK USING SELF-SUPERVISED LEARNING
    Hussain, Shehzeen
    Van Nguyen
    Zhang, Shuhua
    Visser, Erik
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6137 - 6141
  • [40] Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
    Georgescu, Mariana-Iuliana
    Barbalau, Antonio
    Ionescu, Radu Tudor
    Khan, Fahad Shahbaz
    Popescu, Marius
    Shah, Mubarak
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12737 - 12747