Zero-shot test time adaptation via knowledge distillation for personalized speech denoising and dereverberation

被引:2
|
作者
Kim, Sunwoo [1 ]
Athi, Mrudula [1 ]
Shi, Guangji [1 ]
Kim, Minje [1 ,2 ]
Kristjansson, Trausti [1 ]
机构
[1] Amazon Lab126, Sunnyvale, CA 94089 USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
来源
基金
美国国家科学基金会;
关键词
DOMAIN ADAPTATION; ENHANCEMENT; NOISE;
D O I
10.1121/10.0024621
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A personalization framework to adapt compact models to test time environments and improve their speech enhancement (SE) performance in noisy and reverberant conditions is proposed. The use-cases are when the end-user device encounters only one or a few speakers and noise types that tend to reoccur in the specific acoustic environment. Hence, a small personalized model that is sufficient to handle this focused subset of the original universal SE problem is postulated. The study addresses a major data shortage issue: although the goal is to learn from a specific user's speech signals and the test time environment, the target clean speech is unavailable for model training due to privacy-related concerns and technical difficulty of recording noise and reverberation-free voice signals. The proposed zero-shot personalization method uses no clean speech target. Instead, it employs the knowledge distillation framework, where the more advanced denoising results from an overly large teacher work as pseudo targets to train a small student model. Evaluation on various test time conditions suggests that the proposed personalization approach can significantly enhance the compact student model's test time performance. Personalized models outperform larger non-personalized baseline models, demonstrating that personalization achieves model compression with no loss in dereverberation and denoising performance.
引用
收藏
页码:1353 / 1367
页数:15
相关论文
共 50 条
  • [41] Zero-Shot Low-Field MRI Enhancement via Denoising Diffusion Driven Neural Representation
    Lin, Xiyue
    Du, Chenhe
    Wu, Qing
    Tian, Xuanyu
    Yu, Jingyi
    Zhang, Yuyao
    Wei, Hongjiang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 775 - 785
  • [42] Zero-shot Transfer Learning within a Heterogeneous Graph via Knowledge Transfer Networks
    Yoon, Minji
    Palowitch, John
    Zelle, Dustin
    Hu, Ziniu
    Salakhutdinov, Ruslan
    Perozzi, Bryan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [43] Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data
    Chen, Qi
    Wang, Wei
    Huang, Kaizhu
    Coenen, Frans
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12): : 9205 - 9213
  • [44] Improving Zero-Shot Phrase Grounding via Reasoning on External Knowledge and Spatial Relations
    Shi, Zhan
    Shen, Yilin
    Jin, Hongxia
    Zhu, Xiaodan
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2253 - 2261
  • [45] Relevant Entity Selection: Knowledge Graph Bootstrapping via Zero-Shot Analogical Pruning
    Jarnac, Lucas
    Couceiro, Miguel
    Monnin, Pierre
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 934 - 944
  • [46] Explainable zero-shot learning via attentive graph convolutional network and knowledge graphs
    Geng, Yuxia
    Chen, Jiaoyan
    Ye, Zhiquan
    Yuan, Zonggang
    Zhang, Wei
    Chen, Huajun
    SEMANTIC WEB, 2021, 12 (05) : 741 - 765
  • [47] DreamMotion: Space-Time Self-similar Score Distillation for Zero-Shot Video Editing
    Jeong, Hyeonho
    Chang, Jinho
    Park, Geon Yeong
    Ye, Jong Chul
    COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088 : 358 - 376
  • [48] A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [49] Sketch3T: Test-Time Training for Zero-Shot SBIR
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Potlapalli, Vaishnav
    Chowdhury, Pinaki Nath
    Xiang, Tao
    Song, Yi-Zhe
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7452 - 7461
  • [50] PRIVACY-ENHANCED ZERO-SHOT LEARNING VIA DATA-FREE KNOWLEDGE TRANSFER
    Gao, Rui
    Wan, Fan
    Organisciak, Daniel
    Pu, Jiyao
    Duan, Haoran
    Zhang, Peng
    Hou, Xingsong
    Long, Yang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 432 - 437