Background-Sound Controllable Voice Source Separation

被引:0
|
作者
Eom, Deokjun [1 ]
Nam, Woo Hyun [1 ]
Kim, Kyung-Rae [1 ]
机构
[1] Samsung Elect, Samsung Res, Suwon, South Korea
来源
关键词
background-sound controllable; voice source separation; speech separation; deep learning;
D O I
10.21437/Interspeech.2023-185
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There have been various approaches to separate mixed voices. In the real world, input voices contain many different kinds of background sounds but existing methods have not considered the background sounds in model architectures. These approaches are difficult to control the background sounds directly and the voice separation results include the background sounds randomly. In this paper, we propose an extended voice separation framework, background-sound controllable voice source separation that can control the degrees of background sounds of voice separation outputs using a control parameter that ranges from 0 to 1 without additional mixing procedures. Several experiments show the controllability of background sounds on various real world datasets with preserving voice separation performances.
引用
收藏
页码:1698 / 1702
页数:5
相关论文
共 50 条
  • [21] Blind source separation of coexisting background in Raman spectra
    Yao, Ju
    Su, Hui
    Yao, Zhixiang
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2020, 238 (238)
  • [22] Application of independent component analysis for sound source separation
    Tan, Chin An
    Gupta, Arvind
    Li, Shaungqing
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCE AND INFORMATION IN ENGINEERING CONFERENCE, VOL 1, PTS A-C, 2008, : 299 - 305
  • [23] Compute and Memory Efficient Universal Sound Source Separation
    Tzinis, Efthymios
    Wang, Zhepei
    Jiang, Xilin
    Smaragdis, Paris
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (02): : 245 - 259
  • [24] Compute and Memory Efficient Universal Sound Source Separation
    Efthymios Tzinis
    Zhepei Wang
    Xilin Jiang
    Paris Smaragdis
    Journal of Signal Processing Systems, 2022, 94 : 245 - 259
  • [25] BINAURAL SOUND SOURCE SEPARATION MOTIVATED BY AUDITORY PROCESSING
    Kim, Chanwoo
    Kumar, Kshitiz
    Stern, Richard M.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5072 - 5075
  • [26] Sound source separation in the frequency domain with image processing
    Ninagawa, K
    Umeyama, T
    Suzuki, K
    Sugie, N
    HUMAN-COMPUTER INTERACTION - INTERACT'01, 2001, : 781 - 782
  • [27] VISUAL SOUND SOURCE SEPARATION WITH PARTIAL SUPERVISION LEARNING
    Wang, Huasen
    Gao, Lingling
    Tan, Qianchao
    Ji, Luping
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2127 - 2131
  • [28] AUDITORY FILTERBANKS BENEFIT UNIVERSAL SOUND SOURCE SEPARATION
    Li, Han
    Chen, Kean
    Seeber, Bernhard U.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 181 - 185
  • [29] SOUND SOURCE SEPARATION OF MOVING SPEAKERS FOR ROBOT AUDITION
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Hasegawa, Yuji
    Tsujino, Hiroshi
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3685 - 3688
  • [30] Music separation method based on repeating structural model and sound source separation
    ZHANG Tian
    ZHANG Tianqi
    GE Wanying
    YU Shengqi
    Chinese Journal of Acoustics, 2020, 39 (04) : 567 - 581