Background-Sound Controllable Voice Source Separation

被引:0
|
作者
Eom, Deokjun [1 ]
Nam, Woo Hyun [1 ]
Kim, Kyung-Rae [1 ]
机构
[1] Samsung Elect, Samsung Res, Suwon, South Korea
来源
关键词
background-sound controllable; voice source separation; speech separation; deep learning;
D O I
10.21437/Interspeech.2023-185
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There have been various approaches to separate mixed voices. In the real world, input voices contain many different kinds of background sounds but existing methods have not considered the background sounds in model architectures. These approaches are difficult to control the background sounds directly and the voice separation results include the background sounds randomly. In this paper, we propose an extended voice separation framework, background-sound controllable voice source separation that can control the degrees of background sounds of voice separation outputs using a control parameter that ranges from 0 to 1 without additional mixing procedures. Several experiments show the controllability of background sounds on various real world datasets with preserving voice separation performances.
引用
收藏
页码:1698 / 1702
页数:5
相关论文
共 50 条
  • [41] Sound source separation based on time reversal technique in room
    Zeng, Jinfang
    Xu, Lintao
    Zeng, Yicheng
    Bai, Bing
    PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON SENSORS, MECHATRONICS AND AUTOMATION (ICSMA 2016), 2016, 136 : 346 - 350
  • [42] Monophonic sound source separation with an unsupervised network of spiking neurones
    Pichevar, Ramin
    Rouat, Jean
    NEUROCOMPUTING, 2007, 71 (1-3) : 109 - 120
  • [43] Note-based sound source separation of polyphonic recordings
    Aczel, Kristof
    Vajk, Istvan
    INFOCOMMUNICATIONS JOURNAL, 2009, 1 (01): : 36 - 40
  • [44] A Nonwhitening Post Filter to Improve the Performance of Sound Source Separation
    Widyotriatmo, Augie
    Tjokronegoro, Harijono A.
    Hong, Keum-Shik
    2008 6TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2008, : 566 - +
  • [45] Sound Source Separation and Automatic Speech Recognition for Moving Sources
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Ince, Goekhan
    Hasegawa, Yuji
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 976 - 981
  • [46] Source Separation of the Second Heart Sound via Alternating Optimization
    Renna, Francesco
    Plumbley, Mark D.
    Coimbra, Miguel
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [47] Spatial Manipulation of Musical Sound: Informed Source Separation and Respatialization
    Marchand, Sylvain
    COMPUTATIONAL PHONOGRAM ARCHIVING, 2019, 5 : 175 - 190
  • [48] Robust Estimation of Sound Source Direction with Deterministic Background Noise and Stochastic Source Dynamics Models
    Mizumachi, Mitsunori
    Niyada, Katsuyuki
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2010, 14 (02) : 208 - 213
  • [49] A Light Source System of Multi-star Simulator with Adjustable Background and Controllable Magnitude
    SUN Gaofei
    LIU Shi
    ZHANG Guoyu
    ZHANG Yu
    LEI Jie
    MA Yiyuan
    空间科学学报, 2017, (06) : 760 - 765
  • [50] Spherical-harmonics-based sound field decomposition and multichannel NMF for sound source separation
    Pezzoli, Mirco
    Carabias-Orti, Julio
    Vera-Candeas, Pedro
    Antonacci, Fabio
    Sarti, Augusto
    APPLIED ACOUSTICS, 2024, 218