Background-Sound Controllable Voice Source Separation

被引：0

作者：

Eom, Deokjun ^{[1
]}

Nam, Woo Hyun ^{[1
]}

Kim, Kyung-Rae ^{[1
]}

机构：

[1] Samsung Elect, Samsung Res, Suwon, South Korea

来源：

INTERSPEECH 2023 | 2023年

关键词：

background-sound controllable; voice source separation; speech separation; deep learning;

D O I：

10.21437/Interspeech.2023-185

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There have been various approaches to separate mixed voices. In the real world, input voices contain many different kinds of background sounds but existing methods have not considered the background sounds in model architectures. These approaches are difficult to control the background sounds directly and the voice separation results include the background sounds randomly. In this paper, we propose an extended voice separation framework, background-sound controllable voice source separation that can control the degrees of background sounds of voice separation outputs using a control parameter that ranges from 0 to 1 without additional mixing procedures. Several experiments show the controllability of background sounds on various real world datasets with preserving voice separation performances.

引用

页码：1698 / 1702

页数：5

共 50 条

[41] Sound source separation based on time reversal technique in room
Zeng, Jinfang
Xu, Lintao
Zeng, Yicheng
Bai, Bing
PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON SENSORS, MECHATRONICS AND AUTOMATION (ICSMA 2016), 2016, 136 : 346 - 350
[42] Monophonic sound source separation with an unsupervised network of spiking neurones
Pichevar, Ramin
Rouat, Jean
NEUROCOMPUTING, 2007, 71 (1-3) : 109 - 120
[43] Note-based sound source separation of polyphonic recordings
Aczel, Kristof
Vajk, Istvan
INFOCOMMUNICATIONS JOURNAL, 2009, 1 (01): : 36 - 40
[44] A Nonwhitening Post Filter to Improve the Performance of Sound Source Separation
Widyotriatmo, Augie
Tjokronegoro, Harijono A.
Hong, Keum-Shik
2008 6TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2008, : 566 - +
[45] Sound Source Separation and Automatic Speech Recognition for Moving Sources
Nakadai, Kazuhiro
Nakajima, Hirofumi
Ince, Goekhan
Hasegawa, Yuji
IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 976 - 981
[46] Source Separation of the Second Heart Sound via Alternating Optimization
Renna, Francesco
Plumbley, Mark D.
Coimbra, Miguel
2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
[47] Spatial Manipulation of Musical Sound: Informed Source Separation and Respatialization
Marchand, Sylvain
COMPUTATIONAL PHONOGRAM ARCHIVING, 2019, 5 : 175 - 190
[48] Robust Estimation of Sound Source Direction with Deterministic Background Noise and Stochastic Source Dynamics Models
Mizumachi, Mitsunori
Niyada, Katsuyuki
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2010, 14 (02) : 208 - 213
[49] A Light Source System of Multi-star Simulator with Adjustable Background and Controllable Magnitude
SUN Gaofei
LIU Shi
ZHANG Guoyu
ZHANG Yu
LEI Jie
MA Yiyuan
空间科学学报, 2017, (06) : 760 - 765
[50] Spherical-harmonics-based sound field decomposition and multichannel NMF for sound source separation
Pezzoli, Mirco
Carabias-Orti, Julio
Vera-Candeas, Pedro
Antonacci, Fabio
Sarti, Augusto
APPLIED ACOUSTICS, 2024, 218

← 1 2 3 4 5 →