Background-Sound Controllable Voice Source Separation

被引:0
|
作者
Eom, Deokjun [1 ]
Nam, Woo Hyun [1 ]
Kim, Kyung-Rae [1 ]
机构
[1] Samsung Elect, Samsung Res, Suwon, South Korea
来源
关键词
background-sound controllable; voice source separation; speech separation; deep learning;
D O I
10.21437/Interspeech.2023-185
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There have been various approaches to separate mixed voices. In the real world, input voices contain many different kinds of background sounds but existing methods have not considered the background sounds in model architectures. These approaches are difficult to control the background sounds directly and the voice separation results include the background sounds randomly. In this paper, we propose an extended voice separation framework, background-sound controllable voice source separation that can control the degrees of background sounds of voice separation outputs using a control parameter that ranges from 0 to 1 without additional mixing procedures. Several experiments show the controllability of background sounds on various real world datasets with preserving voice separation performances.
引用
收藏
页码:1698 / 1702
页数:5
相关论文
共 50 条
  • [31] Music separation method based on repeating structural model and sound source separation
    Zhang, Tian
    Zhang, Tianqi
    Ge, Wanying
    Yu, Shengqi
    Shengxue Xuebao/Acta Acustica, 2020, 45 (05): : 707 - 715
  • [32] Noise source separation of diesel engine by combining binaural sound localization method and blind source separation method
    Yao, Jiachi
    Xiang, Yang
    Qian, Sichong
    Li, Shengyang
    Wu, Shaowei
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2017, 96 : 303 - 320
  • [33] Source music, background music, fantasy and reality in early sound film
    Neumeyer, D
    COLLEGE MUSIC SYMPOSIUM, 1997, 37 : 13 - 20
  • [34] Background- Source separation-Not only for astronomical images
    Guglielmetti, F.
    Fischer, R.
    Dose, V.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2013, 1553 : 30 - 37
  • [35] SOUND SOURCE SEPARATION USING SPECTROGRAM ANALYSIS BY NEURAL NETWORKS
    Doura, Tomoki
    Shiraishi, Toshihiko
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2017 VOL 13, 2018,
  • [36] Sound Source Separation for Robot Audition using Deep Learning
    Noda, Kuniaki
    Hashimoto, Naoya
    Nakadai, Kazuhiro
    Ogata, Tetsuya
    2015 IEEE-RAS 15TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2015, : 389 - 394
  • [37] VARIATIONAL EM FOR BINAURAL SOUND-SOURCE SEPARATION AND LOCALIZATION
    Deleforge, Antoine
    Forbes, Florence
    Horaud, Radu
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 76 - 80
  • [38] MONAURAL SOUND SOURCE SEPARATION USING COVARIANCE PROFILE OF PARTIALS
    Goel, Priyank
    Ramakrishnan, K. R.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2452 - 2456
  • [39] LEAST-SQUARES METHOD FOR SOURCE SOUND SEPARATION.
    Nakamichi, Kohji
    Nomura, Yasuo
    Yanagida, Masuzo
    Kakusho, Osamu
    Technology Reports of Kansai University, 1985, (26): : 51 - 59
  • [40] Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation
    Zhong, Zhi
    Shakeel, Muhammad
    Itoyama, Katsutoshi
    Nishida, Kenji
    Nakadai, Kazuhiro
    2021 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2021, : 369 - 374