Informed source separation through spectrogram coding and data embedding

Cited by: 41
Authors
Liutkus, Antoine [1]
Pinel, Jonathan [2]
Badeau, Roland [1]
Girin, Laurent [2]
Richard, Gael [1]
Affiliations
[1] Telecom ParisTech, CNRS LTCI, Inst Telecom, F-75014 Paris, France
[2] Grenoble Inst Technol, F-38402 Grenoble, France
Keywords
Audio source separation; Wiener filtering; Data embedding; NTF; NONNEGATIVE MATRIX FACTORIZATION; WATERMARKING-BASED METHOD; AUDIO; ALGORITHMS; MIXTURES;
DOI
10.1016/j.sigpro.2011.09.016
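Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract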
We address the issue of underdetermined source separation in a particular informed configuration where both the sources and the mixtures are known during a so-called encoding stage. This knowledge enables the computation of side-information that is small enough to be inaudibly embedded into the mixtures. At the decoding stage, the sources are no longer assumed to be known; only the mixtures and the extracted side-information are processed for source separation. The proposed system models the sources as independent and locally stationary Gaussian processes (GP) and the mixing process as linear filtering. This model allows reliable estimation of the sources through generalized Wiener filtering, provided their spectrograms are known. As these spectrograms are too large to be embedded in the mixtures, we show how they can be efficiently approximated using either Nonnegative Tensor Factorization (NTF) or image compression. The system uses a high-capacity embedding method to inaudibly embed the separation side-information into the mixtures. This method applies the Quantization Index Modulation technique to the time-frequency coefficients of the mixtures and reaches embedding rates of about 250 kbps. Finally, a study of the performance of the full system is presented. (c) 2011 Elsevier B.V. All rights reserved.
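The Wiener filtering step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a strong simplification that is not the paper's full model: a single-channel mixture that is the plain sum of the sources, so each source estimate is the mixture coefficient scaled by that source's share of the total power in the time-frequency bin. The function name `wiener_separate`, the `eps` regularizer, and the toy values are hypothetical and chosen only for illustration.

```python
def wiener_separate(mixture, source_psds, eps=1e-12):
    """Estimate source STFTs from a single-channel mixture STFT.

    mixture:      list of frames, each a list of complex coefficients.
    source_psds:  one nonnegative spectrogram per source, same shape
                  as `mixture` (assumed known, e.g. from side-information).
    eps:          small constant that avoids division by zero (assumption).
    """
    estimates = [[[0j] * len(frame) for frame in mixture] for _ in source_psds]
    for t, frame in enumerate(mixture):
        for f, x in enumerate(frame):
            # Total modeled power in this time-frequency bin.
            total = sum(psd[t][f] for psd in source_psds) + eps
            for j, psd in enumerate(source_psds):
                # Each source receives its proportional share of the mixture.
                estimates[j][t][f] = (psd[t][f] / total) * x
    return estimates


# Toy example: 2 frames, 2 frequency bins, 2 sources (made-up numbers).
mixture = [[1 + 1j, 2 + 0j], [0.5j, -1 + 0j]]
source_psds = [
    [[3.0, 0.0], [1.0, 1.0]],  # source 0
    [[1.0, 2.0], [1.0, 3.0]],  # source 1
]
estimates = wiener_separate(mixture, source_psds)
```

In this simplified setting the estimates sum back to the mixture in every bin (up to the effect of `eps`), which is why the quality of the separation depends entirely on how well the spectrograms are approximated.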
Pages: 1937-1949
Number of pages: 13