Informed source separation through spectrogram coding and data embedding

Cited by: 41
Authors
Liutkus, Antoine [1]
Pinel, Jonathan [2]
Badeau, Roland [1]
Girin, Laurent [2]
Richard, Gael [1]
Affiliations
[1] Telecom ParisTech, CNRS LTCI, Inst Telecom, F-75014 Paris, France
[2] Grenoble Inst Technol, F-38402 Grenoble, France
Keywords
Audio source separation; Wiener filtering; Data embedding; NTF; Nonnegative matrix factorization; Watermarking-based method; Audio; Algorithms; Mixtures
DOI
10.1016/j.sigpro.2011.09.016
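Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract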
We address the issue of underdetermined source separation in a particular informed configuration where both the sources and the mixtures are known during a so-called encoding stage. This knowledge enables the computation of side information that is small enough to be inaudibly embedded into the mixtures. At the decoding stage, the sources are no longer assumed to be known; only the mixtures and the extracted side information are processed for source separation. The proposed system models the sources as independent and locally stationary Gaussian processes (GP) and the mixing process as linear filtering. This model allows reliable estimation of the sources through generalized Wiener filtering, provided their spectrograms are known. As these spectrograms are too large to be embedded in the mixtures, we show how they can be efficiently approximated using either Nonnegative Tensor Factorization (NTF) or image compression. The system uses a high-capacity embedding method to inaudibly embed the separation side information into the mixtures. This method applies the Quantization Index Modulation technique to the time-frequency coefficients of the mixtures and achieves embedding rates of about 250 kbps. Finally, a study of the performance of the full system is presented. (c) 2011 Elsevier B.V. All rights reserved.
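The Wiener filtering step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a simplified setting that is not the paper's full model: a single-channel mixture that is the plain sum of the source STFTs (no mixing filters), with the source power spectrograms known exactly. The function name `wiener_separate`, the `eps` regularizer, and the toy data are hypothetical and chosen only for illustration.

```python
import numpy as np

def wiener_separate(mix_stft, source_psds, eps=1e-12):
    """Estimate source STFTs from a single-channel mixture STFT.

    mix_stft:    complex array of shape (F, T), the mixture STFT.
    source_psds: nonnegative array of shape (J, F, T), one power
                 spectrogram per source (assumed known here).
    Returns a complex array of shape (J, F, T).
    """
    # Total power at each time-frequency bin (eps avoids division by zero).
    total = source_psds.sum(axis=0) + eps
    # Wiener gain of each source: its share of the total power.
    gains = source_psds / total
    # Apply each real-valued gain to the complex mixture coefficients.
    return gains * mix_stft[None, :, :]

# Toy data: J sources drawn as complex Gaussians with the given variances.
rng = np.random.default_rng(0)
J, F, T = 3, 4, 5
psds = rng.uniform(0.1, 1.0, size=(J, F, T))
noise = rng.standard_normal((J, F, T)) + 1j * rng.standard_normal((J, F, T))
sources = np.sqrt(psds / 2) * noise
mix = sources.sum(axis=0)

est = wiener_separate(mix, psds)
```

In this sketch the gains at each time-frequency bin sum to one, so the estimates add back up to the mixture. In the informed setting described above, the decoder would use spectrograms reconstructed from the embedded side information rather than the exact ones.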
Pages: 1937-1949
Number of pages: 13
Related papers
50 in total
  • [1] INFORMED SOURCE SEPARATION: SOURCE CODING MEETS SOURCE SEPARATION
    Ozerov, Alexey
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 257 - 260
  • [2] SPATIAL CODING-BASED INFORMED SOURCE SEPARATION
    Liutkus, Antoine
    Ozerov, Alexey
    Badeau, Roland
    Richard, Gael
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2407 - 2411
  • [3] PERCEPTUAL CODING-BASED INFORMED SOURCE SEPARATION
    Kirbiz, Serap
    Ozerov, Alexey
    Liutkus, Antoine
    Girin, Laurent
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 959 - 963
  • [4] INFORMED SOURCE SEPARATION OF UNDERDETERMINED INSTANTANEOUS STEREO MIXTURES USING SOURCE INDEX EMBEDDING
    Parvaix, Mathieu
    Girin, Laurent
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 245 - 248
  • [5] SCORE INFORMED AUDIO SOURCE SEPARATION USING A PARAMETRIC MODEL OF NON-NEGATIVE SPECTROGRAM
    Hennequin, Romain
    David, Bertrand
    Badeau, Roland
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 45 - 48
  • [6] Watermarking Algorithm Based on Informed Coding and Informed Embedding
    Wang, Xin
    Liang, Chao
    Liu, Zhigang
    COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION III, 2014, 443 : 566 - +
  • [7] Using informed coding and informed embedding to design robust fingerprinting embedding systems
    Tomas-Buliart, Joan
    Fernandez, Marcel
    Soriano, Miguel
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT III, PROCEEDINGS, 2007, 4694 : 992 - +
  • [8] KERNEL SPECTROGRAM MODELS FOR SOURCE SEPARATION
    Liutkus, Antoine
    Rafii, Zafar
    Pardo, Bryan
    Fitzgerald, Derry
    Daudet, Laurent
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 6 - 10
  • [9] INFORMED MONAURAL SOURCE SEPARATION OF MUSIC BASED ON CONVOLUTIONAL SPARSE CODING
    Jao, Ping-Keng
    Yang, Yi-Hsuan
    Wohlberg, Brendt
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 236 - 240
  • [10] A robust watermarking scheme based on informed coding and informed embedding
    Coria-Mendoza, L
    Nasiopoulos, P
    Ward, R
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 1117 - 1120