Informed source separation through spectrogram coding and data embedding

Cited by: 41

Authors:

Liutkus, Antoine [1]

Pinel, Jonathan [2]

Badeau, Roland [1]

Girin, Laurent [2]

Richard, Gael [1]

Affiliations:

[1] Telecom ParisTech, CNRS LTCI, Inst Telecom, F-75014 Paris, France

[2] Grenoble Inst Technol, F-38402 Grenoble, France

Source:

SIGNAL PROCESSING | 2012, Vol. 92, No. 8

Keywords:

Audio source separation; Wiener filtering; Data embedding; NTF; Nonnegative matrix factorization; Watermarking-based method; Audio; Algorithms; Mixtures

DOI:

10.1016/j.sigpro.2011.09.016

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

We address underdetermined source separation in a particular informed configuration in which both the sources and the mixtures are known during a so-called encoding stage. This knowledge enables the computation of side information that is small enough to be inaudibly embedded into the mixtures. At the decoding stage, the sources are no longer assumed to be known; only the mixtures and the extracted side information are processed for source separation. The proposed system models the sources as independent and locally stationary Gaussian processes (GP) and the mixing process as linear filtering. This model allows reliable estimation of the sources through generalized Wiener filtering, provided their spectrograms are known. As these spectrograms are too large to be embedded in the mixtures, we show how they can be efficiently approximated using either Nonnegative Tensor Factorization (NTF) or image compression. The system uses a high-capacity embedding method to inaudibly embed the separation side information into the mixtures. This method applies the Quantization Index Modulation technique to the time-frequency coefficients of the mixtures and reaches embedding rates of about 250 kbps. Finally, a study of the performance of the full system is presented. (c) 2011 Elsevier B.V. All rights reserved.
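
The Wiener filtering step mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes a single-channel mixture that is the plain sum of the sources, whereas the abstract describes a more general linear-filtering mixing model, and it assumes the source power spectrograms are available exactly rather than as NTF or image-compressed approximations. The function name `wiener_separate` and all array shapes are hypothetical, chosen only for illustration.

```python
import numpy as np


def wiener_separate(mixture_tf, source_psds, eps=1e-12):
    """Estimate source time-frequency coefficients from a mixture.

    Hypothetical helper, assuming a single-channel mixture equal to the
    sum of the sources (a simplification of the paper's mixing model).

    mixture_tf  : complex array of shape (F, T), mixture coefficients.
    source_psds : nonnegative array of shape (J, F, T), one power
                  spectrogram per source.
    Returns a complex array of shape (J, F, T) of source estimates.
    """
    source_psds = np.asarray(source_psds, dtype=float)
    # Total power in each time-frequency bin; eps avoids division by zero.
    total = source_psds.sum(axis=0) + eps
    # Each source receives the fraction of the mixture that matches its
    # share of the total power in that bin.
    gains = source_psds / total
    return gains * mixture_tf[np.newaxis, :, :]


# Toy data: three sparse complex "sources" on an 8 x 5 time-frequency grid.
rng = np.random.default_rng(0)
J, F, T = 3, 8, 5
sources = rng.standard_normal((J, F, T)) + 1j * rng.standard_normal((J, F, T))
sources = sources * (rng.random((J, F, T)) < 0.4)
mixture = sources.sum(axis=0)

# In the informed setting, the spectrograms come from the side information;
# here the exact ones are used as a stand-in.
psds = np.abs(sources) ** 2
estimates = wiener_separate(mixture, psds)
```

Because the gains in each bin sum to (almost exactly) one, the estimates add back up to the mixture; how close each estimate is to its true source depends on how much the sources overlap in the time-frequency plane.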


Pages: 1937-1949

Page count: 13

50 items in total

[1] INFORMED SOURCE SEPARATION: SOURCE CODING MEETS SOURCE SEPARATION
Ozerov, Alexey
Liutkus, Antoine
Badeau, Roland
Richard, Gael
2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 257 - 260
[2] SPATIAL CODING-BASED INFORMED SOURCE SEPARATION
Liutkus, Antoine
Ozerov, Alexey
Badeau, Roland
Richard, Gael
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2407 - 2411
[3] PERCEPTUAL CODING-BASED INFORMED SOURCE SEPARATION
Kirbiz, Serap
Ozerov, Alexey
Liutkus, Antoine
Girin, Laurent
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 959 - 963
[4] INFORMED SOURCE SEPARATION OF UNDERDETERMINED INSTANTANEOUS STEREO MIXTURES USING SOURCE INDEX EMBEDDING
Parvaix, Mathieu
Girin, Laurent
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 245 - 248
[5] SCORE INFORMED AUDIO SOURCE SEPARATION USING A PARAMETRIC MODEL OF NON-NEGATIVE SPECTROGRAM
Hennequin, Romain
David, Bertrand
Badeau, Roland
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 45 - 48
[6] Watermarking Algorithm Based on Informed Coding and Informed Embedding
Wang, Xin
Liang, Chao
Liu, Zhigang
COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION III, 2014, 443 : 566 - +
[7] Using informed coding and informed embedding to design robust fingerprinting embedding systems
Tomas-Buliart, Joan
Fernandez, Marcel
Soriano, Miguel
KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT III, PROCEEDINGS, 2007, 4694 : 992 - +
[8] KERNEL SPECTROGRAM MODELS FOR SOURCE SEPARATION
Liutkus, Antoine
Rafii, Zafar
Pardo, Bryan
Fitzgerald, Derry
Daudet, Laurent
2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 6 - 10
[9] INFORMED MONAURAL SOURCE SEPARATION OF MUSIC BASED ON CONVOLUTIONAL SPARSE CODING
Jao, Ping-Keng
Yang, Yi-Hsuan
Wohlberg, Brendt
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 236 - 240
[10] A robust watermarking scheme based on informed coding and informed embedding
Coria-Mendoza, L
Nasiopoulos, P
Ward, R
2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 1117 - 1120
