SPEECH-GUIDED SOURCE SEPARATION USING A PITCH-ADAPTIVE GUIDE SIGNAL MODEL

Cited by: 0

Authors:

Hennequin, Romain [1]

Burred, Juan Jose [1]

Maller, Simon [1]

Leveau, Pierre [1]

Affiliations:

[1] Audionamix, F-75010 Paris, France

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

Audio source separation; non-negative matrix factorization; informed source separation

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

In this paper, we present a new method to perform underdetermined audio source separation using a spoken or sung reference signal to inform the separation process. This method explicitly models possible differences between the spoken reference and the target signal, such as pitch differences and time lag. We show that the proposed algorithm outperforms state-of-the-art methods.


Pages: 5
