SPEECH-GUIDED SOURCE SEPARATION USING A PITCH-ADAPTIVE GUIDE SIGNAL MODEL

被引:0
|
作者
Hennequin, Romain [1 ]
Burred, Juan Jose [1 ]
Maller, Simon [1 ]
Leveau, Pierre [1 ]
机构
[1] Audionamix, F-75010 Paris, France
关键词
Audio source separation; non-negative matrix factorization; informed source separation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a new method to perform underdetermined audio source separation using a spoken or sung reference signal to inform the separation process. This method explicitly models possible differences between the spoken reference and the target signal, such as pitch differences and time lag. We show that the proposed algorithm outperforms state-of-the art methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Exploiting temporal correlation in pitch-adaptive speech enhancement
    Stahl, Johannes
    Mowlaee, Pejman
    SPEECH COMMUNICATION, 2019, 111 : 1 - 13
  • [2] Assessment of pitch-adaptive front-end signal processing for children's speech recognition
    Sinha, Rohit
    Shahnawazuddin, S.
    COMPUTER SPEECH AND LANGUAGE, 2018, 48 : 103 - 121
  • [3] Kalman-filters in subbands for noise reduction with enhanced pitch-adaptive speech model estimation
    Puder, H
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 2002, 13 (02): : 139 - 148
  • [4] PGSS: Pitch-Guided Speech Separation
    Li, Xiang
    Wang, Yiwen
    Sun, Yifan
    Wu, Xihong
    Chen, Jing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13130 - 13138
  • [5] Pitch modification of speech signal using source filter model by Linear Prediction for prosodic transformations
    Faycal, Ykhlef
    Guertei, Mhania
    Bensebti, Mesaoud
    PROCEEDINGS OF FUTURE GENERATION COMMUNICATION AND NETWORKING, MAIN CONFERENCE PAPERS, VOL 1, 2007, : 413 - 418
  • [6] PITCH-ADAPTIVE DPCM CODING OF SPEECH WITH 2-BIT QUANTIZATION AND FIXED SPECTRUM PREDICTION
    JAYANT, NS
    BELL SYSTEM TECHNICAL JOURNAL, 1977, 56 (03): : 439 - 454
  • [7] Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition
    Shahnawazuddin, S.
    Adiga, Nagaraj
    Kathania, Hemant K.
    Pradhan, Gaydhar
    Sinha, Rohit
    DIGITAL SIGNAL PROCESSING, 2018, 79 : 142 - 151
  • [8] Multichannel speech separation using adaptive parameterization of source PDFs
    Kokkinakis, K
    Nandi, AK
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 486 - 493
  • [9] A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION
    Bouvier, Damien
    Obin, Nicolas
    Liuni, Marco
    Roebel, Axel
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 131 - 135
  • [10] PITCH TRACKING FOR MODEL-BASED SPEECH SEPARATION
    Lee, S. W.
    Soong, Frank K.
    Ching, P. C.
    Lee, Tan
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 145 - 148