Gain Adapted Optimum Mixture Estimation Scheme for Single Channel Speech Separation

被引:2
|
作者
Kapoor, Divneet Singh [1 ]
Kohli, Amit Kumar [2 ]
机构
[1] Chandigarh Grp Coll, Dept Elect & Commun Engn, Gharuan, Mohali, India
[2] Thapar Univ, Dept Elect & Commun Engn, Patiala 147004, Punjab, India
关键词
Single channel speech separation (SCSS); Optimum mixture estimator; Mixture-maximization (MixMax); Quadratic estimator; Gain adaptation; BLIND SOURCE SEPARATION; SEGREGATION; RECOGNITION; DRIVEN; SOUND;
D O I
10.1007/s00034-013-9566-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents the proof of an Optimum mixture estimator for the single channel speech separation problem, which is a technique for separating two speech signals from a single recording of their mixture. The presented work is an attempt to solve a fundamental limitation in the current single channel speech separation techniques, in which it is assumed that the data used in the training as well as test phases of the separation model have the same energy levels. To overcome this limitation, a gain adapted Optimum mixture estimator is derived, which estimates the mixture of speech signals under the different signal-to-signal ratios (SSRs). Specifically, the speakers' gains are incorporated as unknown parameters into the separation model, and then the estimator is derived in terms of the source distributions and SSR. It is demonstrated that the use of the Optimum mixture estimator results in the lower estimation error than the non-linear mapping (log and inverse-log operations)-based Mixture-Maximization (MixMax) or Quadratic estimators. The experimental results based on the real speech data also depict that the proposed estimator improves the mixture estimation performance significantly when compared with MixMax or Quadratic estimators with the gain adaptation.
引用
收藏
页码:2335 / 2351
页数:17
相关论文
共 50 条
  • [21] Single Channel Speech Separation Based on Sinusoidal Modeling
    Wiem, Belhedi
    anouar, Ben messaoud Mohamed
    Aicha, Bouzid
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 672 - 676
  • [22] A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks
    Wang, Yannan
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1535 - 1546
  • [23] Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method
    Mahmoodzadeh, Azar
    Abutalebi, Hamid Reza
    Soltanian-Zadeh, Hamid
    Sheikhzadeh, Hamid
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [24] Impact of phase estimation on single-channel speech separation based on time-frequency masking
    Mayer, Florian
    Williamson, Donald S.
    Mowlaee, Pejman
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (06): : 4668 - 4679
  • [25] Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method
    Azar Mahmoodzadeh
    Hamid Reza Abutalebi
    Hamid Soltanian-Zadeh
    Hamid Sheikhzadeh
    EURASIP Journal on Advances in Signal Processing, 2012
  • [26] Single channel speech separation with a frame-based pitch range estimation method in modulation frequency
    Mahmoodzadeh A.
    Abutalebi H.R.
    Soltanian-Zadeh H.
    Sheikhzadeh H.
    2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 609 - 613
  • [27] A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation
    Radfar, Mohammad H.
    Dansereau, RichardM.
    Sayadiyan, Abolghasem
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [28] A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation
    Mohammad H. Radfar
    Richard M. Dansereau
    Abolghasem Sayadiyan
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [29] Single-Channel Speech Separation Focusing on Attention DE
    Li, Xinshu
    Tan, Zhenhua
    Xia, Zhenche
    Wu, Danke
    Zhang, Bin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3204 - 3209
  • [30] Improved Phase Reconstruction in Single-Channel Speech Separation
    Mayer, Florian
    Mowlaee, Pejman
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1795 - 1799