Gain Adapted Optimum Mixture Estimation Scheme for Single Channel Speech Separation

被引:2
|
作者
Kapoor, Divneet Singh [1 ]
Kohli, Amit Kumar [2 ]
机构
[1] Chandigarh Grp Coll, Dept Elect & Commun Engn, Gharuan, Mohali, India
[2] Thapar Univ, Dept Elect & Commun Engn, Patiala 147004, Punjab, India
关键词
Single channel speech separation (SCSS); Optimum mixture estimator; Mixture-maximization (MixMax); Quadratic estimator; Gain adaptation; BLIND SOURCE SEPARATION; SEGREGATION; RECOGNITION; DRIVEN; SOUND;
D O I
10.1007/s00034-013-9566-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents the proof of an Optimum mixture estimator for the single channel speech separation problem, which is a technique for separating two speech signals from a single recording of their mixture. The presented work is an attempt to solve a fundamental limitation in the current single channel speech separation techniques, in which it is assumed that the data used in the training as well as test phases of the separation model have the same energy levels. To overcome this limitation, a gain adapted Optimum mixture estimator is derived, which estimates the mixture of speech signals under the different signal-to-signal ratios (SSRs). Specifically, the speakers' gains are incorporated as unknown parameters into the separation model, and then the estimator is derived in terms of the source distributions and SSR. It is demonstrated that the use of the Optimum mixture estimator results in the lower estimation error than the non-linear mapping (log and inverse-log operations)-based Mixture-Maximization (MixMax) or Quadratic estimators. The experimental results based on the real speech data also depict that the proposed estimator improves the mixture estimation performance significantly when compared with MixMax or Quadratic estimators with the gain adaptation.
引用
收藏
页码:2335 / 2351
页数:17
相关论文
共 50 条
  • [41] INVESTIGATION OF A PARAMETRIC GAIN APPROACH TO SINGLE-CHANNEL SPEECH ENHANCEMENT
    Huang, Gongping
    Chen, Jingdong
    Benesty, Jacob
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 206 - 210
  • [42] Speech separation based on Gaussian mixture model probability density function estimation
    Yu, Xiao
    Hu, Guangrui
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (02): : 177 - 180
  • [43] Effect of speech priors in single-channel speech-music separation for ASR
    Demir, Cemil
    Cemgil, A. Taylan
    Saraclar, Murat
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1234 - 1237
  • [44] INFLUENCE OF INTERNAL ENERGY SAVING ON SELECTION OF OPTIMUM SCHEME OF HEATING FOR MIXTURE SEPARATION IN FRACTIONATING COLUMN
    Zakharov, M. K.
    Boichuk, A. A.
    CHEMICAL AND PETROLEUM ENGINEERING, 2019, 54 (11-12) : 901 - 909
  • [45] Influence of Internal Energy Saving on Selection of Optimum Scheme of Heating for Mixture Separation in Fractionating Column
    M. K. Zakharov
    A. A. Boichuk
    Chemical and Petroleum Engineering, 2019, 54 : 901 - 909
  • [46] Iterative blind separation of single channel speech separation based on GMM and Bayesian theory
    Institute of Signal Processing and Transmission, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
    Nanjing Youdian Daxue Xuebao (Ziran Kexue Ban)/Journal of Nanjing University of Posts and Telecommunications (Natural Science), 2008, 28 (06): : 1 - 5
  • [47] A single-channel mixture signal separation and simulation based on ISBF
    Meng, Qingjin
    Cheng, Xiefeng
    Tao, Yewei
    Xing, Baoling
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 1316 - 1320
  • [48] Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Kinnunen, T.
    Franti, P.
    Jensen, S. H.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 684 - +
  • [49] Performance Comparison of HMM and VQ Based Single Channel Speech Separation
    Radfar, M. H.
    Chan, W-Y
    Dansereau, R. M.
    Wong, W.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1927 - 1930
  • [50] Deep Clustering in Complex Domain for Single-Channel Speech Separation
    Liu, Runling
    Tang, Yu
    Mang, Hongwei
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1463 - 1468