A new independent component analysis for speech recognition and separation

Cited by: 35
Authors
Chien, Jen-Tzung [1]
Chen, Bo-Cheng [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
acoustic modeling; blind source separation (BSS); independent component analysis (ICA); nonparametric likelihood ratio (NLR); pronunciation variation; speech recognition; unsupervised learning
DOI
10.1109/TSA.2005.858061
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a novel nonparametric likelihood ratio (NLR) objective function for independent component analysis (ICA). This function is derived through the statistical hypothesis test of independence of random observations. A likelihood ratio function is developed to measure the confidence toward independence. We accordingly estimate the demixing matrix by maximizing the likelihood ratio function and apply it to transform data into independent component space. Conventionally, the test of independence was established assuming data distributions being Gaussian, which is improper to realize ICA. To avoid assuming Gaussianity in hypothesis testing, we propose a nonparametric approach where the distributions of random variables are calculated using kernel density functions. A new ICA is then fulfilled through the NLR objective function. Interestingly, we apply the proposed NLR-ICA algorithm for unsupervised learning of unknown pronunciation variations. The clusters of speech hidden Markov models are estimated to characterize multiple pronunciations of subword units for robust speech recognition. Also, the NLR-ICA is applied to separate the linear mixture of speech and audio signals. In the experiments, NLR-ICA achieves better speech recognition performance compared to parametric and nonparametric minimum mutual information ICA.
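The abstract does not give the NLR objective in closed form, so the following is only a rough illustration of the general idea it describes: scoring independence with a likelihood ratio built from kernel density estimates, then choosing a demixing transform that maximizes that score. Everything here is an assumption made for illustration, not the authors' method: the function names (`kde_logpdf`, `independence_score`, `demix_2d`), the Gaussian kernel, the fixed bandwidth `h=0.3`, the two-channel setup, and the grid search over rotation angles in place of whatever optimizer the paper uses.

```python
import numpy as np

def kde_logpdf(x, h):
    """Log of a Gaussian-kernel density estimate built from the rows of x,
    evaluated at those same rows. x has shape (n_samples, n_dims)."""
    n, d = x.shape
    sq = ((x[:, None, :] - x[None, :, :]) ** 2).sum(axis=2) / (2.0 * h * h)
    log_k = -sq - 0.5 * d * np.log(2.0 * np.pi * h * h)
    peak = log_k.max(axis=1, keepdims=True)  # log-sum-exp for stability
    return peak[:, 0] + np.log(np.exp(log_k - peak).sum(axis=1)) - np.log(n)

def independence_score(y, h=0.3):
    """Mean log ratio of (product of marginal KDEs) to (joint KDE).
    Close to 0 for independent components, negative for dependent ones.
    This is an illustrative stand-in, not the paper's NLR function."""
    y = (y - y.mean(axis=0)) / y.std(axis=0)
    log_joint = kde_logpdf(y, h)
    log_marginals = sum(kde_logpdf(y[:, [j]], h) for j in range(y.shape[1]))
    return float(np.mean(log_marginals - log_joint))

def whiten(x):
    """Center x and decorrelate it to identity covariance."""
    x = x - x.mean(axis=0)
    vals, vecs = np.linalg.eigh(np.cov(x, rowvar=False))
    return x @ vecs @ np.diag(vals ** -0.5)

def demix_2d(x, n_angles=45, h=0.3):
    """Two-channel demixing: whiten, then pick the rotation angle
    (hypothetical grid search) that maximizes the independence score."""
    z = whiten(x)
    best_score, best_y = -np.inf, z
    for theta in np.linspace(0.0, np.pi / 2.0, n_angles, endpoint=False):
        c, s = np.cos(theta), np.sin(theta)
        y = z @ np.array([[c, -s], [s, c]])
        score = independence_score(y, h)
        if score > best_score:
            best_score, best_y = score, y
    return best_y, best_score

# Toy data: two independent uniform sources, linearly mixed.
rng = np.random.default_rng(0)
sources = rng.uniform(-1.0, 1.0, size=(500, 2))
mixing = np.array([[1.0, 0.6], [0.4, 1.0]])  # assumed mixing matrix
mixed = sources @ mixing.T

recovered, best_score = demix_2d(mixed)
# Absolute correlation between each recovered channel and each true source.
match = np.abs(np.corrcoef(recovered.T, sources.T)[:2, 2:])
```

In this toy setup, `independence_score(sources)` should come out higher than `independence_score(mixed)`, and each row of `match` should contain one entry near 1, since ICA recovers sources only up to permutation, sign, and scale.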
Pages: 1245-1254
Number of pages: 10
Related papers
50 in total
  • [41] Separation of Electromagnetic Sources by the Method of Independent Component Analysis
    Leite Ferreira, Paulo Ixtanio
    Fontgalland, Glauco
    Aragao, Galba F.
    Barbin, Silvio E.
    2013 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2013, : 476 - 479
  • [42] Signal separation method using independent component analysis
    Yoshioka, M
    Omatu, S
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 753 - 756
  • [43] Separation of DOAS measurements data by independent component analysis
    Kim, HH
    Han, SH
    Bae, HD
    ADVANCES IN NONDESTRUCTIVE EVALUATION, PT 1-3, 2004, 270-273 : 703 - 708
  • [44] Independent component analysis based on machining error separation
    Zhang F.-P.
    Wu D.
    Zhang T.-G.
    Zhang L.-Y.
    Yang J.-B.
    Binggong Xuebao/Acta Armamentarii, 2016, 37 (09): : 1692 - 1699
  • [45] Independent component analysis for artefact separation in astrophysical images
    Funaro, M
    Oja, E
    Valpola, H
    NEURAL NETWORKS, 2003, 16 (3-4) : 469 - 478
  • [46] An Approach to Solving a Permutation Problem of Frequency Domain Independent Component Analysis for Blind Source Separation of Speech Signals
    Fujieda, Masaru
    Murakami, Takahiro
    Ishida, Yoshihisa
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 18, 2006, 18 : 64 - 68
  • [47] A new perspective on independent component analysis
    Zhao, H.
    Grigoriu, M.
    PROBABILISTIC ENGINEERING MECHANICS, 2015, 42 : 64 - 70
  • [48] INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT
    COMON, P
    SIGNAL PROCESSING, 1994, 36 (03) : 287 - 314
  • [49] Non-linear independent component analysis for speech recognition
    Omar, MK
    Hasegawa-Johnson, M
    CCCT 2003, VOL6, PROCEEDINGS: COMPUTER, COMMUNICATION AND CONTROL TECHNOLOGIES: III, 2003, : 204 - 209
  • [50] Independent component analysis based single channel speech enhancement
    Hong, L
    Rosca, J
    Balan, R
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 522 - 525