Robust speech recognition with on-line unsupervised acoustic feature compensation

被引:0
|
作者
Buera, Luis [1 ]
Miguel, Antonio [1 ]
Lleida, Eduardo [1 ]
Saz, Oscar [1 ]
Ortega, Alfonso [1 ]
机构
[1] Univ Zaragoza, GTC, E-50009 Zaragoza, Spain
关键词
robust speech recognition; feature vector normalization; acoustic model adaptation;
D O I
10.1109/ASRU.2007.4430092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An on-line unsupervised hybrid compensation technique is proposed to reduce the mismatch between training and testing conditions. It combines Multi-Environment Model based LInear Normalization with cross-probability model based on GMMs (MEMLIN CPM) with a novel acoustic model adaptation method based on rotation transformations. Hence, a set of rotation transformations is estimated with clean and MEMLIN CPM-normalized training data by linear regression in an unsupervised process. Thus, in testing, each MEMLIN CPM normalized frame is decoded using a modified Viterbi algorithm and expanded acoustic models, which are obtained from the reference ones and the set of rotation transformations. To test the proposed solution, some experiments with Spanish SpeechDat Car database were carried out. MEMLIN CPM over standard ETSI front-end parameters reaches 83.89% of average improvement in WER, while the introduced hybrid solution goes up to 92.07%. Also, the proposed hybrid technique was tested with Aurora 2 database, obtaining an average improvement of 68.88% with clean training.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [41] A new on-line robust approach to design noise-immune speech recognition systems
    Vargas, F
    Fagundes, RDR
    Barros, D
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2003, 19 (01): : 61 - 72
  • [42] A New On-Line Robust Approach to Design Noise-Immune Speech Recognition Systems
    Fabian Vargas
    Rubem D.R. Fagundes
    Daniel Barros
    Journal of Electronic Testing, 2003, 19 : 61 - 72
  • [43] Geometrical feature extraction for robust speech recognition
    Li, Xiaokun
    Kwan, Chiman
    2005 39TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2005, : 558 - 562
  • [44] Feature Adaptation for Robust Mobile Speech Recognition
    Lee, Hyeopwoo
    Yook, Dongsuk
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
  • [45] Noise Robust Recognition of Depression Status and Treatment Response from Speech via Unsupervised Feature Aggregation
    Gerczuk, Maurice
    Amiriparian, Shahin
    Kathan, Alexander
    Bauer, Jonathan
    Berking, Matthias
    Schuller, Bjoern W.
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [46] Temporal structure normalization of speech feature for robust speech recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 500 - 503
  • [47] Feature-dependent compensation of coders in speech recognition
    Yoma, NB
    Molina, C
    SIGNAL PROCESSING, 2006, 86 (01) : 38 - 49
  • [48] An on-line adaptive neural network for speech recognition
    Zhang L.-P.
    Li L.M.
    Chi Z.
    International Journal of Speech Technology, 1998, 2 (3) : 241 - 248
  • [49] On-Line Noise Suppression for Enhancing Speech Recognition
    Urbanowicz, K.
    Fronczak, P.
    Holyst, J. A.
    ACTA PHYSICA POLONICA A, 2009, 116 (02) : 119 - 126
  • [50] On-Line Sketch Recognition Using Direction Feature
    Deng, Wei
    Wu, Lingda
    Yu, Ronghuan
    Lai, Jiazhe
    HUMAN-COMPUTER INTERACTION - INTERACT 2013, PT III, 2013, 8119 : 259 - 266