Tree-structured vector quantization for speech recognition

Cited by: 6
Authors
Barszcz, M [1]
Chen, W [1]
Boulianne, G [1]
Kenny, P [1]
Affiliation
[1] INRS Telecommun, Ile Des Soeurs, PQ H3E 1H6, Canada
Source
COMPUTER SPEECH AND LANGUAGE | 2000 / Vol. 14 / Issue 3
Keywords
Acoustic signal processing - Markov processes - Mathematical models - Speech analysis - Trees (mathematics) - Vector quantization
DOI
10.1006/csla.2000.0143
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We describe some new methods for constructing discrete acoustic phonetic hidden Markov models (HMMs) using tree quantizers having very large numbers (16-64 K) of leaf nodes and tree-structured smoothing techniques. We consider two criteria for constructing tree quantizers (minimum distortion and minimum entropy) and three types of smoothing (mixture smoothing, smoothing by adding 1 and Gaussian smoothing). We show that these methods are capable of achieving recognition accuracies which are generally comparable to those obtained with Gaussian mixture HMMs at a computational cost which is only marginally greater than that of conventional discrete HMMs. We present some evidence of superior performance in situations where the number of HMM distributions to be estimated is small compared with the amount of training data. We also show how our methods can accommodate feature vectors of much higher dimensionality than are traditionally used in speech recognition. © 2000 Academic Press.
Pages: 227-239
Page count: 13
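
Illustrative sketch
The abstract mentions tree quantizers built under a minimum-distortion criterion and "smoothing by adding 1". The sketch below shows one generic way those two ideas can be combined; it is not the authors' algorithm. It assumes a binary tree grown by recursive 2-means splitting, and every name in it (`build_tree`, `quantize`, `add_one_smoothing`, the dict-based node layout) is hypothetical and chosen only for illustration.

```python
import random


def _dist2(a, b):
    # Squared Euclidean distance, used as the distortion measure.
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _mean(vectors):
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def _split(vectors, c0, c1):
    # Partition vectors by nearest of the two centroids (ties go left).
    left = [v for v in vectors if _dist2(v, c0) <= _dist2(v, c1)]
    right = [v for v in vectors if _dist2(v, c0) > _dist2(v, c1)]
    return left, right


def _two_means(vectors, rng, iters=10):
    # A few Lloyd iterations with k = 2, seeded from two random samples.
    c0, c1 = rng.sample(vectors, 2)
    for _ in range(iters):
        left, right = _split(vectors, c0, c1)
        if not left or not right:
            break
        c0, c1 = _mean(left), _mean(right)
    return c0, c1


def build_tree(vectors, depth, rng, counter=None):
    # Recursively split the data; leaves receive consecutive integer indices.
    counter = counter if counter is not None else [0]
    if depth > 0 and len(vectors) >= 2:
        c0, c1 = _two_means(vectors, rng)
        left, right = _split(vectors, c0, c1)
        if left and right:
            return {
                "c0": c0,
                "c1": c1,
                "left": build_tree(left, depth - 1, rng, counter),
                "right": build_tree(right, depth - 1, rng, counter),
            }
    leaf = {"leaf": counter[0]}
    counter[0] += 1
    return leaf


def count_leaves(tree):
    if "leaf" in tree:
        return 1
    return count_leaves(tree["left"]) + count_leaves(tree["right"])


def quantize(tree, v):
    # Descend from the root: one pair of distance computations per level.
    node = tree
    while "leaf" not in node:
        go_left = _dist2(v, node["c0"]) <= _dist2(v, node["c1"])
        node = node["left"] if go_left else node["right"]
    return node["leaf"]


def add_one_smoothing(counts):
    # Add 1 to every leaf count so that unseen leaves keep non-zero probability.
    total = sum(counts) + len(counts)
    return [(c + 1) / total for c in counts]
```

Quantizing a vector costs work proportional to the tree depth rather than to the number of leaves, which is why a tree with tens of thousands of leaves can remain cheap to search. A discrete output distribution for one HMM state could then be estimated by counting the leaf indices of that state's training vectors and passing the counts through `add_one_smoothing`.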