Tree-structured vector quantization for speech recognition

被引:6
|
作者
Barszcz, M [1 ]
Chen, W [1 ]
Boulianne, G [1 ]
Kenny, P [1 ]
机构
[1] INRS Telecommun, Ile Des Soeurs, PQ H3E 1H6, Canada
来源
COMPUTER SPEECH AND LANGUAGE | 2000年 / 14卷 / 03期
关键词
Acoustic signal processing - Markov processes - Mathematical models - Speech analysis - Trees (mathematics) - Vector quantization;
D O I
10.1006/csla.2000.0143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe some new methods for constructing discrete acoustic phonetic hidden Markov models (HMMs) using tree quantizers having very large numbers (16-64 K) of leaf nodes and tree-structured smoothing techniques. We consider two criteria for constructing tree quantizers (minimum distortion and minimum entropy) and three types of smoothing (mixture smoothing, smoothing by adding 1 and Gaussian smoothing). We show that these methods are capable of achieving recognition accuracies which are generally comparable to those obtained with Gaussian mixture HMMs at a computational cost which is only marginally greater than that of conventional discrete HMMs. We present some evidence of superior performance in situations where the number of HMM distributions to be estimated is small compared with the amount of training data. We also show how our methods can accommodate feature vectors of much higher dimensionality than are traditionally used in speech recognition. (C) 2000 Academic Press.
引用
收藏
页码:227 / 239
页数:13
相关论文
共 50 条
  • [41] ADAPTIVE ENTROPY-CODED PRUNED TREE-STRUCTURED PREDICTIVE VECTOR QUANTIZATION OF IMAGES
    KIM, YH
    MODESTINO, JW
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1993, 41 (01) : 171 - 185
  • [42] Embedded quantization of line spectral frequencies using a multistage tree-structured vector quantizer
    Chu, Wai C.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1205 - 1217
  • [43] Bit rate on demand using pruned tree-structured hierarchical lookup vector quantization
    Mukherjee, K
    Mukherjee, A
    Acharya, T
    DCC '99 - DATA COMPRESSION CONFERENCE, PROCEEDINGS, 1999, : 42 - 51
  • [44] IMAGE SEQUENCE CODING USING ADAPTIVE TREE-STRUCTURED VECTOR QUANTIZATION WITH MULTIPATH SEARCHING
    CHANG, RF
    CHEN, WT
    WANG, JS
    IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1992, 139 (01): : 9 - 14
  • [45] Image classification using adaptive-boosting and tree-structured discriminant vector quantization
    Ozonat, KM
    Gray, RM
    DCC 2004: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2004, : 556 - 556
  • [46] LOW-COMPLEXITY ENCODING OF SPEECH LSF PARAMETERS USING MULTISTAGE TREE-STRUCTURED VECTOR QUANTIZATION: APPLICATION TO THE MELP CODER
    Djamah, M.
    O'Shaughnessy, D.
    2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 1011 - 1015
  • [47] Speaker-independent speech recognition based on tree-structured speaker clustering
    Kosaka, T
    Matsunaga, S
    Sagayama, S
    COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 55 - 74
  • [48] Medical image compression using tree-structured vector quantization and fuzzy C-means
    Supot, S
    Yuttana, K
    Manas, S
    INTERNATIONAL JOURNAL OF NONLINEAR SCIENCES AND NUMERICAL SIMULATION, 2002, 3 (3-4) : 243 - 247
  • [49] DESIGN AND PERFORMANCE OF TREE-STRUCTURED VECTOR QUANTIZERS
    LIN, JH
    STORER, JA
    INFORMATION PROCESSING & MANAGEMENT, 1994, 30 (06) : 851 - 862
  • [50] Index compressed tree-structured vector quantisation
    Shanbehzadeh, J
    Ogunbona, PO
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 1999, 14 (03) : 229 - 243