Dynamic segmental vector quantization in isolated-word speech recognition

被引:0
|
作者
Nhat, VDM [1 ]
Lee, S [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Engn, Yongin 449701, Gyeonggi Do, South Korea
来源
PROCEEDINGS OF THE FOURTH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY | 2004年
关键词
dynamic segmental vector quantization; segmentation scheme; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The standard Vector Quantization (VQ) approach that uses a single vector quantizer for each entire duration of the utterance of each class suffers from the following two limitations: 1) high computational cost for large codehook sizes and 2) lack of explicit characterization of the sequential behavior. Both of two these disadvantages can be remedied by treating each utterance class as a concatenation of several information sub-sources, each of which is represented by a VQ codebook. With this approach, segmentation schemes obviously need to be investigated. And we call this VQ approach Dynamic Segmental Vector Quantization (DSVQ). This paper shows how to design DSVQ with some effective segmentation schemes. Better performances could be seen when applying this approach itself or mixed with Hidden Markov Model (HMM) in isolated-word speech recognition.
引用
收藏
页码:204 / 208
页数:5
相关论文
共 50 条
  • [31] WHOLE-WORD SEGMENTAL SPEECH RECOGNITION WITH ACOUSTIC WORD EMBEDDINGS
    Shi, Bowen
    Settle, Shane
    Livescu, Karen
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 164 - 171
  • [32] Development of HMM/neural network-based medium-vocabulary isolated-word Lithuanian speech recognition system
    Filipovic, M
    Lipeika, A
    INFORMATICA, 2004, 15 (04) : 465 - 474
  • [33] Matrix quantization with vector quantization error compensation for robust speech recognition
    Cong, L
    Asghar, S
    1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 131 - 136
  • [34] Real-Time Implementation of Isolated-Word Speech Recognition System on Raspberry Pi 3 Using WAT-MFCC
    Walid, Mohamed
    Bousselmi, Souha
    Dabbabi, Karim
    Cherif, Adnen
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (03): : 42 - 49
  • [35] Trellis encoded vector quantization for robust speech recognition
    Chou, W
    Seshadri, N
    Rahim, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2001 - 2004
  • [36] Kernel based clustering and vector quantization for speech recognition
    Satish, DS
    Sekhar, CC
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 315 - 324
  • [37] Tree-structured vector quantization for speech recognition
    Barszcz, M
    Chen, W
    Boulianne, G
    Kenny, P
    COMPUTER SPEECH AND LANGUAGE, 2000, 14 (03): : 227 - 239
  • [38] EXPERIMENTS FOR ISOLATED-WORD RECOGNITION WITH SINGLE-LAYER AND 2-LAYER PERCEPTRONS
    KAMMERER, BR
    KUPPER, WA
    NEURAL NETWORKS, 1990, 3 (06) : 693 - 706
  • [39] Dynamic features for segmental speech recognition.
    Harte, N
    Vaseghi, SV
    Milner, B
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 933 - 936
  • [40] Development of isolated word speech recognition system
    Lipeika, A
    Lipeikiene, J
    Telksnys, L
    INFORMATICA, 2002, 13 (01) : 37 - 46