Dynamic segmental vector quantization in isolated-word speech recognition

Cited by: 0

Authors:

Nhat, VDM [1]

Lee, S [1]

Affiliations:

[1] Kyung Hee Univ, Dept Comp Engn, Yongin 449701, Gyeonggi Do, South Korea

Source:

PROCEEDINGS OF THE FOURTH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY | 2004

Keywords:

dynamic segmental vector quantization; segmentation scheme; speech recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The standard vector quantization (VQ) approach, which uses a single vector quantizer for the entire duration of the utterances of each class, suffers from two limitations: 1) high computational cost for large codebook sizes and 2) lack of explicit characterization of sequential behavior. Both disadvantages can be remedied by treating each utterance class as a concatenation of several information sub-sources, each of which is represented by its own VQ codebook. This approach requires utterances to be divided into segments, so segmentation schemes need to be investigated. We call this approach Dynamic Segmental Vector Quantization (DSVQ). This paper shows how to design DSVQ with several effective segmentation schemes. In isolated-word speech recognition, better performance is observed when this approach is applied on its own or combined with a Hidden Markov Model (HMM).
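The idea of one codebook per segment can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not describe the segmentation schemes, so the sketch assumes a simple uniform split, uses plain k-means as a stand-in for codebook design, and classifies by the lowest summed distortion. All function names (`uniform_segments`, `train_codebook`, `train_class`, `classify`) are hypothetical.

```python
import numpy as np

def uniform_segments(frames, n_segments):
    """Split a (T, D) frame sequence into contiguous chunks.
    Assumption: uniform splitting; the paper's schemes are not given in the abstract."""
    return np.array_split(frames, n_segments, axis=0)

def train_codebook(vectors, size, n_iter=20, seed=0):
    """Plain k-means (Lloyd) codebook, used here as a stand-in for VQ design."""
    rng = np.random.default_rng(seed)
    codebook = vectors[rng.choice(len(vectors), size=size, replace=False)].copy()
    for _ in range(n_iter):
        dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(size):
            members = vectors[nearest == k]
            if len(members):  # keep the old codeword if a cell is empty
                codebook[k] = members.mean(axis=0)
    return codebook

def distortion(vectors, codebook):
    """Total squared error of quantizing each vector to its nearest codeword."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).sum()

def train_class(utterances, n_segments, size):
    """Build one codebook per segment position from a class's training utterances."""
    pooled = [[] for _ in range(n_segments)]
    for utt in utterances:
        for s, seg in enumerate(uniform_segments(utt, n_segments)):
            pooled[s].append(seg)
    return [train_codebook(np.vstack(p), size) for p in pooled]

def classify(utt, models, n_segments):
    """Pick the class whose segment codebooks give the lowest summed distortion."""
    segs = uniform_segments(utt, n_segments)
    scores = {
        label: sum(distortion(seg, cb) for seg, cb in zip(segs, codebooks))
        for label, codebooks in models.items()
    }
    return min(scores, key=scores.get)
```

Because each codebook only sees frames from its own segment, two words built from the same sounds in a different order receive different models, which a single whole-utterance codebook could not separate. Each frame is also compared against one small codebook rather than one large one.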


Pages: 204-208

Page count: 5
