A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques

Cited by: 0
Authors
Tokuda, K [1]
Masuko, T [1]
Hiroi, J [1]
Kobayashi, T [1]
Kitamura, T [1]
Affiliations
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi 466, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper presents a very low bit rate speech coder based on hidden Markov models (HMMs). The encoder performs phoneme recognition and transmits phoneme indexes, state durations, and pitch information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indexes, and a sequence of mel-cepstral coefficient vectors is generated from the concatenated HMM using an ML-based (maximum likelihood) speech parameter generation technique. Finally, synthetic speech is obtained by exciting the MLSA (Mel Log Spectrum Approximation) filter, whose coefficients are given by the mel-cepstral coefficients, according to the pitch information. A subjective listening test shows that the performance of the proposed coder at about 150 bit/s (for test data including 26% silence regions) is comparable to that of a VQ-based vocoder at 400 bit/s (= 8 bit/frame × 50 frame/s), without pitch quantization for either coder.
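The bit-rate comparison in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function and variable names are hypothetical, and the 150 bit/s figure is taken as the approximate value quoted in the abstract rather than derived from the coder itself. Both rates exclude pitch information, as the abstract states.

```python
def vq_vocoder_bit_rate(bits_per_frame: int, frames_per_second: int) -> int:
    """Spectral bit rate of a frame-based VQ vocoder, in bit/s (pitch excluded)."""
    return bits_per_frame * frames_per_second


# Figures quoted in the abstract: 8 bit/frame at 50 frame/s.
vq_rate = vq_vocoder_bit_rate(bits_per_frame=8, frames_per_second=50)

# Approximate rate reported for the HMM-based coder (test data with 26% silence).
hmm_rate_approx = 150

# Fraction of the VQ baseline rate used by the HMM-based coder.
rate_ratio = hmm_rate_approx / vq_rate

print(f"VQ baseline: {vq_rate} bit/s")
print(f"HMM-based coder: about {hmm_rate_approx} bit/s")
print(f"Ratio: {rate_ratio:.3f}")
```

Under these figures the HMM-based coder uses a little over a third of the baseline rate while, according to the listening test, delivering comparable quality.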
Pages: 609-612
Number of pages: 4