An application of recurrent neural networks to low bit rate speech coding

被引:0
|
作者
Kohata, M
机构
来源
ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows a good performance of spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20 similar to 30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality.
引用
收藏
页码:57 / 60
页数:4
相关论文
共 50 条
  • [31] Multisensor very low bit rate speech coding using segment quantization
    McCree, Alan
    Brady, Kevin
    Quatieri, Thomas F.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3997 - +
  • [32] Multi stage matrix quantization for very low bit rate speech coding
    Ozaydin, S
    Baykal, B
    2001 IEEE THIRD WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, PROCEEDINGS, 2001, : 372 - 375
  • [33] A STREAMWISE GAN VOCODER FOR WIDEBAND SPEECH CODING AT VERY LOW BIT RATE
    Mustafa, Ahmed
    Buethe, Jan
    Korse, Srikanth
    Gupta, Kishan
    Fuchs, Guillaume
    Pia, Nicola
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 66 - 70
  • [34] OBJECTIVE QUALITY EVALUATION FOR LOW-BIT-RATE SPEECH CODING SYSTEMS
    KITAWAKI, N
    NAGABUCHI, H
    ITOH, K
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1988, 6 (02) : 242 - 248
  • [35] Speaker Dependent Mapping for Low Bit Rate Coding of Throat Microphone Speech
    Joseph, M. Anand
    Yegnanarayana, B.
    Gupta, Sanjeev
    Kesheorey, M. R.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1099 - +
  • [36] Efficient LSP quantization algorithm for very low bit rate speech coding
    Li, Junlin
    Cui, Huijun
    Tang, Kun
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2004, 44 (10): : 1422 - 1425
  • [37] ADAPTIVE DENSITY PULSE EXCITATION FOR LOW BIT-RATE SPEECH CODING
    AKAMINE, M
    MISEKI, K
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1995, E78A (02) : 199 - 207
  • [38] Efficient methods for high quality low bit rate wideband speech coding
    Bessette, B
    Salami, R
    Lefebvre, R
    Jelinek, M
    2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 114 - 116
  • [39] Bandwidth extension of narrowband speech for low bit-rate wideband coding
    Valin, JM
    Lefebvre, R
    2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 130 - 132
  • [40] Low bit-rate speech coding based on an improved sinusoidal model
    Ahmadi, S
    Spanias, AS
    SPEECH COMMUNICATION, 2001, 34 (04) : 369 - 390