An application of recurrent neural networks to low bit rate speech coding

被引:0
|
作者
Kohata, M
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows a good performance of spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20 similar to 30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality.
引用
收藏
页码:57 / 60
页数:4
相关论文
共 50 条
  • [1] An application of recurrent neural networks to low bit rate speech coding
    Kohata, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 314 - 317
  • [2] Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding
    Cernak, Milos
    Lazaridis, Alexandros
    Asaei, Afsaneh
    Garner, Philip N.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2301 - 2312
  • [3] THE APPLICATION OF ARTIFICIAL NEURAL NETWORK TECHNIQUES TO LOW BIT-RATE SPEECH CODING
    KAOURI, HA
    MCCANNY, JV
    FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 100 - 104
  • [4] On Compressibility of Neural Network Phonological Features for Low Bit Rate Speech Coding
    Asaei, Afsaneh
    Cernak, Milos
    Bourlard, Herve
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 418 - 422
  • [5] A Speech Enhancement Preprocessor for Low Bit Rate Speech Coding
    Zhao, Hanwu
    Zou, Xia
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 443 - +
  • [6] Application of the wavelet transform to the low-bit-rate speech coding system
    Moriai, S
    Hanazaki, I
    ELECTRICAL ENGINEERING IN JAPAN, 2004, 148 (03) : 62 - 71
  • [7] A preprocessor for low-bit-rate speech coding
    Kim, NS
    Chang, JH
    IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (10) : 318 - 321
  • [8] COMPREHENSIVE IMPROVEMENT IN LOW BIT RATE SPEECH CODING
    FAN, CX
    MA, HF
    DALLAS GLOBECOM 89, VOLS 1-3: COMMUNICATIONS TECHNOLOGY FOR THE 1990S AND BEYOND, 1989, : 1916 - 1920
  • [9] Low bit rate wideband WI speech coding
    Ritz, CH
    Burnett, IS
    Lukasiak, J
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 804 - 807
  • [10] LOW BIT RATE SPEECH CODING FOR PRACTICAL APPLICATIONS
    SOUTHCOTT, CB
    BOYD, I
    COLEMAN, AE
    HAMMETT, PG
    BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 22 - 40