VARIABLE BIT-RATE CELP CODING OF SPEECH WITH PHONETIC CLASSIFICATION

被引:0
|
作者
PAKSOY, E
SRINIVASAN, K
GERSHO, A
机构
来源
关键词
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
A variable bit-rate speech coder intended for digital cellular applications is described. A voice activity detection algorithm is used to distinguish active speech from background noise. Each frame of active speech is further classified to distinguish between three phonetic categories: voiced, unvoiced, and onset. Each input frame is assigned one of five bit rates according to voice activity and phonetic classification and coded using an analysis-by-synthesis algorithm tailored to the needs of the class that it belongs to. The resulting coder, called Variable Rate Phonetic Segmentation, produces good quality speech at an average bit-rate below 3 kbit/s when operating with a voice activity factor of 0.5. Informal subjective quality assessment for speech in clean and noisy backgrounds indicates a performance that is comparable to the TIA standard QCELP algorithm while operating at a 25% to 40% lower average bit rate.
引用
收藏
页码:591 / 601
页数:11
相关论文
共 50 条
  • [31] IMPROVING LOW BIT-RATE CODING
    Rumsey, Francis
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2010, 58 (12): : 1116 - 1121
  • [32] Low bit-rate speech coding by perceptually optimized noise excitation modulation
    Tsoukalas, D
    Mourjopoulos, J
    Kokkinakis, G
    SIGNAL PROCESSING, 1997, 56 (01) : 77 - 89
  • [33] THE APPLICATION OF ARTIFICIAL NEURAL NETWORK TECHNIQUES TO LOW BIT-RATE SPEECH CODING
    KAOURI, HA
    MCCANNY, JV
    FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 100 - 104
  • [34] LOW BIT-RATE SPEECH CODING WITH VQ-VAE AND A WAVENET DECODER
    Garbacea, Cristina
    van den Oord, Aaron
    Li, Yazhe
    Lim, Felicia S. C.
    Luebs, Alejandro
    Vinyals, Oriol
    Walters, Thomas C.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 735 - 739
  • [35] A MIXED EXCITATION LPC VOCODER MODEL FOR LOW BIT-RATE SPEECH CODING
    MCCREE, AV
    BARNWELL, TP
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04): : 242 - 250
  • [36] On the Study of Noise Allocation for Speech Signal in Low Bit-Rate Audio Coding
    Lee, Chang-Heon
    Oh, Hyen-O
    Kang, Hong-Goo
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (10) : 849 - 852
  • [37] Low bit-rate speech coding by perceptually optimized noise excitation modulation
    Univ of Patras, Patras, Greece
    Signal Process, 1 (77-89):
  • [38] A neural network-based video bit-rate control algorithm for variable bit-rate applications of versatile video coding standard
    Raufmehr, Farhad
    Salehi, Mohammad Reza
    Abiri, Ebrahim
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 96
  • [39] Steganography integrated into linear predictive coding for low bit-rate speech codec
    Peng Liu
    Songbin Li
    Haiqiang Wang
    Multimedia Tools and Applications, 2017, 76 : 2837 - 2859
  • [40] Low bit-rate speech coding based on multicomponent AFM signal model
    Bansal M.
    Sircar P.
    International Journal of Speech Technology, 2018, 21 (4) : 783 - 795