VARIABLE BIT-RATE CELP CODING OF SPEECH WITH PHONETIC CLASSIFICATION

被引:0
|
作者
PAKSOY, E
SRINIVASAN, K
GERSHO, A
机构
来源
关键词
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
A variable bit-rate speech coder intended for digital cellular applications is described. A voice activity detection algorithm is used to distinguish active speech from background noise. Each frame of active speech is further classified to distinguish between three phonetic categories: voiced, unvoiced, and onset. Each input frame is assigned one of five bit rates according to voice activity and phonetic classification and coded using an analysis-by-synthesis algorithm tailored to the needs of the class that it belongs to. The resulting coder, called Variable Rate Phonetic Segmentation, produces good quality speech at an average bit-rate below 3 kbit/s when operating with a voice activity factor of 0.5. Informal subjective quality assessment for speech in clean and noisy backgrounds indicates a performance that is comparable to the TIA standard QCELP algorithm while operating at a 25% to 40% lower average bit rate.
引用
收藏
页码:591 / 601
页数:11
相关论文
共 50 条
  • [41] 3.35kb/s low bit-rate speech coding algorithm
    Li, Yue
    Tang, Kun
    Cui, Huijuan
    Du, Wen
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2004, 44 (10): : 1410 - 1413
  • [42] Algorithms for Low Bit-Rate Coding with Adaptation to Statistical Characteristics of Speech Signal
    Saveliev, Anton
    Basov, Oleg
    Ronzhin, Andrey
    Ronzhin, Alexander
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 65 - 72
  • [43] Steganography integrated into linear predictive coding for low bit-rate speech codec
    Liu, Peng
    Li, Songbin
    wang, Haiqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (02) : 2837 - 2859
  • [44] Phase modelling of speech excitation for low bit-rate sinusoidal transform coding
    Sun, XQ
    Plante, F
    Cheetham, BMG
    Wong, KWT
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1691 - 1694
  • [45] ASYMPTOTIC ANALYSIS OF STATISTICAL MULTIPLEXING OF VARIABLE BIT-RATE AND CONSTANT BIT-RATE SOURCES
    GARCIA, J
    CASALS, O
    MODELLING AND PERFORMANCE EVALUATION OF ATM TECHNOLOGY, 1993, 15 : 137 - 155
  • [46] Noise post-processing for low bit-rate CELP coders
    Ehara, H
    Yasunaga, K
    Yoshida, K
    Hiwasaki, Y
    Mano, K
    Kaneko, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (06): : 1507 - 1516
  • [47] Noise post-processing for low bit-rate CELP coders
    Ehara, Hiroyuki
    Yasunaga, Kazutoshi
    Yoshida, Koji
    Hiwasaki, Yusuke
    Mano, Kazunori
    Kaneko, Takao
    IEICE Transactions on Information and Systems, 2004, E87-D (06) : 1507 - 1516
  • [48] A one-pass variable bit-rate video coding for storage media
    Song, BC
    Chun, KW
    ICCE: 2003 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2003, : 110 - 111
  • [49] A one-pass variable bit-rate video coding for storage media
    Song, BC
    Chun, KW
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2003, 49 (03) : 689 - 692