Efficient pitch filter encoding for variable rate speech processing

被引:2
|
作者
McClellan, S [1 ]
Gibson, JD [1 ]
Rutherford, BK [1 ]
机构
[1] Univ Alabama, Dept Elect & Comp Engn, Birmingham, AL 35294 USA
来源
基金
美国国家科学基金会;
关键词
pitch filter; quantization; variable rate; vector quantization;
D O I
10.1109/89.736327
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Analysis-by-synthesis techniques are used in a wide variety of speech coding standards and applications for rates below 16 kbps, The presence of a long-term predictor, commonly known as the adaptive codebook, is critical to coder performance at the lower rates. Unfortunately, the encoding rate and computational requirements for high-quality encoding of pitch filter parameters can be excessive. Several popular approaches explore the trade-off between predictor order, allocated bit rate, and computational requirements for long-term predictor optimization. Here, we investigate the relative performance of several longterm predictor structures and present a new approach to vector quantization of pitch filter coefficients having subjective quality equivalent to other schemes, but at a lower coding rate and requiring significantly less closed-loop computation. Performance is evaluated in a variable-rate CELP coder at an average rate of 2 kbps and in Federal Standard 1016 CELP.
引用
收藏
页码:18 / 29
页数:12
相关论文
共 50 条
  • [1] ON ENCODING PITCH AND LPC PARAMETERS FOR LOW-RATE SPEECH CODERS
    COPPERI, M
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1994, 5 (05): : 565 - 572
  • [2] Real-time speech processing of variable rate using pitch period estimation and fuzzy logic control
    Wang, MS
    Chang, CM
    Huang, CK
    2004 47TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III, CONFERENCE PROCEEDINGS, 2004, : 199 - 202
  • [3] PITCH PROCESSING IN MUSIC AND SPEECH
    Tillmann, Barbara
    ACOUSTICS AUSTRALIA, 2014, 42 (02) : 124 - 130
  • [4] Pitch processing in music and speech
    Tillmann, Barbara, 1600, Australian Acoustical Society, Singapore (42):
  • [5] VARIABLE-RATE SPEECH COMPRESSION BY ENCODING SUBSETS OF THE PARCOR COEFFICIENTS
    PAPAMICHALIS, PE
    BARNWELL, TP
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (03): : 706 - 713
  • [6] An efficient algorithm for pitch determination of speech signals-Kalman filter approach
    Salor, Ozgul
    Demirekler, Mubeccel
    Orguner, Umut
    2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 309 - +
  • [7] INFORMATION RATE OF PITCH SIGNAL IN SPEECH
    KOSHIKAW.T
    SUGIMOTO, T
    IRE TRANSACTIONS ON INFORMATION THEORY, 1962, 8 (05): : S92 - &
  • [8] EFFICIENT PITCH ESTIMATION FOR SPEECH AND MUSIC
    TUCKER, WH
    BATES, RHT
    ELECTRONICS LETTERS, 1977, 13 (12) : 357 - 358
  • [9] Efficient algorithms for speech pitch estimation
    Mei, XD
    Pan, JS
    Sun, SH
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 421 - 424
  • [10] THE EFFECTS OF ABSOLUTE PITCH AND TONE LANGUAGE ON PITCH PROCESSING AND ENCODING IN MUSICIANS
    Hutka, Stefanie A.
    Alain, Claude
    MUSIC PERCEPTION, 2015, 32 (04): : 344 - 354