Wide-band perceptual audio coding based on frequency-domain linear prediction

被引:0
|
作者
Motlicek, Petr [1 ]
Uallal, Vijay [2 ]
Hermansky, Hynek [1 ,3 ]
机构
[1] IDIAP Res Inst, Martigny, Switzerland
[2] Int Comp Sci Inst, Berkeley, CA USA
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
来源
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS | 2007年
关键词
audio signal processing; data compression; linear predictive coding; Hilbert transform;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an extension of the very low bit-rate speech coding technique, exploiting predictability of the temporal evolution of spectral envelopes, for wide-band audio coding applications. Temporal envelopes in critically band-sized sub-bands are estimated using frequency domain linear prediction applied on relatively long time segments. The sub-band residual signals, which play an important role in acquiring high quality reconstruction, are processed using a heterodyning-based signal analysis technique. For reconstruction, their optimal parameters are estimated using a closed-loop analysis-by-synthesis technique driven by a perceptual model emulating simultaneous masking properties of the human auditory system. We discuss the advantages of the approach and show some properties on challenging audio recordings. The proposed technique is capable of encoding high quality, variable rate audio signals on bit-rates below 1bit/sample.
引用
收藏
页码:265 / +
页数:2
相关论文
共 50 条
  • [41] Bit allocation in wide band audio coding
    Yan, GM
    Dong, ZW
    5TH INTERNATIONAL SYMPOSIUM ON BROADCASTING TECHNOLOGY, PROCEEDINGS (ISBT'97, BEIJING), 1997, : 344 - 349
  • [42] WIDE-BAND HYBRID ANALOG/DIGITAL FREQUENCY DOMAIN ADAPTIVE FILTER.
    Morgul, Avni
    Grant, Peter M.
    Cowan, Colin F.N.
    1600, (ASSP-32):
  • [43] A WIDE-BAND RECORDER USING FREQUENCY BAND DIVISION
    COMERCI, FA
    ISA TRANSACTIONS, 1965, 4 (03) : 240 - &
  • [44] All-pole modeling of wide-band speech with symmetric linear prediction
    Alku, P
    Bäckström, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 152 - 155
  • [45] Frequency-domain algorithms for audio signal enhancement based on transient modification
    Goodwin, Michael M.
    Avendano, Carlos
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2006, 54 (09): : 827 - 840
  • [46] Frequency-domain algorithms for audio signal enhancement based on transient modification
    Goodwin, Michael M.
    Avendano, Carlos
    AES: Journal of the Audio Engineering Society, 2006, 54 (09): : 827 - 840
  • [47] Frequency-Domain Synthesis of Ultra-Wide Band Antennas with a Flat Response
    Deacu, Daniela
    Tamas, Razvan D.
    Petrescu, Teodor
    2014 INTERNATIONAL WORKSHOP ON ANTENNA TECHNOLOGY: "SMALL ANTENNAS, NOVEL EM STRUCTURES AND MATERIALS, AND APPLICATIONS" (IWAT), 2014, : 314 - 317
  • [48] A robust watermarking system based on the properties of low frequency in perceptual audio coding
    Wang, CT
    Chen, TS
    Xu, ZM
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2004, E87A (08) : 2152 - 2159
  • [49] WIDE-BAND CODING FOR UNCOORDINATED MULTIPLE ACCESS COMMUNICATION
    MOWBRAY, RS
    GRANT, PM
    ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL, 1992, 4 (06): : 351 - 361
  • [50] Wide-band and wide-dynamic-range recording and reproduction of digital audio
    Komamura, Mitsuya
    AES: Journal of the Audio Engineering Society, 1995, 43 (1-2): : 29 - 39