A computationally efficient cochlear filter bank for perceptual audio coding

被引:0
|
作者
Baumgarte, F [1 ]
机构
[1] Lucent Techno, Bell Labs, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
来源
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM | 2001年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many applications in auditory modeling require analysis filters that approximate the frequency selectivity given by psychophysical data, e.g. from masking experiments using narrow-band maskers. This frequency selectivity is largely determined by the spectral decomposition process inside the human cochlea. Currently used spectral decomposition schemes for masking modeling in audio coding generally do not achieve the non-uniform time and frequency resolution provided by the cochlea. These applications rather take advantage of the computational efficiency of uniform filter banks or transforms at the expense of coding gain. This paper presents a suitable analysis filter-bank structure employing cascaded low-order IIR filters and appropriate downsampling to increase efficiency. In an application example, the filter responses were optimized to model auditory masking effects. The results show that the time and frequency resolution of the filter bank matches or exceeds the masking properties. Thus, the filter bank enables improved masking modeling for audio coding at low computational costs.
引用
收藏
页码:3265 / 3268
页数:4
相关论文
共 50 条
  • [31] Efficient audio coding using perfect reconstruction noncausal IIR filter banks
    Creusere, CD
    Mitra, SK
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (02): : 115 - 123
  • [32] A Computationally Efficient Design for Prototype Filters of an M-Channel Cosine Modulated Filter Bank
    Rayavarapu, Neela. R.
    Prakash, Neelam Rup
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 13, 2006, 13 : 272 - +
  • [33] An Efficient FPGA-Based Accelerator for Perceptual Weighting Filter in Speech Coding
    Singh, Dilip
    Chandel, Rajeevan
    IETE TECHNICAL REVIEW, 2024, 41 (04) : 441 - 453
  • [34] A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems
    Damjan Vlaj
    Bojan Kotnik
    Bogomir Horvat
    Zdravko Kačič
    EURASIP Journal on Advances in Signal Processing, 2005
  • [35] A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems
    Vlaj, D
    Kotnik, B
    Horvat, B
    Kacic, Z
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (04) : 487 - 497
  • [36] Computationally efficient amplitude modulated sinusoidal audio coding using frequency-domain linear prediction
    Christensen, Mads Grasboll
    Jensen, Soren Holdt
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 4919 - 4922
  • [37] An analysis of perceptual artifacts in MPEG scalable audio coding
    Creusere, CD
    DCC 2002: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2002, : 152 - 161
  • [38] Perceptual Coding of High-Quality Digital Audio
    Brandenburg, Karlheinz
    Faller, Christof
    Herre, Juergen
    Johnston, James D.
    Kleijn, W. Bastiaan
    PROCEEDINGS OF THE IEEE, 2013, 101 (09) : 1905 - 1919
  • [39] Towards a new perceptual coding paradigm for audio signals
    Der, R
    Kabal, P
    Chan, WY
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 457 - 460
  • [40] Study on rounding errors of INTMDCT in perceptual audio coding
    Li, T
    Rahardja, S
    Yu, RS
    Koh, SN
    ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 753 - 758