A computationally efficient cochlear filter bank for perceptual audio coding

被引:0
|
作者
Baumgarte, F [1 ]
机构
[1] Lucent Techno, Bell Labs, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
来源
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM | 2001年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many applications in auditory modeling require analysis filters that approximate the frequency selectivity given by psychophysical data, e.g. from masking experiments using narrow-band maskers. This frequency selectivity is largely determined by the spectral decomposition process inside the human cochlea. Currently used spectral decomposition schemes for masking modeling in audio coding generally do not achieve the non-uniform time and frequency resolution provided by the cochlea. These applications rather take advantage of the computational efficiency of uniform filter banks or transforms at the expense of coding gain. This paper presents a suitable analysis filter-bank structure employing cascaded low-order IIR filters and appropriate downsampling to increase efficiency. In an application example, the filter responses were optimized to model auditory masking effects. The results show that the time and frequency resolution of the filter bank matches or exceeds the masking properties. Thus, the filter bank enables improved masking modeling for audio coding at low computational costs.
引用
收藏
页码:3265 / 3268
页数:4
相关论文
共 50 条
  • [41] Understanding perceptual distortion in MPEG scalable audio coding
    Creusere, CD
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 422 - 431
  • [42] A review of algorithms for perceptual coding of digital audio signals
    Painter, T
    Spanias, A
    DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 179 - 208
  • [43] Multiple description perceptual audio coding with correlating transforms
    Arean, R
    Kovacevic, J
    Goyal, VK
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 140 - 145
  • [44] INTMDCT - A link between perceptual and lossless audio coding
    Geiger, R
    Herre, J
    Koller, J
    Brandenburg, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1813 - 1816
  • [45] Computationally efficient MCTF for scalable video coding
    Karunakar, A. K.
    Manohara, Pai M. M.
    2006 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, VOLS 1 AND 2, 2007, : 474 - 479
  • [46] Computationally efficient generic adaptive filter (CEGAF)
    Abid, Muqaddas
    Ishtiaq, Muhammad
    Khan, Farman Ali
    Khan, Salabat
    Ahmad, Rashid
    Shah, Peer Azmat
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S7111 - S7121
  • [47] Computationally efficient generic adaptive filter (CEGAF)
    Muqaddas Abid
    Muhammad Ishtiaq
    Farman Ali Khan
    Salabat Khan
    Rashid Ahmad
    Peer Azmat Shah
    Cluster Computing, 2019, 22 : 7111 - 7121
  • [48] COMPUTATIONALLY EFFICIENT BANK OF RECTANGULAR DIGITAL-FILTERS
    ZELI, GW
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1975, AE11 (02) : 229 - 233
  • [49] Modifying transients for efficient coding of audio
    Vafin, R
    Heusdens, R
    Kleijn, WB
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3285 - 3288
  • [50] Modulation frequency and efficient audio coding
    Atlas, LE
    Vinton, MS
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS XI, 2001, 4474 : 1 - 8