Speech and audio coding using temporal masking

被引:0
|
作者
Gunawan, TS [1 ]
Ambikairajah, E [1 ]
Senn, D [1 ]
机构
[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
关键词
temporal masking model; simultaneous masking model; Gammatone filters; wavelet packet; PESQ; subjective listening test;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a comparison of three auditory temporal masking models for speech and audio coding applications. The first model was developed based upon the existing forward masking psychoacoustic data with an assumption of ail approximately 200 ms. The model's dynamic parameters were derived from this data. The previously developed second model was,: based upon the principle of an exponential decay following higher energy stimuli, where the masking effects have a relatively short duration. The existing third model best matches the previously reported forward masking, data using ail exponential curve but the effects of the Forward masking are restricted to 100-200ms. Objective assessments employing the PESQ measure reveal that these three ternporal models have potential for removing perceptually redundant information in speech and audio coding, applications. Results show that the incorporation of temporal masking along with simultaneous masking into a speech/audio coding algorithm results in a further bit rate reduction of approximately 17% compared with simultaneous masking alone. while preserving perceptual quality.
引用
收藏
页码:31 / 42
页数:12
相关论文
共 50 条
  • [31] WIDE-BAND SPEECH AND AUDIO CODING
    NOLL, P
    IEEE COMMUNICATIONS MAGAZINE, 1993, 31 (11) : 34 - 44
  • [32] A novel fast algorithm for speech and audio coding
    Guz, Umit
    Gurkan, Hakan
    Yarman, B. Siddik
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 4020 - +
  • [33] Hybrid Audio Coding for speech and audio below medium bit bate
    Makino, K
    Matsumoto, J
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 264 - 265
  • [34] A new audio coding scheme using a forward masking model and perceptually weighted vector quantization
    Huang, YH
    Chiueh, TD
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 325 - 335
  • [35] Variable rate speech coding using straight and temporal decomposition
    Nguyen, PC
    Akagi, M
    2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 26 - 28
  • [36] Interpolative coding of speech parameters using hierarchical temporal decomposition
    Ghaemmaghami, S
    Deriche, M
    Sridharan, S
    DIGITAL SIGNAL PROCESSING, 2003, 13 (03) : 433 - 456
  • [37] Speech Enhancement using Temporal Masking in Presence of Near-end Noise
    Premananda, B. S.
    Uma, B., V
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, COMMUNICATION, CONTROL AND COMPUTING (I4C), 2014, : 263 - 266
  • [38] Very low rate speech coding using temporal decomposition
    Ghaemmaghami, S
    Sridharan, S
    ELECTRONICS LETTERS, 1999, 35 (06) : 456 - 457
  • [39] Speech separation using DUET and binary masking with temporal smoothing in cepstral domain
    Misssaoui, Ibrahim
    Lachiri, Zied
    WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,
  • [40] Combined coding of audio and speech signals using LPC and the discrete wavelet transform
    Mason, M
    Boland, S
    Sridharan, S
    Deriche, M
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 747 - 750