Speech and audio coding using temporal masking

被引：0

作者：

Gunawan, TS ^{[1
]}

Ambikairajah, E ^{[1
]}

Senn, D ^{[1
]}

机构：

[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia

来源：

SIGNAL PROCESSING FOR TELECOMMUNICATIONS AND MULTIMEDIA | 2005年 / 27卷

关键词：

temporal masking model; simultaneous masking model; Gammatone filters; wavelet packet; PESQ; subjective listening test;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a comparison of three auditory temporal masking models for speech and audio coding applications. The first model was developed based upon the existing forward masking psychoacoustic data with an assumption of ail approximately 200 ms. The model's dynamic parameters were derived from this data. The previously developed second model was,: based upon the principle of an exponential decay following higher energy stimuli, where the masking effects have a relatively short duration. The existing third model best matches the previously reported forward masking, data using ail exponential curve but the effects of the Forward masking are restricted to 100-200ms. Objective assessments employing the PESQ measure reveal that these three ternporal models have potential for removing perceptually redundant information in speech and audio coding, applications. Results show that the incorporation of temporal masking along with simultaneous masking into a speech/audio coding algorithm results in a further bit rate reduction of approximately 17% compared with simultaneous masking alone. while preserving perceptual quality.

引用

页码：31 / 42

页数：12

共 50 条

[31] WIDE-BAND SPEECH AND AUDIO CODING
NOLL, P
IEEE COMMUNICATIONS MAGAZINE, 1993, 31 (11) : 34 - 44
[32] A novel fast algorithm for speech and audio coding
Guz, Umit
Gurkan, Hakan
Yarman, B. Siddik
2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 4020 - +
[33] Hybrid Audio Coding for speech and audio below medium bit bate
Makino, K
Matsumoto, J
IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 264 - 265
[34] A new audio coding scheme using a forward masking model and perceptually weighted vector quantization
Huang, YH
Chiueh, TD
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 325 - 335
[35] Variable rate speech coding using straight and temporal decomposition
Nguyen, PC
Akagi, M
2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 26 - 28
[36] Interpolative coding of speech parameters using hierarchical temporal decomposition
Ghaemmaghami, S
Deriche, M
Sridharan, S
DIGITAL SIGNAL PROCESSING, 2003, 13 (03) : 433 - 456
[37] Speech Enhancement using Temporal Masking in Presence of Near-end Noise
Premananda, B. S.
Uma, B., V
2014 INTERNATIONAL CONFERENCE ON CIRCUITS, COMMUNICATION, CONTROL AND COMPUTING (I4C), 2014, : 263 - 266
[38] Very low rate speech coding using temporal decomposition
Ghaemmaghami, S
Sridharan, S
ELECTRONICS LETTERS, 1999, 35 (06) : 456 - 457
[39] Speech separation using DUET and binary masking with temporal smoothing in cepstral domain
Misssaoui, Ibrahim
Lachiri, Zied
WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,
[40] Combined coding of audio and speech signals using LPC and the discrete wavelet transform
Mason, M
Boland, S
Sridharan, S
Deriche, M
IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 747 - 750

← 1 2 3 4 5 →