A perceptual model for sinusoidal audio coding based on spectral integration

被引:49
|
作者
van de Par, S [1 ]
Kohlrausch, A
Heusdens, R
Jensen, J
Jensen, SH
机构
[1] Philips Res Labs, Digital Signal Proc Grp, NL-5656 AA Eindhoven, Netherlands
[2] Eindhoven Univ Technol, Dept Technol Management, NL-5600 MB Eindhoven, Netherlands
[3] Delft Univ Technol, Dept Mediamat, NL-2600 GA Delft, Netherlands
[4] Aalborg Univ, Inst Electron Syst, Dept Commun Technol, DK-9220 Aalborg, Denmark
关键词
audio coding; psychoacoustical modelling; auditory masking; spectral masking; sinusoidal modelling; psychoacoustical matching pursuit;
D O I
10.1155/ASP.2005.1292
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion delectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level.
引用
收藏
页码:1292 / 1304
页数:13
相关论文
共 50 条
  • [31] Sinusoidal analysis-synthesis of audio using perceptual criteria
    Painter, Ted
    Spanias, Andreas
    1600, Hindawi Publishing Corporation (2003):
  • [32] Sinusoidal Analysis-Synthesis of Audio Using Perceptual Criteria
    Ted Painter
    Andreas Spanias
    EURASIP Journal on Advances in Signal Processing, 2003
  • [33] Sinusoidal analysis-synthesis of audio using perceptual criteria
    Painter, T
    Spanias, A
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (01) : 15 - 20
  • [34] SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL
    Al-Moussawy Raed
    Journal of Electronics(China), 2004, (03) : 213 - 221
  • [35] AUDIO CODING BASED ON SPECTRAL RECOVERY BY CONVOLUTIONAL NEURAL NETWORK
    Shin, Seong-Hyeon
    Beack, Seung Kwon
    Lee, Taejin
    Park, Hochong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 725 - 729
  • [36] Scalable audio coding employing sorted sinusoidal parameters
    Raad, M
    Burnett, IS
    Mertins, A
    ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 174 - 177
  • [37] Jointly optimal quantization of parameters in sinusoidal audio coding
    Vafin, R
    Kleijn, WB
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 247 - 250
  • [38] Amplitude modulated sinusoidal models for audio modeling and coding
    Christensen, MG
    Andersen, SV
    Jensen, SH
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 1334 - 1342
  • [39] SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL
    Al-Moussawy Raed
    Journal of Electronics, 2004, (03) : 213 - 221
  • [40] Theoretical Implementation on 3D Audio Based on Cochlear Perceptual Coding
    Li, Nian
    2018 5TH INTERNATIONAL SYMPOSIUM ON COMPUTER, COMMUNICATION, CONTROL AND AUTOMATION (3CA 2018), 2018, : 136 - 140