Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding

被引:9
|
作者
Ruiz Reyes, N. [1 ]
Vera Candeas, P. [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen 23071, Spain
关键词
Adaptive signal models; matching pursuit; overcomplete dictionary; parametric audio coding; perceptual matching pursuit; scalability; sparse approximations; SPEECH;
D O I
10.1109/TASL.2009.2037396
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper deals with the application of adaptive signal models for parametric audio coding. A fully parametric audio coder, which decomposes the audio signal into sinusoids, transients and noise, is here proposed. Adaptive signal models for sinusoidal, transient, and noise modeling are therefore included in the parametric scheme in order to achieve high-quality and low bit-rate audio coding. In this paper, a new sinusoidal modeling method based on a perceptual distortion measure is proposed. For transient modeling, a fast and effective method based on matching pursuit with a mixed dictionary is chosen. The residue of the previous models is analyzed as a noise-like signal. The proposed parametric audio coder allows high quality audio coding for one-channel audio signals at 16 kbits/s (average bit rate). A bit-rate scalable version of the parametric audio coder is also proposed in this work. Bit-rate scalability is intended for audio streaming applications, which are highly demanded nowadays. The performance of the proposed parametric audio coders (non-scalable and scalable coders) is assessed in comparison to widely used audio coders operating at similar bit rates.
引用
收藏
页码:447 / 460
页数:14
相关论文
共 50 条
  • [21] A scalable and lossless audio coding system based on integer transform
    Zhang, Yong
    Gao, Ge
    2006 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES,VOLS 1-3, 2006, : 464 - +
  • [22] Fixed Quality Layered Audio Based on Scalable Lossless Coding
    Li, Te
    Rahardja, Susanto
    Koh, Soo Ngee
    IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (03) : 422 - 432
  • [23] Sparse autoencoder based multiple audio objects coding method
    Zhang, Shuang
    Wu, Xihong
    Qu, Tianshu
    146TH AES CONVENTION, 2019,
  • [24] SIGNAL-ADAPTIVE SWITCHING OF OVERLAP RATIO IN AUDIO TRANSFORM CODING
    Helmrich, Christian R.
    Edler, Bernd
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 639 - 643
  • [25] SIGNAL-ADAPTIVE TRANSFORM KERNEL SWITCHING FOR STEREO AUDIO CODING
    Helmrich, Christian R.
    Edler, Bernd
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [26] Adaptive Multiple Subtraction Based on Sparse Coding
    Liu, Jinlin
    Lu, Wenkai
    Zhang, Yingqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (03): : 1318 - 1324
  • [27] Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding
    Petrovsky, Alexey
    Azarov, Elias
    Petrovsky, Alexander
    SIGNAL PROCESSING, 2011, 91 (06) : 1489 - 1504
  • [28] Adaptive Context Recognition Based on Audio Signal
    Zeng, Zhi
    Li, Xin
    Ma, Xiaohong
    Ji, Qiang
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2507 - 2510
  • [29] Fine grain scalable perceptual and lossless audio coding based on IntMDCT
    Geiger, R
    Herre, J
    Schuller, G
    Sporer, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 445 - 448
  • [30] Bandwidth-Scalable Stereo Audio Coding Based on a Layered Structure
    Lee, Young Han
    Kim, Deok Su
    Kim, Hong Kook
    Sung, Jongmo
    Lee, Mi Suk
    Bae, Hyun Joo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12) : 2540 - 2544