Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding

被引：9

作者：

Ruiz Reyes, N. ^{[1
]}

Vera Candeas, P. ^{[1
]}

机构：

[1] Univ Jaen, Telecommun Engn Dept, Jaen 23071, Spain

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2010年 / 18卷 / 03期

关键词：

Adaptive signal models; matching pursuit; overcomplete dictionary; parametric audio coding; perceptual matching pursuit; scalability; sparse approximations; SPEECH;

D O I：

10.1109/TASL.2009.2037396

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper deals with the application of adaptive signal models for parametric audio coding. A fully parametric audio coder, which decomposes the audio signal into sinusoids, transients and noise, is here proposed. Adaptive signal models for sinusoidal, transient, and noise modeling are therefore included in the parametric scheme in order to achieve high-quality and low bit-rate audio coding. In this paper, a new sinusoidal modeling method based on a perceptual distortion measure is proposed. For transient modeling, a fast and effective method based on matching pursuit with a mixed dictionary is chosen. The residue of the previous models is analyzed as a noise-like signal. The proposed parametric audio coder allows high quality audio coding for one-channel audio signals at 16 kbits/s (average bit rate). A bit-rate scalable version of the parametric audio coder is also proposed in this work. Bit-rate scalability is intended for audio streaming applications, which are highly demanded nowadays. The performance of the proposed parametric audio coders (non-scalable and scalable coders) is assessed in comparison to widely used audio coders operating at similar bit rates.

引用

页码：447 / 460

页数：14

共 50 条

[21] A scalable and lossless audio coding system based on integer transform
Zhang, Yong
Gao, Ge
2006 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES,VOLS 1-3, 2006, : 464 - +
[22] Fixed Quality Layered Audio Based on Scalable Lossless Coding
Li, Te
Rahardja, Susanto
Koh, Soo Ngee
IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (03) : 422 - 432
[23] Sparse autoencoder based multiple audio objects coding method
Zhang, Shuang
Wu, Xihong
Qu, Tianshu
146TH AES CONVENTION, 2019,
[24] SIGNAL-ADAPTIVE SWITCHING OF OVERLAP RATIO IN AUDIO TRANSFORM CODING
Helmrich, Christian R.
Edler, Bernd
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 639 - 643
[25] SIGNAL-ADAPTIVE TRANSFORM KERNEL SWITCHING FOR STEREO AUDIO CODING
Helmrich, Christian R.
Edler, Bernd
2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
[26] Adaptive Multiple Subtraction Based on Sparse Coding
Liu, Jinlin
Lu, Wenkai
Zhang, Yingqiang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (03): : 1318 - 1324
[27] Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding
Petrovsky, Alexey
Azarov, Elias
Petrovsky, Alexander
SIGNAL PROCESSING, 2011, 91 (06) : 1489 - 1504
[28] Adaptive Context Recognition Based on Audio Signal
Zeng, Zhi
Li, Xin
Ma, Xiaohong
Ji, Qiang
19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2507 - 2510
[29] Fine grain scalable perceptual and lossless audio coding based on IntMDCT
Geiger, R
Herre, J
Schuller, G
Sporer, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 445 - 448
[30] Bandwidth-Scalable Stereo Audio Coding Based on a Layered Structure
Lee, Young Han
Kim, Deok Su
Kim, Hong Kook
Sung, Jongmo
Lee, Mi Suk
Bae, Hyun Joo
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12) : 2540 - 2544

← 1 2 3 4 5 →