Scalable Audio Coding Using Trellis-Based Optimized Joint Entropy Coding and Quantization

被引：3

作者：

Movassagh, Mahmood ^{[1
]}

Kabal, Peter ^{[1
]}

机构：

[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0E9, Canada

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2016年 / 24卷 / 12期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Audio coding; bit-rate scalability; entropy coding; quantization; AAC; SLS; BOUNDS;

D O I：

10.1109/TASLP.2016.2607339

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There is a considerable performance gap between the current scalable audio coding schemes and a nonscalable coder operating at the same bitrate. This suboptimality results from the independent coding of the layers in these systems. One of the aspects that plays a role in this suboptimality is the entropy coding. In practical audio coding systems including MPEG advanced audio coding (AAC), the transform domain coefficients are quantized using an entropy-constrained quantizer. InMPEG-4 scalable AAC (S-AAC), the quantization and coding are performed separately at each layer. In case of Huffman coding, the redundancy introduced by the entropy coding at each layer is larger at lower quantization resolutions. Also, the redundancy for the overall coder becomes larger as the number of layers increases. In fact, there is a trade-off between the overall redundancy and the fine-grain scalability in which the bitrate per layer is smaller and more layers are required. In this paper, a fine-grain scalable coder for audio signals is proposed where the entropy coding of a quantizer is made scalable via joint design of entropy coding and quantization. By constructing a Huffman-like coding tree where the internal nodes can be mapped to the reconstruction points, the tree can be pruned at any internal node to control the rate-distortion (RD) performance of the encoder in a fine-grain manner. A set of metrics and a trellis-based approach is proposed to create a coding tree so that an appropriate path is generated on the RD plane. The results show the proposed method outperforms the scalable audio coding performed based on reconstruction error quantization as used in practical systems, e.g., in S-AAC.

引用

页码：2288 / 2300

页数：13

共 50 条

[1] A trellis-based optimal parameter value selection for audio coding
Aggarwal, A
Regunathan, SL
Rose, K
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 623 - 633
[2] Joint speech/audio coding based scalable perceptual audio coding
Gao, Li
Hu, Ruimin
Yang, Yuhong
2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
[3] Trellis-based optimization of MPEG-4 advanced audio coding
Aggarwal, A
Regunathan, SL
Rose, K
2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 142 - 144
[4] JOINT ENTROPY-SCALABLE CODING OF AUDIO SIGNALS
Movassagh, Mahmood
Thiemann, Joachim
Kabal, Peter
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2961 - 2964
[5] Entropy Coding of Spectral Envelopes for Speech And Audio Coding Using Distribution Quantization
Korse, Srikanth
Jaehnel, Tobias
Baeckstroem, Tom
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2543 - 2547
[6] Residual image coding using trellis quantization
Eriksson, Tomas
Goertz, Norbert
Novak, Mirek
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1689 - 1692
[7] Trellis-based Equalization Schemes for Physical Layer Network Coding
Schmidt, Armin
Gerstacker, Wolfgang
2012 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2012,
[8] Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding
Melkote, Vinay
Rose, Kenneth
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02): : 330 - 341
[9] Turbo and trellis-based constructions for source coding with side information
Chou, J
Pradhan, SS
Ramchandran, K
DCC 2003: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2003, : 33 - 42
[10] Adaptive Audio Steganography Based on Advanced Audio Coding and Syndrome-Trellis Coding
Luo, Weiqi
Zhang, Yue
Li, Haodong
DIGITAL FORENSICS AND WATERMARKING, 2017, 10431 : 177 - 186

← 1 2 3 4 5 →