Scalable Audio Coding Using Trellis-Based Optimized Joint Entropy Coding and Quantization

Cited by: 3
Authors
Movassagh, Mahmood [1]
Kabal, Peter [1]
Affiliation
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0E9, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Audio coding; bit-rate scalability; entropy coding; quantization; AAC; SLS; BOUNDS;
DOI
10.1109/TASLP.2016.2607339
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
There is a considerable performance gap between the current scalable audio coding schemes and a nonscalable coder operating at the same bitrate. This suboptimality results from the independent coding of the layers in these systems. One of the aspects that plays a role in this suboptimality is the entropy coding. In practical audio coding systems, including MPEG advanced audio coding (AAC), the transform domain coefficients are quantized using an entropy-constrained quantizer. In MPEG-4 scalable AAC (S-AAC), the quantization and coding are performed separately at each layer. In the case of Huffman coding, the redundancy introduced by the entropy coding at each layer is larger at lower quantization resolutions. Also, the redundancy for the overall coder becomes larger as the number of layers increases. In fact, there is a trade-off between the overall redundancy and the fine-grain scalability, in which the bitrate per layer is smaller and more layers are required. In this paper, a fine-grain scalable coder for audio signals is proposed where the entropy coding of a quantizer is made scalable via joint design of entropy coding and quantization. By constructing a Huffman-like coding tree where the internal nodes can be mapped to the reconstruction points, the tree can be pruned at any internal node to control the rate-distortion (RD) performance of the encoder in a fine-grain manner. A set of metrics and a trellis-based approach are proposed to create a coding tree so that an appropriate path is generated on the RD plane. The results show that the proposed method outperforms scalable audio coding based on reconstruction error quantization as used in practical systems, e.g., in S-AAC.
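The abstract's remark that Huffman redundancy grows at lower quantization resolutions can be illustrated with a minimal sketch. It is not the paper's coder: the Laplacian source model, the uniform midtread quantizer, the step sizes, and all function names below are assumptions chosen only for illustration. The sketch builds an ordinary Huffman code for the quantizer indices and compares its average length with the entropy of the indices.

```python
import heapq
import math


def huffman_lengths(probs):
    """Return Huffman codeword lengths for a list of symbol probabilities."""
    if len(probs) == 1:
        return [1]
    # Heap items: (subtree probability, tie-breaker, symbols in the subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        # Every symbol under the merged node moves one level deeper.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


def laplacian_bin_probs(step, scale=1.0, max_index=200):
    """Bin probabilities of a uniform midtread quantizer on a Laplacian source.

    The source model and the truncation at max_index are illustrative
    assumptions, not taken from the paper.
    """
    def cdf(x):
        if x < 0:
            return 0.5 * math.exp(x / scale)
        return 1.0 - 0.5 * math.exp(-x / scale)

    probs = []
    for k in range(-max_index, max_index + 1):
        p = cdf((k + 0.5) * step) - cdf((k - 0.5) * step)
        if p > 1e-12:  # drop bins with negligible probability
            probs.append(p)
    total = sum(probs)
    return [p / total for p in probs]


def huffman_redundancy(step):
    """Return (entropy, average Huffman length, redundancy) in bits/symbol."""
    probs = laplacian_bin_probs(step)
    lengths = huffman_lengths(probs)
    entropy = -sum(p * math.log2(p) for p in probs)
    avg_len = sum(p * n for p, n in zip(probs, lengths))
    return entropy, avg_len, avg_len - entropy


if __name__ == "__main__":
    # Larger step = coarser quantization = lower resolution.
    for step in (0.25, 1.0, 4.0):
        h, l, r = huffman_redundancy(step)
        print(f"step={step}: entropy={h:.3f}, avg length={l:.3f}, redundancy={r:.3f}")
```

Under these assumptions, the coarse quantizer concentrates most of the probability in one bin, yet a Huffman code must still spend at least one bit per symbol, so the gap between average length and entropy widens as the step size grows.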
Pages: 2288 - 2300
Number of pages: 13
Related papers
50 in total
  • [1] A trellis-based optimal parameter value selection for audio coding
    Aggarwal, A
    Regunathan, SL
    Rose, K
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 623 - 633
  • [2] Joint speech/audio coding based scalable perceptual audio coding
    Gao, Li
    Hu, Ruimin
    Yang, Yuhong
    2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
  • [3] Trellis-based optimization of MPEG-4 advanced audio coding
    Aggarwal, A
    Regunathan, SL
    Rose, K
    2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 142 - 144
  • [4] JOINT ENTROPY-SCALABLE CODING OF AUDIO SIGNALS
    Movassagh, Mahmood
    Thiemann, Joachim
    Kabal, Peter
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2961 - 2964
  • [5] Entropy Coding of Spectral Envelopes for Speech And Audio Coding Using Distribution Quantization
    Korse, Srikanth
    Jaehnel, Tobias
    Baeckstroem, Tom
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2543 - 2547
  • [6] Residual image coding using trellis quantization
    Eriksson, Tomas
    Goertz, Norbert
    Novak, Mirek
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1689 - 1692
  • [7] Trellis-based Equalization Schemes for Physical Layer Network Coding
    Schmidt, Armin
    Gerstacker, Wolfgang
    2012 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2012,
  • [8] Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding
    Melkote, Vinay
    Rose, Kenneth
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02): : 330 - 341
  • [9] Turbo and trellis-based constructions for source coding with side information
    Chou, J
    Pradhan, SS
    Ramchandran, K
    DCC 2003: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2003, : 33 - 42
  • [10] Adaptive Audio Steganography Based on Advanced Audio Coding and Syndrome-Trellis Coding
    Luo, Weiqi
    Zhang, Yue
    Li, Haodong
    DIGITAL FORENSICS AND WATERMARKING, 2017, 10431 : 177 - 186