RATE-DISTORTION-OPTIMIZATION FOR DEEP IMAGE COMPRESSION

Cited by: 3
Authors
Schaefer, Michael [1]
Pientka, Sophie [1]
Pfaff, Jonathan [1]
Schwarz, Heiko [1]
Marpe, Detlev [1]
Wiegand, Thomas [1]
Affiliation
[1] Fraunhofer Inst Telecommun, Video Commun & Applicat Dept, Heinrich Hertz Inst, Einsteinufer 37, D-10587 Berlin, Germany
Keywords
High Efficiency Video Coding (HEVC); Versatile Video Coding (VVC); Deep Learning; Auto-Encoder; Rate-Distortion-Optimization;
DOI
10.1109/ICIP42928.2021.9506513
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Given the capabilities of massive GPU hardware, there has been a surge in the use of artificial neural networks (ANNs) for still image compression. These compression systems usually consist of convolutional layers and can be considered a form of non-linear transform coding. Notably, these ANNs follow an end-to-end approach in which the encoder determines a compressed version of the image as features. In contrast, existing image and video codecs employ a block-based architecture with signal-dependent encoder optimizations. A basic requirement for designing such optimizations is estimating the impact of the quantization error on the resulting bitrate and distortion. For non-linear, multi-layered neural networks, this is a difficult problem. This paper presents a performant auto-encoder architecture for still image compression that represents the compressed features at multiple scales. We then demonstrate how an algorithm that tests multiple feature candidates can reduce the Lagrangian cost and improve compression efficiency. The algorithm avoids multiple network executions by pre-estimating the impact of the quantization on the distortion with a higher-order polynomial.
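The candidate test described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it works on a single scalar feature, and the names `estimated_distortion`, `rate_bits`, and `choose_level`, the polynomial coefficients, and the rate model are all hypothetical. The idea shown is only the general one: evaluate a Lagrangian cost J = D + λR for a few quantization candidates, with D taken from a polynomial in the quantization error instead of a decoder run.

```python
import math


def estimated_distortion(err, coeffs):
    """Polynomial surrogate for distortion as a function of quantization error.

    coeffs are in ascending order (c0, c1, c2, ...); evaluated with Horner's rule.
    The coefficients are assumed to be fitted offline (hypothetical here).
    """
    result = 0.0
    for c in reversed(coeffs):
        result = result * err + c
    return result


def rate_bits(level):
    """Hypothetical rate model: larger magnitudes cost more bits."""
    return 1.0 + 2.0 * math.log2(1 + abs(level))


def choose_level(x, lam, coeffs):
    """Pick the integer level minimizing J = D_est + lam * R among a few candidates."""
    candidates = sorted({math.floor(x), math.ceil(x), 0})
    best_level, best_cost = None, float("inf")
    for q in candidates:
        cost = estimated_distortion(x - q, coeffs) + lam * rate_bits(q)
        if cost < best_cost:
            best_level, best_cost = q, cost
    return best_level, best_cost


if __name__ == "__main__":
    # Fourth-order surrogate: D(e) = e^2 + 0.5 * e^4 (illustrative values only).
    coeffs = [0.0, 0.0, 1.0, 0.0, 0.5]
    for lam in (0.01, 0.5):
        print(lam, choose_level(0.6, lam, coeffs))
```

With a small λ the distortion term dominates and the nearest level is kept; with a larger λ the cheaper level 0 wins even though its estimated distortion is higher.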
Pages: 3737-3741
Page count: 5
Related papers
50 in total
  • [31] Normal mesh compression based on rate-distortion optimization
    Sim, JY
    Kim, CS
    Kuo, CCJ
    Lee, SU
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 13 - 16
  • [32] Efficient Rate-Distortion Optimization for HDR Video Compression
    Mir, Junaid
    Kulupana, Gosala
    Talagala, Dumidu S.
    Arachchi, Hemantha Kodikara
    Fernando, Anil
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
  • [33] Light Field Image Compression Based on Bi-Level View Compensation With Rate-Distortion Optimization
    Hou, Junhui
    Chen, Jie
    Chau, Lap-Pui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (02) : 517 - 530
  • [34] Interferential multispectral image compression with classified weighted rate-distortion optimization and adaptive coding depth control
    Wang, Keyan
    Li, Yunsong
    Guo, Jie
    Liu, Kai
    Wu, Chengke
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2008, : 651 - 655
  • [35] A Novel Image Compression Algorithm Based on Multitree Dictionary and Perceptual-based Rate-Distortion Optimization
    Hua, Kai-Lung
    Ahmadiyah, Adhatus Solichah
    Anistyasari, Yeni
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (02) : 475 - 489
  • [36] ADVANCING THE RATE-DISTORTION-COMPUTATION FRONTIER FOR NEURAL IMAGE COMPRESSION
    Minnen, David
    Johnston, Nick
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2940 - 2944
  • [37] Rate-Distortion-Cognition Controllable Versatile Neural Image Compression
    Liu, Jinming
    Feng, Ruoyu
    Qi, Yunpeng
    Chen, Qiuyu
    Chen, Zhibo
    Zeng, Wenjun
    Jin, Xin
    COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 329 - 348
  • [38] Rate-Distortion Based Sparse Coding for Image Set Compression
    Zhang, Xinfeng
    Lin, Weisi
    Ma, Siwei
    Wang, Shiqi
    Gao, Wen
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [39] Rate-distortion optimized color quantization for compound image compression
    Ding, Wenpeng
    Lu, Yan
    Wu, Feng
    Li, Shipeng
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2007, PTS 1 AND 2, 2007, 6508
  • [40] Variable Rate Deep Image Compression With Modulated Autoencoder
    Yang, Fei
    Herranz, Luis
    van de Weijer, Joost
    Guitian, Jose A. Iglesias
    Lopez, Antonio M.
    Mozerov, Mikhail G.
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 331 - 335