End-to-End Optimized ROI Image Compression

Cited by: 52
Authors
Cai, Chunlei [1]
Chen, Li [1]
Zhang, Xiaoyun [1]
Gao, Zhiyong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Dept Elect Engn, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai;
Keywords
Region of interest; lossy image compression; object segmentation; ROI coding; rate distortion optimization; convolutional neural network;
DOI
10.1109/TIP.2019.2960869
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Compressing an image with more bits automatically allocated to the region of interest (ROI) than to the background can both protect key information and remove substantial redundancy. This paper models ROI image compression as an optimization problem: minimizing a weighted sum of the rate of the image and the distortion of the ROI. The traditional framework solves this problem by cascading ROI prediction and ROI coding, an arrangement that cannot reach the jointly optimized solution. To improve coding performance, we propose a novel deep-learning-based unified framework that can achieve rate-distortion optimization for ROI compression. Specifically, the proposed framework includes a pair of ROI encoder and decoder convolutional neural networks and a learned entropy codec. The encoder network simultaneously generates multiscale representations that support efficient rate allocation and an implicit ROI mask that guides rate allocation. The proposed framework can automatically complete ROI image compression, and it can be optimized from data in an end-to-end manner. To effectively train the framework by backpropagation, we develop a soft-to-hard ROI prediction scheme that makes the entire framework differentiable. To improve visual quality, we propose a hierarchical distortion loss function that protects both pixel-level fidelity for the ROI and structural similarity for the entire image. The proposed framework is implemented in two scenarios: salient-target and face-target ROI compression. Comparative experiments demonstrate the advantages of the proposed framework over the traditional framework, including considerably better subjective visual quality, significantly higher objective ROI compression performance, and higher execution efficiency.
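The objective described in the abstract (rate plus a weighted ROI distortion, with a mask that moves from soft to hard) can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the `temperature` and `lam` parameters, the sigmoid relaxation, and the use of mask-weighted mean squared error are all assumptions made for illustration, and the rate is passed in as a plain number rather than estimated by a learned entropy model.

```python
import math


def _sigmoid(x):
    # Numerically stable logistic function (avoids overflow for large |x|).
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def soft_mask(logits, temperature):
    # Hypothetical relaxation: a sigmoid with a temperature. As the
    # temperature shrinks, values approach a hard 0/1 ROI mask.
    return [[_sigmoid(v / temperature) for v in row] for row in logits]


def roi_distortion(original, reconstructed, mask):
    # Mask-weighted mean squared error (an assumed distortion measure).
    num = 0.0
    den = 0.0
    for o_row, r_row, m_row in zip(original, reconstructed, mask):
        for o, r, m in zip(o_row, r_row, m_row):
            num += m * (o - r) ** 2
            den += m
    return num / den if den > 0 else 0.0


def rd_objective(rate_bits, original, reconstructed, mask, lam):
    # Weighted sum of image rate and ROI distortion; lam is the trade-off weight.
    return rate_bits + lam * roi_distortion(original, reconstructed, mask)
```

With a large temperature, background errors still leak into the distortion term, which keeps gradients flowing to the mask predictor; with a very small temperature, only ROI pixels count, which matches the hard mask that would be used at inference time.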
Pages: 3442-3457
Page count: 16
Related papers
50 in total
  • [41] Reducing The Amortization Gap of Entropy Bottleneck In End-to-End Image Compression
    Balcilar, Muhammet
    Damodaran, Bharath
    Hellier, Pierre
    2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 115 - 119
  • [42] Towards End-to-End Compression in Lustre
    Fuchs, Anna
    Squar, Jannek
    Kuhn, Michael
    2024 23RD INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, ISPDC 2024, 2024,
  • [43] END-TO-END DETECTION-SEGMENTATION NETWORK WITH ROI CONVOLUTION
    Zhang, Zichen
    Tang, Min
    Cobzas, Dana
    Zonoobi, Dornoosh
    Jagersand, Martin
    Jaremko, Jacob L.
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 1509 - 1512
  • [44] Two-Stage Octave Residual Network for End-to-End Image Compression
    Chen, Fangdong
    Xu, Yumeng
    Wang, Li
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3922 - 3929
  • [45] Learning True Rate-Distortion-Optimization for End-To-End Image Compression
    Brand, Fabian
    Fischer, Kristian
    Kopte, Alexander
    Kaup, Andre
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 443 - 443
  • [46] A new end-to-end image compression system based on convolutional neural networks
    Akyazi, Pinar
    Ebrahimi, Touradj
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
  • [47] CONTENT-ADAPTIVE PARALLEL ENTROPY CODING FOR END-TO-END IMAGE COMPRESSION
    Li, Shujia
    Wang, Dezhao
    Fan, Zejia
    Liu, Jiaying
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3195 - 3199
  • [48] Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure
    Li, Yang
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Wang, Yue
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4297 - 4305
  • [49] An End-to-End Deep Learning Image Compression Framework Based on Semantic Analysis
    Wang, Cheng
    Han, Yifei
    Wang, Weidong
    APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [50] End-to-end deep multispectral image compression based on interspectral prediction network
    Kong, Fanqiang
    Meng, Yuxin
    Li, Dan
    Hu, Kedi
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)