End-to-End Optimized ROI Image Compression

被引：52

作者：

Cai, Chunlei ^{[1
]}

Chen, Li ^{[1
]}

Zhang, Xiaoyun ^{[1
]}

Gao, Zhiyong ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Dept Elect Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020年 / 29卷

基金：

中国国家自然科学基金; 上海市自然科学基金;

关键词：

Region of interest; lossy image compression; object segmentation; ROI coding; rate distortion optimization; convolutional neural network;

D O I：

10.1109/TIP.2019.2960869

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Compressing an image with more bits automatically allocated to the region of interest (ROI) than to the background can both protect key information and reduce substantial redundancy. This paper models ROI image compression as an optimization problem of minimizing a weighted sum of the rate of the image and distortion of the ROI. The traditional framework solves this problem by cascading ROI prediction and ROI coding, through which achieving the optimized solution is impossible. To improve coding performance, we propose a novel deep-learning-based unified framework that can achieve rate distortion optimization for ROI compression. Specifically, the proposed framework includes a pair of ROI encoder and decoder convolutional neural networks and a learned entropy codec. The encoder network simultaneously generates multiscale representations that support efficient rate allocation and an implicit ROI mask that guides rate allocation. The proposed framework can automatically complete ROI image compression, and it can be optimized from data in an end-to-end manner. To effectively train the framework by back propagation, we develop a soft-to-hard ROI prediction scheme to make the entire framework differential. To improve visual quality, we propose a hierarchical distortion loss function to protect both pixel-level fidelity for ROI and structural similarity for the entire image. The proposed framework is implemented in two scenarios: salient-target and face-target ROI compression. Comparative experiments demonstrate the advantages of the proposed framework over the traditional framework, including considerably better subjective visual quality, significantly higher objective ROI compression performance and execution efficiency.

引用

页码：3442 / 3457

页数：16

共 50 条

[41] Reducing The Amortization Gap of Entropy Bottleneck In End-to-End Image Compression
Balcilar, Muhammet
Damodaran, Bharath
Hellier, Pierre
2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 115 - 119
[42] Towards End-to-End Compression in Lustre
Fuchs, Anna
Squar, Jannek
Kuhn, Michael
2024 23RD INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, ISPDC 2024, 2024,
[43] END-TO-END DETECTION-SEGMENTATION NETWORK WITH ROI CONVOLUTION
Zhang, Zichen
Tang, Min
Cobzas, Dana
Zonoobi, Dornoosh
Jagersand, Martin
Jaremko, Jacob L.
2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 1509 - 1512
[44] Two-Stage Octave Residual Network for End-to-End Image Compression
Chen, Fangdong
Xu, Yumeng
Wang, Li
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3922 - 3929
[45] Learning True Rate-Distortion-Optimization for End-To-End Image Compression
Brand, Fabian
Fischer, Kristian
Kopte, Alexander
Kaup, Andre
DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 443 - 443
[46] A new end-to-end image compression system based on convolutional neural networks
Akyazi, Pinar
Ebrahimi, Touradj
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
[47] CONTENT-ADAPTIVE PARALLEL ENTROPY CODING FOR END-TO-END IMAGE COMPRESSION
Li, Shujia
Wang, Dezhao
Fan, Zejia
Liu, Jiaying
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3195 - 3199
[48] Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure
Li, Yang
Wang, Shiqi
Zhang, Xinfeng
Wang, Shanshe
Ma, Siwei
Wang, Yue
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4297 - 4305
[49] An End-to-End Deep Learning Image Compression Framework Based on Semantic Analysis
Wang, Cheng
Han, Yifei
Wang, Weidong
APPLIED SCIENCES-BASEL, 2019, 9 (17):
[50] End-to-end deep multispectral image compression based on interspectral prediction network
Kong, Fanqiang
Meng, Yuxin
Li, Dan
Hu, Kedi
JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)

← 1 2 3 4 5 →