High efficiency deep image compression via channel-wise scale adaptive latent representation learning

被引：0

作者：

Wu, Chenhao ^{[1
]}

Wu, Qingbo ^{[1
]}

Ngan, King Ngi ^{[1
]}

Li, Hongliang ^{[1
]}

Meng, Fanman ^{[1
]}

Xu, Linfeng ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2025年 / 130卷

关键词：

Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior;

D O I：

10.1016/j.image.2024.117227

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Recent learning based neural image compression methods have achieved impressive rate-distortion (RD) performance via the sophisticated context entropy model, which performs well in capturing the spatial correlations of latent features. However, due to the dependency on the adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits its practicability. Instead of pursuing computationally expensive entropy estimation, we propose to reduce the spatial redundancy via the channel-wise scale adaptive latent representation learning, whose entropy coding is spatially context- free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, lower-scale latent representation will be allocated to the channels with higher spatial redundancy, which consumes fewer bits and vice versa. The downscaled latent features could be well recovered with a lightweight inter-channel upconversion module in the decoder. To compensate for the entropy estimation performance degradation, we further develop an inter-scale hyperprior entropy model, which supports the high efficiency parallel encoding/decoding within each scale of the latent features. Extensive experiments are conducted to illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.

引用

页数：14

共 50 条

[1] CHANNEL-WISE PROGRESSIVE LEARNING FOR LOSSLESS IMAGE COMPRESSION
Rhee, Hochang
Fang, Yeong Il
Kim, Seyun
Cho, Nam lk
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1113 - 1117
[2] Deep metric learning via group channel-wise ensemble
Li, Ping
Zhao, Guopan
Chen, Jiajun
Xu, Xianghua
KNOWLEDGE-BASED SYSTEMS, 2023, 259
[3] Deep metric learning via group channel-wise ensemble
Li, Ping
Zhao, Guopan
Chen, Jiajun
Xu, Xianghua
KNOWLEDGE-BASED SYSTEMS, 2023, 259
[4] Graph Representation Learning via Hard and Channel-Wise Attention Networks
Gao, Hongyang
Ji, Shuiwang
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 741 - 749
[5] LEARNED IMAGE COMPRESSION WITH CHANNEL-WISE GROUPED CONTEXT MODELING
Yuan, Liang
Luo, Jixiang
Li, Shaohui
Dai, Wenrui
Li, Chenglin
Zou, Junni
Xiong, Hongkai
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2099 - 2103
[6] Channel-Wise Feature Decorrelation for Enhanced Learned Image Compression
Pakdaman, Farhad
Gabbouj, Moncef
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1635 - 1639
[7] CHANNEL-WISE AUTOREGRESSIVE ENTROPY MODELS FOR LEARNED IMAGE COMPRESSION
Minnen, David
Singh, Saurabh
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3339 - 3343
[8] Fake Colorized Image Detection with Channel-wise Convolution based Deep-learning Framework
Zhuo, Long
Tan, Shunquan
Zeng, Jishen
Li, Bin
2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 733 - 736
[9] Underwater image enhancement via a channel-wise transmission estimation network
Wang, Qiang
Fu, Bo
Fan, Huijie
IET IMAGE PROCESSING, 2023, 17 (10) : 2958 - 2971
[10] CMAA: Channel-wise multi-scale adaptive attention network for metallographic image semantic segmentation
Sun, Yongliang
Huang, Xiangyang
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 276

← 1 2 3 4 5 →