High efficiency deep image compression via channel-wise scale adaptive latent representation learning

Cited by: 0

Authors:

Wu, Chenhao [1]

Wu, Qingbo [1]

Ngan, King Ngi [1]

Li, Hongliang [1]

Meng, Fanman [1]

Xu, Linfeng [1]

Affiliations:

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2025 / Volume 130

Keywords:

Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior;

DOI:

10.1016/j.image.2024.117227

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Recent learning-based neural image compression methods have achieved impressive rate-distortion (RD) performance via sophisticated context entropy models, which perform well in capturing the spatial correlations of latent features. However, because they depend on adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits their practicability. Instead of pursuing computationally expensive entropy estimation, we propose to reduce spatial redundancy via channel-wise scale-adaptive latent representation learning, whose entropy coding is spatially context-free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, a lower-scale latent representation is allocated to channels with higher spatial redundancy, which consumes fewer bits, and vice versa. The downscaled latent features can be well recovered with a lightweight inter-channel upconversion module in the decoder. To compensate for the degradation in entropy estimation performance, we further develop an inter-scale hyperprior entropy model, which supports high-efficiency parallel encoding/decoding within each scale of the latent features. Extensive experiments illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.

Pages: 14
