High efficiency deep image compression via channel-wise scale adaptive latent representation learning

Cited by: 0

Authors:

Wu, Chenhao [1]

Wu, Qingbo [1]

Ngan, King Ngi [1]

Li, Hongliang [1]

Meng, Fanman [1]

Xu, Linfeng [1]

Affiliations:

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2025 / Vol. 130

Keywords:

Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior;

DOI:

10.1016/j.image.2024.117227

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Recent learning-based neural image compression methods have achieved impressive rate-distortion (RD) performance via sophisticated context entropy models, which perform well in capturing the spatial correlations of latent features. However, because they depend on adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits their practicality. Instead of pursuing computationally expensive entropy estimation, we propose to reduce the spatial redundancy via channel-wise scale adaptive latent representation learning, whose entropy coding is spatially context-free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, a lower-scale latent representation, which consumes fewer bits, is allocated to the channels with higher spatial redundancy, and vice versa. The downscaled latent features can be well recovered by a lightweight inter-channel upconversion module in the decoder. To compensate for the resulting degradation in entropy estimation performance, we further develop an inter-scale hyperprior entropy model, which supports highly efficient parallel encoding/decoding within each scale of the latent features. Extensive experiments are conducted to illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.
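The channel-wise scale selection described in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the mask is fixed by hand here rather than learned with the RD cost, plain 2x average pooling stands in for the encoder's downscaling, and nearest-neighbour repetition stands in for the learned inter-channel upconversion module. All function names (`downscale_channels`, `upconvert_channels`, `num_symbols`) are hypothetical.

```python
import numpy as np

def downscale_channels(latent, mask, factor=2):
    """Average-pool the channels flagged by `mask`; keep the rest at full scale.

    latent: array of shape (C, H, W); mask: length-C sequence of 0/1 flags.
    Returns a list of per-channel arrays, because scales now differ by channel.
    Assumption: the mask is given, whereas the paper learns it with the RD cost.
    """
    c, h, w = latent.shape
    assert h % factor == 0 and w % factor == 0
    out = []
    for ch in range(c):
        if mask[ch]:
            blocks = latent[ch].reshape(h // factor, factor, w // factor, factor)
            out.append(blocks.mean(axis=(1, 3)))
        else:
            out.append(latent[ch].copy())
    return out

def upconvert_channels(channels, full_shape):
    """Nearest-neighbour stand-in for the learned upconversion module."""
    h, w = full_shape
    restored = []
    for x in channels:
        fh, fw = h // x.shape[0], w // x.shape[1]
        restored.append(np.repeat(np.repeat(x, fh, axis=0), fw, axis=1))
    return np.stack(restored)

def num_symbols(channels):
    """Count the latent elements that would need to be entropy coded."""
    return sum(x.size for x in channels)

# Channel 0 is spatially constant (highly redundant); channel 1 is noise.
rng = np.random.default_rng(0)
latent = np.stack([np.full((4, 4), 3.0), rng.normal(size=(4, 4))])
mask = [1, 0]  # downscale only the redundant channel

coded = downscale_channels(latent, mask)
restored = upconvert_channels(coded, latent.shape[1:])
print(num_symbols(coded), "symbols instead of", latent.size)
```

In this toy case the redundant channel is recovered exactly from a quarter of its elements, while the noisy channel is left untouched. A real codec would trade the reconstruction error of each channel against its bit cost when choosing the mask.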


Pages: 14

50 items in total

[41] Adaptive Compression of Massive MIMO Channel State Information With Deep Learning
Mismar, Faris B.
Kaya, Aliye Ozge
IEEE Networking Letters, 2024, 6 (04): : 267 - 271
[42] Compression of YOLOv3 via Block-wise and Channel-wise Pruning for Real-time and Complicated Autonomous Driving Environment Sensing Applications
Li, Jiaqi
Zhao, Yanan
Gao, Li
Cui, Feng
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5107 - 5114
[43] Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
Oh, Yongjeong
Lee, Jaeho
Brinton, Christopher G.
Jeon, Yo-Seb
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
[44] Unsupervised feature selection via adaptive hypergraph regularized latent representation learning
Ding, Deqiong
Yang, Xiaogao
Xia, Fei
Ma, Tiefeng
Liu, Haiyun
Tang, Chang
NEUROCOMPUTING, 2020, 378 : 79 - 97
[45] Unsupervised feature selection via adaptive hypergraph regularized latent representation learning
Ding, Deqiong
Yang, Xiaogao
Xia, Fei
Ma, Tiefeng
Liu, Haiyun
Tang, Chang
Neurocomputing, 2021, 378 : 79 - 97
[46] Representation Learning Based on Autoencoder and Deep Adaptive Clustering for Image Clustering
Yu, Siquan
Liu, Jiaxin
Han, Zhi
Li, Yong
Tang, Yandong
Wu, Chengdong
MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
[47] Medical image fusion via decoupled representation and component-wise regularization learning
Zhang, Rui
Sun, Haoze
Deng, Lizhen
Zhu, Hu
Qian, Wei
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
[48] Towards Channel-Wise Bidirectional Representation Learning with Fixed-Point Positional Encoding for SoH Estimation of Lithium-Ion Battery
Pham, Thien
Truong, Loi
Bui, Hung
Tran, Thang
Garg, Akhil
Gao, Liang
Quan, Tho
ELECTRONICS, 2023, 12 (01)
[49] High Efficiency Deep-learning Based Video Compression
Tang, Lv
Zhang, Xinfeng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
[50] Hyperspectral Image Compression via Cross-Channel Contrastive Learning
Guo, Yuanyuan
Chong, Yanwen
Pan, Shaoming
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
