High efficiency deep image compression via channel-wise scale adaptive latent representation learning

被引:0
|
作者
Wu, Chenhao [1 ]
Wu, Qingbo [1 ]
Ngan, King Ngi [1 ]
Li, Hongliang [1 ]
Meng, Fanman [1 ]
Xu, Linfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China
关键词
Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior;
D O I
10.1016/j.image.2024.117227
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent learning based neural image compression methods have achieved impressive rate-distortion (RD) performance via the sophisticated context entropy model, which performs well in capturing the spatial correlations of latent features. However, due to the dependency on the adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits its practicability. Instead of pursuing computationally expensive entropy estimation, we propose to reduce the spatial redundancy via the channel-wise scale adaptive latent representation learning, whose entropy coding is spatially context- free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, lower-scale latent representation will be allocated to the channels with higher spatial redundancy, which consumes fewer bits and vice versa. The downscaled latent features could be well recovered with a lightweight inter-channel upconversion module in the decoder. To compensate for the entropy estimation performance degradation, we further develop an inter-scale hyperprior entropy model, which supports the high efficiency parallel encoding/decoding within each scale of the latent features. Extensive experiments are conducted to illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] CHANNEL-WISE PROGRESSIVE LEARNING FOR LOSSLESS IMAGE COMPRESSION
    Rhee, Hochang
    Fang, Yeong Il
    Kim, Seyun
    Cho, Nam lk
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1113 - 1117
  • [2] Deep metric learning via group channel-wise ensemble
    Li, Ping
    Zhao, Guopan
    Chen, Jiajun
    Xu, Xianghua
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [3] Deep metric learning via group channel-wise ensemble
    Li, Ping
    Zhao, Guopan
    Chen, Jiajun
    Xu, Xianghua
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [4] Graph Representation Learning via Hard and Channel-Wise Attention Networks
    Gao, Hongyang
    Ji, Shuiwang
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 741 - 749
  • [5] LEARNED IMAGE COMPRESSION WITH CHANNEL-WISE GROUPED CONTEXT MODELING
    Yuan, Liang
    Luo, Jixiang
    Li, Shaohui
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2099 - 2103
  • [6] Channel-Wise Feature Decorrelation for Enhanced Learned Image Compression
    Pakdaman, Farhad
    Gabbouj, Moncef
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1635 - 1639
  • [7] CHANNEL-WISE AUTOREGRESSIVE ENTROPY MODELS FOR LEARNED IMAGE COMPRESSION
    Minnen, David
    Singh, Saurabh
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3339 - 3343
  • [8] Fake Colorized Image Detection with Channel-wise Convolution based Deep-learning Framework
    Zhuo, Long
    Tan, Shunquan
    Zeng, Jishen
    Li, Bin
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 733 - 736
  • [9] Underwater image enhancement via a channel-wise transmission estimation network
    Wang, Qiang
    Fu, Bo
    Fan, Huijie
    IET IMAGE PROCESSING, 2023, 17 (10) : 2958 - 2971
  • [10] CMAA: Channel-wise multi-scale adaptive attention network for metallographic image semantic segmentation
    Sun, Yongliang
    Huang, Xiangyang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 276