High efficiency deep image compression via channel-wise scale adaptive latent representation learning

被引:0
|
作者
Wu, Chenhao [1 ]
Wu, Qingbo [1 ]
Ngan, King Ngi [1 ]
Li, Hongliang [1 ]
Meng, Fanman [1 ]
Xu, Linfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China
关键词
Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior;
D O I
10.1016/j.image.2024.117227
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent learning based neural image compression methods have achieved impressive rate-distortion (RD) performance via the sophisticated context entropy model, which performs well in capturing the spatial correlations of latent features. However, due to the dependency on the adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits its practicability. Instead of pursuing computationally expensive entropy estimation, we propose to reduce the spatial redundancy via the channel-wise scale adaptive latent representation learning, whose entropy coding is spatially context- free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, lower-scale latent representation will be allocated to the channels with higher spatial redundancy, which consumes fewer bits and vice versa. The downscaled latent features could be well recovered with a lightweight inter-channel upconversion module in the decoder. To compensate for the entropy estimation performance degradation, we further develop an inter-scale hyperprior entropy model, which supports the high efficiency parallel encoding/decoding within each scale of the latent features. Extensive experiments are conducted to illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Learning region-wise deep feature representation for image analysis
    Zhu X.
    Wang Q.
    Li P.
    Zhang X.-Y.
    Wang L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (11) : 14775 - 14784
  • [32] Adaptive Latent Graph Representation Learning for Image-Text Matching
    Tian, Mengxiao
    Wu, Xinxiao
    Jia, Yunde
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 (471-482) : 471 - 482
  • [33] Deep learning-based post-disaster building inspection with channel-wise attention and semi-supervised learning
    Tang, Wen
    Mondal, Tarutal Ghosh
    Wu, Rih-Teng
    Subedi, Abhishek
    Jahanshahi, Mohammad R.
    SMART STRUCTURES AND SYSTEMS, 2023, 31 (04) : 365 - 381
  • [34] Video-Based Deception Detection via Capsule Network With Channel-Wise Attention and Supervised Contrastive Learning
    Gao, Shuai
    Chen, Lin
    Fang, Yuancheng
    Xiao, Shengbing
    Li, Hui
    Yang, Xuezhi
    Song, Rencheng
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2024, 5 : 660 - 670
  • [35] SPATIALLY-ADAPTIVE LEARNING-BASED IMAGE COMPRESSION WITH HIERARCHICAL MULTI-SCALE LATENT SPACES
    Brand, Fabian
    Kopte, Alexander
    Fischer, Kristian
    Kaup, Andre
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1660 - 1664
  • [36] Image representation and compression via adaptive multi-Gabor representations
    Li, SD
    MATHEMATICS OF DATA/IMAGE CODING, COMPRESSION, AND ENCRYPTION, 1998, 3456 : 67 - 76
  • [37] COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
    Park, Jongmin
    Lee, Jooyoung
    Kim, Munchurl
    arXiv, 2023,
  • [38] COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
    Park, Jongmin
    Lee, Jooyoung
    Kim, Munchurl
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12780 - 12789
  • [39] Adaptive Encrypted Traffic Characterization via Deep Representation Learning
    Wintrode, Jonathan
    DeTienne, David
    2022 INTERMOUNTAIN ENGINEERING, TECHNOLOGY AND COMPUTING (IETC), 2022,
  • [40] Deep document clustering via adaptive hybrid representation learning
    Ren, Lina
    Qin, Yongbin
    Chen, Yanping
    Lin, Chuan
    Huang, Ruizhang
    KNOWLEDGE-BASED SYSTEMS, 2023, 281