High efficiency deep image compression via channel-wise scale adaptive latent representation learning

Cited by: 0
Authors
Wu, Chenhao [1]
Wu, Qingbo [1]
Ngan, King Ngi [1]
Li, Hongliang [1]
Meng, Fanman [1]
Xu, Linfeng [1]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006, Xiyuan Ave, West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China
Keywords
Learned image compression; Efficient decoding; Scale-adaptive; Inter-channel upconversion; Inter-scale hyperprior
DOI
10.1016/j.image.2024.117227
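Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract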
Recent learning-based neural image compression methods have achieved impressive rate-distortion (RD) performance via sophisticated context entropy models, which perform well in capturing the spatial correlations of latent features. However, because they depend on adjacent or distant decoded features, existing methods require an inefficient serial processing structure, which significantly limits their practicality. Instead of pursuing computationally expensive entropy estimation, we propose to reduce spatial redundancy via channel-wise scale-adaptive latent representation learning, whose entropy coding is spatially context-free and parallelizable. Specifically, the proposed encoder adaptively determines the scale of the latent features via a learnable binary mask, which is optimized with the RD cost. In this way, a lower-scale latent representation, which consumes fewer bits, is allocated to the channels with higher spatial redundancy, and vice versa. The downscaled latent features can be well recovered with a lightweight inter-channel upconversion module in the decoder. To compensate for the resulting degradation in entropy estimation performance, we further develop an inter-scale hyperprior entropy model, which supports high-efficiency parallel encoding/decoding within each scale of the latent features. Extensive experiments are conducted to illustrate the efficacy of the proposed method. Our method achieves bitrate savings of 18.23%, 19.36%, and 27.04% over HEVC Intra, along with decoding speeds that are 46 times, 48 times, and 51 times faster than the baseline method on the Kodak, Tecnick, and CLIC datasets, respectively.
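The channel-wise scale selection described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the mask here is a fixed, hand-picked list rather than a learned one, and plain average pooling and nearest-neighbour upsampling stand in for the learned downscaling and the inter-channel upconversion module. All function names and the example latent are hypothetical.

```python
# Illustrative sketch only: a fixed binary mask stands in for the learned one,
# and average pooling / nearest-neighbour upsampling stand in for the paper's
# learned downscaling and inter-channel upconversion modules.

def downscale2x(channel):
    """Average-pool a 2D list (even height and width) by a factor of 2."""
    h, w = len(channel), len(channel[0])
    return [
        [
            (channel[i][j] + channel[i][j + 1]
             + channel[i + 1][j] + channel[i + 1][j + 1]) / 4.0
            for j in range(0, w, 2)
        ]
        for i in range(0, h, 2)
    ]


def upscale2x(channel):
    """Nearest-neighbour upsample a 2D list by a factor of 2."""
    out = []
    for row in channel:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def encode(latent, mask):
    """Keep channel c at full scale if mask[c] == 0, else store it downscaled."""
    return [downscale2x(ch) if m else ch for ch, m in zip(latent, mask)]


def decode(coded, mask):
    """Bring every channel back to the full-scale resolution."""
    return [upscale2x(ch) if m else ch for ch, m in zip(coded, mask)]


def num_symbols(channels):
    """Count latent elements that would need to be entropy coded."""
    return sum(len(ch) * len(ch[0]) for ch in channels)


# Hypothetical 2-channel, 4x4 latent: channel 0 is detailed, channel 1 is flat.
latent = [
    [[float(4 * i + j) for j in range(4)] for i in range(4)],
    [[1.0] * 4 for _ in range(4)],
]
mask = [0, 1]  # assumed mask: only the spatially redundant channel is downscaled

coded = encode(latent, mask)
restored = decode(coded, mask)
```

In this toy case the flat channel is stored at a quarter of its original size and is still recovered exactly, while the detailed channel is left at full scale. In the paper, the choice of which channels to downscale is learned jointly with the RD cost rather than set by hand.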
Pages: 14