Towards an Efficient Remote Sensing Image Compression Network with Visual State Space Model

被引:0
|
作者
Wang, Yongqiang [1 ]
Liang, Feng [1 ]
Wang, Shang [1 ]
Chen, Hang [1 ]
Cao, Qi [1 ]
Fu, Haisheng [1 ]
Chen, Zhenjiao [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Peoples R China
[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing image compression; state space model; visual Mamba; rate-distortion performance; image compression network;
D O I
10.3390/rs17030425
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the past few years, deep learning has achieved remarkable advancements in the area of image compression. Remote sensing image compression networks focus on enhancing the similarity between the input and reconstructed images, effectively reducing the storage and bandwidth requirements for high-resolution remote sensing images. As the network's effective receptive field (ERF) expands, it can capture more feature information across the remote sensing images, thereby reducing spatial redundancy and improving compression efficiency. However, the majority of these learned image compression (LIC) techniques are primarily CNN-based and transformer-based, often failing to balance the global ERF and computational complexity optimally. To alleviate this issue, we propose a learned remote sensing image compression network with visual state space model named VMIC to achieve a better trade-off between computational complexity and performance. Specifically, instead of stacking small convolution kernels or heavy self-attention mechanisms, we employ a 2D-bidirectional selective scan mechanism. Every element within the feature map aggregates data from multiple spatial positions, establishing a globally effective receptive field with linear computational complexity. We extend it to an omni-selective scan for the global-spatial correlations within our Channel and Global Context Entropy Model (CGCM), enabling the integration of spatial and channel priors to minimize redundancy across slices. Experimental results demonstrate that the proposed method achieves superior trade-off between rate-distortion performance and complexity. Furthermore, in comparison to traditional codecs and learned image compression algorithms, our model achieves BD-rate reductions of -4.48%, -9.80% over the state-of-the-art VTM on the AID and NWPU VHR-10 datasets, respectively, as well as -6.73% and -7.93% on the panchromatic and multispectral images of the WorldView-3 remote sensing dataset.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] FusionMamba: Efficient Remote Sensing Image Fusion With State Space Model
    Peng, Siran
    Zhu, Xiangyu
    Deng, Haoyu
    Deng, Liang-Jian
    Lei, Zhen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [2] Contour-Enhanced Visual State-Space Model for Remote Sensing Image Classification
    Yan, Liyue
    Zhang, Xing
    Wang, Kafeng
    Zhang, Dejin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [3] RSMamba: Remote Sensing Image Classification With State Space Model
    Chen, Keyan
    Chen, Bowen
    Liu, Chenyang
    Li, Wenyuan
    Zou, Zhengxia
    Shi, Zhenwei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [4] RS3Mamba: Visual State Space Model for Remote Sensing Image Semantic Segmentation
    Ma, Xianping
    Zhang, Xiaokang
    Pun, Man-On
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] RSCaMa: Remote Sensing Image Change Captioning With State Space Model
    Liu, Chenyang
    Chen, Keyan
    Chen, Bowen
    Zhang, Haotian
    Zou, Zhengxia
    Shi, Zhenwei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [6] A LOSSLESS COMPRESSION ALGORITHM OF REMOTE SENSING IMAGE FOR SPACE APPLICATIONS
    Sui Yuping Yang Chengyu Liu Yanjun Wang Jun Wei Zhonghui He Xin(Changchun Institute of Optics
    JournalofElectronics(China), 2008, (05) : 647 - 651
  • [7] Simple and Efficient Remote Sensing Image Transformation for Lossless Compression
    Sepehrband, Farshid
    Ghamisi, Pedram
    Mortazavi, Mohammad
    Choupan, Jeiran
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [8] A novel context model for remote sensing image compression
    Wang Qingyuan
    INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2011: SPACE EXPLORATION TECHNOLOGIES AND APPLICATIONS, 2011, 8196
  • [9] Mixed Entropy Model Enhanced Residual Attention Network for Remote Sensing Image Compression
    Gao, Junjun
    Teng, Qizhi
    He, Xiaohai
    Chen, Zhengxin
    Ren, Chao
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 10117 - 10129
  • [10] Mixed Entropy Model Enhanced Residual Attention Network for Remote Sensing Image Compression
    Junjun Gao
    Qizhi Teng
    Xiaohai He
    Zhengxin Chen
    Chao Ren
    Neural Processing Letters, 2023, 55 : 10117 - 10129