Human-Machine Collaborative Image Compression Method Based on Implicit Neural Representations

Cited by: 1
Authors
Li, Huanyang [1]
Zhang, Xinfeng [1]
Affiliations
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
Keywords
Image compression; image coding for machine; implicit neural representation
DOI
10.1109/JETCAS.2024.3386639
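Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract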
With the explosive increase in the volume of images intended for analysis by AI, image coding for machines has been proposed to transmit information in a machine-interpretable format, thereby enhancing image compression efficiency. However, such efficient coding schemes often lead to loss of image details and features and to unclear semantic information because of the high compression ratio, making them less suitable for human vision. Thus, balancing visual quality and machine vision accuracy at a given compression ratio is a critical problem. To address these issues, we introduce a human-machine collaborative image coding framework based on Implicit Neural Representations (INR), which effectively reduces the information transmitted for machine vision tasks at the decoding side while maintaining high-efficiency image compression for human vision compared with the INR compression framework. To enhance the model's perception of images for machine vision, we design a semantic embedding enhancement module to assist in understanding image semantics. Specifically, we employ the Swin Transformer model to initialize image features, ensuring that the embeddings of the compression model are effectively applicable to downstream visual tasks. Extensive experimental results demonstrate that our method significantly outperforms other image compression methods in classification tasks while preserving image compression efficiency.
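For readers unfamiliar with INR-based compression, the general idea behind the abstract can be sketched as follows: an image is represented by a small network that maps a pixel coordinate to a colour, and the "bitstream" is the set of quantized network weights. The sketch below is a generic illustration only, not the authors' framework; the function names (`make_grid`, `init_mlp`, `forward`, `count_parameters`, `bits_per_pixel`), the sine activation, the layer sizes, and the 8-bit weight quantization are all assumptions chosen for the example, and the semantic embedding module and Swin Transformer initialization described in the abstract are not modelled.

```python
import math
import random


def make_grid(height, width):
    """Return normalized (x, y) coordinates in [-1, 1] for every pixel, row-major."""
    def norm(i, n):
        return 0.0 if n == 1 else 2.0 * i / (n - 1) - 1.0
    return [(norm(c, width), norm(r, height))
            for r in range(height) for c in range(width)]


def init_mlp(layer_sizes, seed=0):
    """Create randomly initialized (weights, biases) pairs for a small MLP.

    layer_sizes is hypothetical, e.g. [2, 16, 16, 3]: (x, y) in, RGB out.
    """
    rng = random.Random(seed)
    layers = []
    for fan_in, fan_out in zip(layer_sizes[:-1], layer_sizes[1:]):
        bound = 1.0 / math.sqrt(fan_in)
        weights = [[rng.uniform(-bound, bound) for _ in range(fan_in)]
                   for _ in range(fan_out)]
        biases = [0.0] * fan_out
        layers.append((weights, biases))
    return layers


def forward(layers, coord):
    """Map one (x, y) coordinate to an RGB triple in (0, 1)."""
    h = list(coord)
    last = len(layers) - 1
    for idx, (weights, biases) in enumerate(layers):
        z = [sum(w * x for w, x in zip(row, h)) + b
             for row, b in zip(weights, biases)]
        if idx == last:
            # Sigmoid keeps the output inside the valid colour range.
            h = [1.0 / (1.0 + math.exp(-v)) for v in z]
        else:
            # Sine activation (an assumption, common in INR work).
            h = [math.sin(v) for v in z]
    return h


def count_parameters(layers):
    """Total number of weights and biases, i.e. what would be transmitted."""
    return sum(len(w) * len(w[0]) + len(b) for w, b in layers)


def bits_per_pixel(num_params, bits_per_weight, height, width):
    """Rate of the representation if every parameter uses bits_per_weight bits."""
    return num_params * bits_per_weight / (height * width)


if __name__ == "__main__":
    height, width = 32, 32
    layers = init_mlp([2, 16, 16, 3])
    image = [forward(layers, xy) for xy in make_grid(height, width)]
    n = count_parameters(layers)
    print(len(image), n, bits_per_pixel(n, 8, height, width))
```

In an actual codec the weights would be fitted to the target image (and typically quantized and entropy coded) before transmission; this sketch leaves them untrained and only shows how the rate depends on the parameter count rather than on the pixel values themselves.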
Pages: 198-208
Number of pages: 11
Related papers
50 in total
  • [1] Human-Machine Collaborative Image and Video Compression: A Survey
    Li, Huanyang
    Zhang, Xinfeng
    Wang, Shiqi
    Wang, Shanshe
    Pan, Jingshan
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (06)
  • [2] Implicit Neural Representations for Image Compression
    Strumpler, Yannick
    Postels, Janis
    Yang, Ren
    Van Gool, Luc
    Tombari, Federico
    COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 : 74 - 91
  • [3] Hyperspectral Image Compression Using Implicit Neural Representations
    Rezasoltani, Shima
    Qureshi, Faisal Z.
    Proceedings - 2023 20th Conference on Robots and Vision, CRV 2023, 2023 : 248 - 255
  • [4] Hyperspectral Image Compression Using Implicit Neural Representations
    Rezasoltani, Shima
    Qureshi, Faisal Z.
    2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023 : 248 - 255
  • [5] Hyperspectral Image Compression Using Sampling and Implicit Neural Representations
    Rezasoltani, Shima
    Qureshi, Faisal Z.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [6] Hyperspectral Image Compression Using Sampling and Implicit Neural Representations
    Rezasoltani, Shima
    Qureshi, Faisal Z.
    arXiv, 2023
  • [7] Learned Image Coding for Human-Machine Collaborative Optimization
    He, Jingbo
    He, Xiaohai
    Xiong, Shuhua
    Chen, Honggang
    IEEE TRANSACTIONS ON BROADCASTING, 2025, 71 (01) : 203 - 216
  • [8] Entropy-Constrained Implicit Neural Representations for Deep Image Compression
    Lee, Soonbin
    Jeong, Jong-Beom
    Ryu, Eun-Seok
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 663 - 667
  • [9] Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision
    Mao, Qi
    Wang, Chongyu
    Wang, Meng
    Wang, Shiqi
    Chen, Ruijie
    Jin, Libiao
    Ma, Siwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 408 - 422
  • [10] Compression with Bayesian Implicit Neural Representations
    Guo, Zongyu
    Flamich, Gergely
    He, Jiajun
    Chen, Zhibo
    Hernandez-Lobato, Jose Miguel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023