End-to-end Image Compression with Swin-Transformer

被引:1
|
作者
Wang, Meng [1 ]
Zhang, Kai [2 ]
Zhang, Li [2 ]
Li, Yue [2 ]
Li, Junru [3 ]
Wang, Yue [3 ]
Wang, Shiqi [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Bytedance Inc, San Diego, CA 92122 USA
[3] Beijing Bytedance Technol Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Image compression; end-to-end compression; transformer; convolution;
D O I
10.1109/VCIP56404.2022.10008895
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an end-to-end image compression framework, which cooperates with the swin-transformer modules to capture the localized and non-localized similarities in image compression. In particular, the swin-transformer modules are deployed in the analysis and synthesis stages, interleaving with convolution layers. The transformer layers are expected to perceive more flexible receptive fields, such that the spatially localized and non-localized redundancies could be more effectively eliminated. The proposed method reveals the excellent capability of signal conjunction and prediction, leading to the improvement of the rate and distortion performance. Experimental results show that the proposed method is superior to the existing methods on both natural scene and screen content images, where 22.46% BD-Rate savings are achieved when compared with the BPG. Over 30% BD-Rate gains could be observed with screen content images when compared with the classical hyper-prior end-to-end coding method.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Fully Integerized End-to-End Learned Image Compression
    Fang, Yimian
    Fei, Wen
    Li, Shaohui
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
  • [22] Learning End-to-End Lossy Image Compression: A Benchmark
    Hu, Yueyu
    Yang, Wenhan
    Ma, Zhan
    Liu, Jiaying
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4194 - 4211
  • [23] Transformer Model Compression for End-to-End Speech Recognition on Mobile Devices
    Ben Letaifa, Leila
    Rouas, Jean-Luc
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 439 - 443
  • [24] End-to-end image compression method based on perception metric
    Shuai Liu
    Yingcong Huang
    Huoxiang Yang
    Yongsheng Liang
    Wei Liu
    Signal, Image and Video Processing, 2022, 16 : 1803 - 1810
  • [25] End-to-end system consideration of the Galileo image compression system
    Cheung, K
    Tong, K
    Belongie, M
    IGARSS '96 - 1996 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM: REMOTE SENSING FOR A SUSTAINABLE FUTURE, VOLS I - IV, 1996, : 1035 - 1038
  • [26] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [27] End-to-end optimized image compression with competition of prior distributions
    Brummer, Benoit
    De Vleeschouwer, Christophe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
  • [28] End-to-End Learned Image Compression with Augmented Normalizing Flows
    Ho, Yung-Han
    Chan, Chih-Chun
    Peng, Wen-Hsiao
    Hang, Hsueh-Ming
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
  • [29] An end-to-end spike-based image compression architecture
    Doutsi, Effrosyni
    Antonini, Marc
    Tsakalides, Panagiotis
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
  • [30] End-to-end image compression method based on perception metric
    Liu, Shuai
    Huang, Yingcong
    Yang, Huoxiang
    Liang, Yongsheng
    Liu, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810