End-to-end Image Compression with Swin-Transformer

被引：1

作者：

Wang, Meng ^{[1
]}

Zhang, Kai ^{[2
]}

Zhang, Li ^{[2
]}

Li, Yue ^{[2
]}

Li, Junru ^{[3
]}

Wang, Yue ^{[3
]}

Wang, Shiqi ^{[1
]}

机构：

[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[2] Bytedance Inc, San Diego, CA 92122 USA

[3] Beijing Bytedance Technol Co Ltd, Beijing, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2022年

基金：

中国国家自然科学基金;

关键词：

Image compression; end-to-end compression; transformer; convolution;

D O I：

10.1109/VCIP56404.2022.10008895

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose an end-to-end image compression framework, which cooperates with the swin-transformer modules to capture the localized and non-localized similarities in image compression. In particular, the swin-transformer modules are deployed in the analysis and synthesis stages, interleaving with convolution layers. The transformer layers are expected to perceive more flexible receptive fields, such that the spatially localized and non-localized redundancies could be more effectively eliminated. The proposed method reveals the excellent capability of signal conjunction and prediction, leading to the improvement of the rate and distortion performance. Experimental results show that the proposed method is superior to the existing methods on both natural scene and screen content images, where 22.46% BD-Rate savings are achieved when compared with the BPG. Over 30% BD-Rate gains could be observed with screen content images when compared with the classical hyper-prior end-to-end coding method.

引用

页数：5

共 50 条

[21] Fully Integerized End-to-End Learned Image Compression
Fang, Yimian
Fei, Wen
Li, Shaohui
Dai, Wenrui
Li, Chenglin
Zou, Junni
Xiong, Hongkai
2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
[22] Learning End-to-End Lossy Image Compression: A Benchmark
Hu, Yueyu
Yang, Wenhan
Ma, Zhan
Liu, Jiaying
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4194 - 4211
[23] Transformer Model Compression for End-to-End Speech Recognition on Mobile Devices
Ben Letaifa, Leila
Rouas, Jean-Luc
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 439 - 443
[24] End-to-end image compression method based on perception metric
Shuai Liu
Yingcong Huang
Huoxiang Yang
Yongsheng Liang
Wei Liu
Signal, Image and Video Processing, 2022, 16 : 1803 - 1810
[25] End-to-end system consideration of the Galileo image compression system
Cheung, K
Tong, K
Belongie, M
IGARSS '96 - 1996 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM: REMOTE SENSING FOR A SUSTAINABLE FUTURE, VOLS I - IV, 1996, : 1035 - 1038
[26] End-to-End Learning-Based Image Compression: A Review
Chen Jimin
Lin Zehao
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[27] End-to-end optimized image compression with competition of prior distributions
Brummer, Benoit
De Vleeschouwer, Christophe
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
[28] End-to-End Learned Image Compression with Augmented Normalizing Flows
Ho, Yung-Han
Chan, Chih-Chun
Peng, Wen-Hsiao
Hang, Hsueh-Ming
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
[29] An end-to-end spike-based image compression architecture
Doutsi, Effrosyni
Antonini, Marc
Tsakalides, Panagiotis
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
[30] End-to-end image compression method based on perception metric
Liu, Shuai
Huang, Yingcong
Yang, Huoxiang
Liang, Yongsheng
Liu, Wei
SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810

← 1 2 3 4 5 →