Swin-GAN: generative adversarial network based on shifted windows transformer architecture for image generation

Cited by: 5
Authors
Wang, Shibin [1]
Gao, Zidiao [1]
Liu, Dong [1]
Affiliation
[1] Henan Normal Univ, Sch Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
Source
VISUAL COMPUTER | 2023 / Vol. 39 / Issue 12
Funding
National Natural Science Foundation of China;
Keywords
GAN; Transformer; Self-attention; Image generation;
DOI
10.1007/s00371-022-02714-9
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Successful generative adversarial networks (GANs) have largely relied on generators and discriminators based on convolutional neural networks (CNNs). However, CNNs struggle to capture long-range dependencies because the convolution operator has a local receptive field, which can cause problems for GANs such as optimization difficulty and the loss of feature resolution and fine details. To address the problem of long-range dependencies, we propose a GAN model based on the shifted windows Transformer architecture, called Swin-GAN, in which the CNN architecture is replaced by a Transformer. In our model, we build a memory-friendly generator based on the shifted window attention mechanism that gradually increases the resolution of the feature maps at each stage. In addition, we build a multi-scale discriminator that splits the image into patches of different sizes as the input at different stages, which balances capturing global contextual semantic information against capturing local detailed features. To further improve fidelity and stability, we use techniques such as data augmentation, layer normalization and relative position encoding in our model. The experimental results show that, compared with current schemes, our scheme has better performance, fewer parameters and lower computational cost. Specifically, the Swin-GAN model has 30.254M parameters and requires 4.086G floating-point operations (FLOPs). On CIFAR-10, it achieves an Inception Score (IS) of 9.04 and a Fréchet Inception Distance (FID) of 9.23.
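The two partitioning ideas named in the abstract, shifted windows and multi-scale patches, can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names, the 4x4 grid, the window size and the patch sizes are all assumptions chosen for illustration, and the attention computation itself (including the masking that shifted-window attention normally applies to wrapped-around cells) is not shown.

```python
# Illustrative sketch only: all names and sizes here are assumptions,
# not taken from the Swin-GAN paper or its code.

def cyclic_shift(grid, shift):
    """Roll a 2-D grid up and left by `shift` cells, wrapping around."""
    h, w = len(grid), len(grid[0])
    return [[grid[(r + shift) % h][(c + shift) % w] for c in range(w)]
            for r in range(h)]

def window_partition(grid, window):
    """Split a 2-D grid into non-overlapping window x window tiles.

    Returns the tiles in row-major order; each tile is a flat list of the
    cells that would attend to one another under window attention.
    """
    h, w = len(grid), len(grid[0])
    if h % window or w % window:
        raise ValueError("grid size must be divisible by window size")
    tiles = []
    for r0 in range(0, h, window):
        for c0 in range(0, w, window):
            tiles.append([grid[r][c]
                          for r in range(r0, r0 + window)
                          for c in range(c0, c0 + window)])
    return tiles

def multi_scale_patches(grid, patch_sizes):
    """Partition the same grid once per patch size (hypothetical helper)."""
    return {p: window_partition(grid, p) for p in patch_sizes}

# A 4x4 "feature map" whose cells are numbered 0..15.
feature_map = [[r * 4 + c for c in range(4)] for r in range(4)]

regular = window_partition(feature_map, 2)                   # plain windows
shifted = window_partition(cyclic_shift(feature_map, 1), 2)  # shifted windows
scales = multi_scale_patches(feature_map, [1, 2, 4])         # coarse to fine
```

In this sketch the first regular window groups cells 0, 1, 4 and 5, while the first shifted window groups cells 5, 6, 9 and 10, which straddle four different regular windows. Alternating the two layouts is what lets information cross window boundaries without computing attention over the whole map.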
Pages: 6085-6095
Number of pages: 11