Statistics Enhancement Generative Adversarial Networks for Diverse Conditional Image Synthesis

Cited by: 1
Authors
Zuo, Zhiwen [1]
Li, Ailin [2]
Wang, Zhizhong [2]
Zhao, Lei [2]
Dong, Jianfeng [1]
Wang, Xun [1]
Wang, Meng [3]
Affiliations
[1] Zhejiang Gongshang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Codes; Image synthesis; Task analysis; Mutual information; Random variables; Generators; Generative adversarial networks; Diverse conditional image synthesis; mode collapse
DOI
10.1109/TCSVT.2023.3348471
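Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809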
Abstract
Conditional generative adversarial networks (cGANs) aim to synthesize diverse images from input conditions and latent codes, but they are prone to mapping an input to a single output regardless of variations in the latent code, a failure known as the mode collapse problem of cGANs. To alleviate this problem, we investigate explicitly enhancing the statistical dependency between the latent code and the synthesized image in cGANs: mutual information neural estimators are used to estimate and maximize the conditional mutual information (CMI) between the two given the input condition. The method offers an information-theoretic perspective on improving diversity in cGANs and can be added to many existing conditional image synthesis frameworks as a simple neural estimator extension. Moreover, our studies show that several key designs, including the choice of neural estimator, the estimator's network design, and the sampling strategy, are crucial to the success of the method. Extensive experiments on four popular conditional image synthesis tasks (class-conditioned image generation, paired and unpaired image-to-image translation, and text-to-image generation) demonstrate the effectiveness and superiority of the proposed method.
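The idea of estimating the CMI between latent code and image can be illustrated with a small sketch. The abstract does not say which neural estimator, network design, or sampling strategy the paper settles on, so everything below is an assumption made for illustration: the Donsker-Varadhan lower bound (the bound used by MINE-style estimators) stands in for the estimator, a one-dimensional toy "generator" x = c + z + noise stands in for a cGAN, the hypothetical `critic` is a fixed closed-form function rather than a trained network, and "negative" samples are formed by shuffling latent codes across the batch while keeping each condition paired with its output.

```python
import math
import random


def dv_lower_bound(joint_scores, marginal_scores):
    """Donsker-Varadhan bound: mean(T) on joint samples minus
    log(mean(exp(T))) on samples where z is independent of x given c."""
    shift = max(marginal_scores)  # subtract the max for a stable log-mean-exp
    mean_exp = sum(math.exp(t - shift) for t in marginal_scores) / len(marginal_scores)
    return sum(joint_scores) / len(joint_scores) - (shift + math.log(mean_exp))


def critic(c, z, x, noise_std=0.5):
    """Hypothetical fixed critic: the exact log density ratio
    log p(x | z, c) - log p(x | c) for the toy generator x = c + z + noise.
    A neural estimator would learn such a function instead."""
    u = x - c
    var = noise_std ** 2
    return (0.5 * math.log((1 + var) / var)
            - (u - z) ** 2 / (2 * var)
            + u ** 2 / (2 * (1 + var)))


def toy_cmi_estimates(seed=0, n=20000, noise_std=0.5):
    """Return (estimate for a generator that uses z, estimate for one that ignores z)."""
    rng = random.Random(seed)
    c = [rng.uniform(-1.0, 1.0) for _ in range(n)]   # input conditions
    z = [rng.gauss(0.0, 1.0) for _ in range(n)]      # latent codes
    eps = [rng.gauss(0.0, 1.0) for _ in range(n)]    # output noise
    # Assumed sampling strategy: shuffle z across the batch so it no longer
    # matches the (c, x) pair it produced. Valid here because z is drawn
    # independently of c.
    z_shuffled = z[:]
    rng.shuffle(z_shuffled)

    estimates = []
    for uses_z in (True, False):
        x = [ci + (zi if uses_z else 0.0) + noise_std * ei
             for ci, zi, ei in zip(c, z, eps)]
        joint = [critic(ci, zi, xi, noise_std) for ci, zi, xi in zip(c, z, x)]
        marginal = [critic(ci, zi, xi, noise_std) for ci, zi, xi in zip(c, z_shuffled, x)]
        estimates.append(dv_lower_bound(joint, marginal))
    return tuple(estimates)


if __name__ == "__main__":
    diverse, collapsed = toy_cmi_estimates()
    print(f"generator uses z:    {diverse:.3f}")
    print(f"generator ignores z: {collapsed:.3f}")
```

In this toy setting the true CMI for the generator that uses z is 0.5 * log(1 + 1 / noise_std^2), about 0.80 nats for noise_std = 0.5, while the generator that ignores z has zero CMI and the bound turns negative. In a cGAN the critic would be trained to tighten the bound and the generator would be updated to increase it; that training loop is omitted here.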
Pages: 6167-6180
Number of pages: 14