Statistics Enhancement Generative Adversarial Networks for Diverse Conditional Image Synthesis

被引:1
|
作者
Zuo, Zhiwen [1 ]
Li, Ailin [2 ]
Wang, Zhizhong [2 ]
Zhao, Lei [2 ]
Dong, Jianfeng [1 ]
Wang, Xun [1 ]
Wang, Meng [3 ]
机构
[1] Zhejiang Gongshang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
基金
中国国家自然科学基金;
关键词
Codes; Image synthesis; Task analysis; Mutual information; Random variables; Generators; Generative adversarial networks; Diverse conditional image synthesis; generative adversarial network; mode collapse; mutual information;
D O I
10.1109/TCSVT.2023.3348471
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Conditional generative adversarial networks (cGANs) aim to synthesize diverse images given the input conditions and the latent codes, but they are prone to map an input to a single output regardless of the variations in latent code, which is also well known as the mode collapse problem of cGANs. To alleviate the problem, in this paper, we investigate explicitly enhancing the statistical dependency between the latent code and the synthesized image in cGANs by utilizing mutual information neural estimators to estimate and maximize the conditional mutual information (CMI) between them given the input condition. The method provides a new perspective from information theory to improve diversity for cGANs and can facilitate many existing conditional image synthesis frameworks with a simple neural estimator extension. Moreover, our studies show that several key designs, including the neural estimator choice, the neural estimator's network design, and the sampling strategy, are crucial to the success of the method. Extensive experiments on four popular conditional image synthesis tasks, including class-conditioned image generation, paired and unpaired image-to-image translation, and text-to-image generation, demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页码:6167 / 6180
页数:14
相关论文
共 50 条
  • [41] FittingGAN: Fitting image Generation Based on Conditional Generative Adversarial Networks
    Li, Yanhua
    Wang, Jianping
    Zhang, Xiaomei
    Cao, Yangjie
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 741 - 745
  • [42] Underwater Image Enhancement Using Stacked Generative Adversarial Networks
    Ye, Xinchen
    Xu, Hongcan
    Ji, Xiang
    Xu, Rui
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 514 - 524
  • [43] Generative Adversarial Networks for Retinal Image Enhancement with Pathological Information
    Pham, Quang T. M.
    Shin, Jitae
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [44] Resolution-Preserving Generative Adversarial Networks for Image Enhancement
    Lee, Donghyeon
    Lee, Sangheon
    Lee, Hoseong
    Lee, Kyujoong
    Lee, Hyuk-Jae
    IEEE ACCESS, 2019, 7 : 110344 - 110357
  • [45] The Defense of Adversarial Example with Conditional Generative Adversarial Networks
    Yu, Fangchao
    Wang, Li
    Fang, Xianjin
    Zhang, Youwen
    SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
  • [46] Traffic Sign Image Synthesis with Generative Adversarial Networks
    Luo, Hengliang
    Kong, Qingqun
    Wu, Fuchao
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2540 - 2545
  • [47] TEXT TO IMAGE SYNTHESIS WITH ERUDITE GENERATIVE ADVERSARIAL NETWORKS
    Zhang, Zhiqiang
    Yu, Wenxin
    Jiang, Ning
    Zhou, Jinjia
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2438 - 2442
  • [48] A Survey of Image Synthesis and Editing with Generative Adversarial Networks
    Xian Wu
    Kun Xu
    Peter Hall
    Tsinghua Science and Technology, 2017, 22 (06) : 660 - 674
  • [49] A Survey of Image Synthesis and Editing with Generative Adversarial Networks
    Wu, Xian
    Xu, Kun
    Hall, Peter
    TSINGHUA SCIENCE AND TECHNOLOGY, 2017, 22 (06) : 660 - 674
  • [50] Image Denoising With Generative Adversarial Networks and its Application to Cell Image Enhancement
    Chen, Songkui
    Shi, Daming
    Sadiq, Muhammad
    Cheng, Xiaochun
    IEEE ACCESS, 2020, 8 : 82819 - 82831