Statistics Enhancement Generative Adversarial Networks for Diverse Conditional Image Synthesis

被引:1
|
作者
Zuo, Zhiwen [1 ]
Li, Ailin [2 ]
Wang, Zhizhong [2 ]
Zhao, Lei [2 ]
Dong, Jianfeng [1 ]
Wang, Xun [1 ]
Wang, Meng [3 ]
机构
[1] Zhejiang Gongshang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
基金
中国国家自然科学基金;
关键词
Codes; Image synthesis; Task analysis; Mutual information; Random variables; Generators; Generative adversarial networks; Diverse conditional image synthesis; generative adversarial network; mode collapse; mutual information;
D O I
10.1109/TCSVT.2023.3348471
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Conditional generative adversarial networks (cGANs) aim to synthesize diverse images given the input conditions and the latent codes, but they are prone to map an input to a single output regardless of the variations in latent code, which is also well known as the mode collapse problem of cGANs. To alleviate the problem, in this paper, we investigate explicitly enhancing the statistical dependency between the latent code and the synthesized image in cGANs by utilizing mutual information neural estimators to estimate and maximize the conditional mutual information (CMI) between them given the input condition. The method provides a new perspective from information theory to improve diversity for cGANs and can facilitate many existing conditional image synthesis frameworks with a simple neural estimator extension. Moreover, our studies show that several key designs, including the neural estimator choice, the neural estimator's network design, and the sampling strategy, are crucial to the success of the method. Extensive experiments on four popular conditional image synthesis tasks, including class-conditioned image generation, paired and unpaired image-to-image translation, and text-to-image generation, demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页码:6167 / 6180
页数:14
相关论文
共 50 条
  • [21] A Survey of Image Translation Based on Conditional Generative Adversarial Networks
    Tu H.
    Wang W.
    Chen J.
    Li G.
    Wu F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (01): : 14 - 32
  • [22] Deep generative adversarial networks for infrared image enhancement
    Guei, Axel-Christian
    Akhloufi, Moulay A.
    THERMOSENSE: THERMAL INFRARED APPLICATIONS XL, 2018, 10661
  • [23] Underwater Attentional Generative Adversarial Networks for Image Enhancement
    Wang, Ning
    Chen, Tingkai
    Kong, Xiangjun
    Chen, Yanzheng
    Wang, Rongfeng
    Gong, Yongjun
    Song, Shiji
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (03) : 490 - 500
  • [24] Photoacoustic image synthesis with generative adversarial networks
    Schellenberg, Melanie
    Groehl, Janek
    Dreher, Kris K.
    Noelke, Jan-Hinrich
    Holzwarth, Niklas
    Tizabi, Minu D.
    Seitel, Alexander
    Maier-Hein, Lena
    PHOTOACOUSTICS, 2022, 28
  • [25] StegGAN: hiding image within image using conditional generative adversarial networks
    Singh, Brijesh
    Sharma, Prasen Kumar
    Huddedar, Shashank Anil
    Sur, Arijit
    Mitra, Pinaki
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 40511 - 40533
  • [26] StegGAN: hiding image within image using conditional generative adversarial networks
    Brijesh Singh
    Prasen Kumar Sharma
    Shashank Anil Huddedar
    Arijit Sur
    Pinaki Mitra
    Multimedia Tools and Applications, 2022, 81 : 40511 - 40533
  • [27] Enhanced dataset synthesis using conditional generative adversarial networks
    Mert, Ahmet
    BIOMEDICAL ENGINEERING LETTERS, 2023, 13 (01) : 41 - 48
  • [28] Enhanced dataset synthesis using conditional generative adversarial networks
    Ahmet Mert
    Biomedical Engineering Letters, 2023, 13 : 41 - 48
  • [29] Conditional Generative Adversarial Capsule Networks
    Kong R.
    Huang G.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (01): : 94 - 107
  • [30] Conditional Generative Adversarial Networks for Data Augmentation of a Neonatal Image Dataset
    Lyra, Simon
    Mustafa, Arian
    Rixen, Joeran
    Borik, Stefan
    Lueken, Markus
    Leonhardt, Steffen
    SENSORS, 2023, 23 (02)