Controllable image generation based on causal representation learning

Cited by: 2
Authors
Huang, Shanshan [1]
Wang, Yuanhao [1]
Gong, Zhili [1]
Liao, Jun [1]
Wang, Shu [2]
Liu, Li [1]
Affiliations
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[2] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image generation; Controllable image editing; Causal structure learning; Causal representation learning; MODEL;
DOI
10.1631/FITEE.2300303
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Artificial intelligence generated content (AIGC) has emerged as an indispensable tool for producing large-scale content in various forms, such as images, thanks to the significant role that AI plays in imitation and production. However, interpretability and controllability remain challenges. Existing AI methods often struggle to produce images that are both flexible and controllable while respecting the causal relationships within the images. To address this issue, we have developed a novel method for causal controllable image generation (CCIG) that combines causal representation learning with bi-directional generative adversarial networks (GANs). This approach enables humans to control image attributes while considering the rationality and interpretability of the generated images, and it also allows for the generation of counterfactual images. The key to our approach, CCIG, is a causal structure learning module that learns the causal relationships between image attributes and is jointly optimized with the encoder, generator, and joint discriminator in the image generation module. By doing so, we can learn causal representations in the image's latent space and use causal intervention operations to control image generation. We conduct extensive experiments on a real-world dataset, CelebA. The experimental results illustrate the effectiveness of CCIG.
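The abstract does not give the functional form of the causal model, so the following is only a minimal sketch of what a causal intervention on latent attributes can look like, not the CCIG implementation. It assumes a linear structural causal model over three hypothetical attributes, with a made-up weight matrix `A` whose nodes are already in topological order. The function name `propagate` and all numeric values are illustrative assumptions.

```python
import numpy as np

def propagate(A, eps, do=None):
    """Compute latent attributes z under a linear SCM (an assumption here):
    z[j] = sum_i A[i, j] * z[i] + eps[j].

    A must be strictly upper triangular, i.e. nodes are in topological order.
    `do` maps an attribute index to a fixed value. Intervened attributes
    ignore their parents, which is the usual do-operator semantics.
    """
    do = do or {}
    n = len(eps)
    z = np.zeros(n)
    for j in range(n):
        if j in do:
            z[j] = do[j]                       # cut incoming edges, fix the value
        else:
            z[j] = A[:j, j] @ z[:j] + eps[j]   # parents' effect plus exogenous noise
    return z

# Hypothetical causal graph over 3 attributes: 0 -> 1, 0 -> 2, 1 -> 2
A = np.array([[0.0, 0.5, 0.2],
              [0.0, 0.0, 0.8],
              [0.0, 0.0, 0.0]])
eps = np.array([1.0, 0.0, 0.0])  # exogenous noise, e.g. inferred by an encoder

observed = propagate(A, eps)                     # no intervention
counterfactual = propagate(A, eps, do={1: 2.0})  # do(z_1 = 2.0)
```

In this sketch, intervening on attribute 1 changes its descendant (attribute 2) while leaving its parent (attribute 0) untouched. In a pipeline like the one the abstract describes, the resulting latent vector would then be passed to a generator to produce the counterfactual image; that step is omitted here.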
Pages: 135-148
Page count: 14
Related papers
50 items in total
  • [31] Variable-length image compression based on controllable learning network
    Zhao, Dong
    Sun, Jiande
    Chen, Lei
    Wu, Yulin
    Zhou, Hongchao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (13) : 20065 - 20087
  • [33] Controllable Music Playlist Generation Based on Knowledge Graph and Reinforcement Learning
    Sakurai, Keigo
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2022, 22 (10)
  • [34] GaitSCM: Causal representation learning for gait recognition
    Huo, Wei
    Wang, Ke
    Tang, Jun
    Wang, Nian
    Liang, Dong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 243
  • [35] General Identifiability and Achievability for Causal Representation Learning
    Varici, Burak
    Acarturk, Emre
    Shanmugam, Karthikeyan
    Tajer, Ali
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [36] Causal Representation Learning via Counterfactual Intervention
    Li, Xiutian
    Sun, Siqi
    Feng, Rui
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3234 - 3242
  • [37] Attention-based causal representation learning for out-of-distribution recommendation
    Gan, Yuehua
    Wang, Qianqian
    Huang, Zhejun
    Yang, Lili
    APPLIED INTELLIGENCE, 2024, 54 (24) : 12964 - 12978
  • [38] Integrating Image-Based and Knowledge-Based Representation Learning
    Xie, Ruobing
    Heinrich, Stefan
    Liu, Zhiyuan
    Weber, Cornelius
    Yao, Yuan
    Wermter, Stefan
    Sun, Maosong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (02) : 169 - 178
  • [39] Fake Colorized Image Detection Based on Special Image Representation and Transfer Learning
    Salman, Khalid A.
    Shaker, Khalid A.
    Al-Janabi, Sufyan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2023, 22 (03)
  • [40] BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
    Li, Dongxu
    Li, Junnan
    Hoi, Steven C. H.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,