Controllable image synthesis methods, applications and challenges: a comprehensive survey

Cited by: 0
Authors
Huang, Shanshan [1]
Li, Qingsong [1]
Liao, Jun [1]
Wang, Shu [3]
Liu, Li [1]
Li, Lian [2]
Affiliations
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400000, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci Informat Engn, Hefei 230601, Peoples R China
[3] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Controllable image synthesis; Deep generative model; Causal learning; GAN inversion; Interpretable representation learning; Artificial intelligence-generated content; Adversarial networks; Translation; Generation
DOI
10.1007/s10462-024-10987-w
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Controllable Image Synthesis (CIS) is a methodology that allows users to generate desired images or manipulate specific image attributes by providing precise input conditions or modifying latent representations. In recent years, CIS has attracted considerable attention in the field of image processing, with significant advances in consistency, controllability and harmony. However, several challenges remain, particularly regarding the fine-grained controllability and interpretability of synthesized images. In this paper, we comprehensively and systematically review CIS, from problem definition, taxonomy and evaluation systems to existing challenges and future research directions. First, we define CIS and introduce several representative deep generative models in detail. Second, we divide existing CIS methods into three categories according to the control manner used and critically discuss the typical work in each category. Furthermore, we introduce the public datasets and evaluation metrics commonly used in image synthesis and analyze the representative CIS methods. Finally, we present several open issues and discuss future research directions for CIS.
Pages: 46