Controllable image synthesis methods, applications and challenges: a comprehensive survey

被引:0
|
作者
Huang, Shanshan [1 ]
Li, Qingsong [1 ]
Liao, Jun [1 ]
Wang, Shu [3 ]
Liu, Li [1 ]
Li, Lian [2 ]
机构
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400000, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci Informat Engn, Hefei 230601, Peoples R China
[3] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Controllable image synthesis; Deep generative model; Causal learning; GAN inversion; Interpretable representation learning; Artificial intelligence-generated content; ADVERSARIAL NETWORKS; GAN INVERSION; TRANSLATION; GENERATION;
D O I
10.1007/s10462-024-10987-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Controllable Image Synthesis (CIS) is a methodology that allows users to generate desired images or manipulate specific attributes of images by providing precise input conditions or modifying latent representations. In recent years, CIS has attracted considerable attention in the field of image processing, with significant advances in consistency, controllability and harmony. However, several challenges still remain, particularly regarding the fine-grained controllability and interpretability of synthesized images. In this paper, we comprehensively and systematically review the CIS from problem definition, taxonomy and evaluation systems to existing challenges and future research directions. First, the definition of CIS is given, and several representative deep generative models are introduced in detail. Second, the existing CIS methods are divided into three categories according to the different control manners used and discuss the typical work in each category critically. Furthermore, we introduce the public datasets and evaluation metrics commonly used in image synthesis and analyze the representative CIS methods. Finally, we present several open issues and discuss the future research direction of CIS.
引用
收藏
页数:46
相关论文
共 50 条
  • [41] A comprehensive survey of image and video forgery techniques: variants, challenges, and future directions
    Syed Tufael Nabi
    Munish Kumar
    Paramjeet Singh
    Naveen Aggarwal
    Krishan Kumar
    Multimedia Systems, 2022, 28 : 939 - 992
  • [42] A Survey of Image Synthesis Methods for Visual Machine Learning
    Tsirikoglou, A.
    Eilertsen, G.
    Unger, J.
    COMPUTER GRAPHICS FORUM, 2020, 39 (06) : 426 - 451
  • [43] Survey on Adversarial Attack and Defense for Medical Image Analysis: Methods and Challenges
    Dong, Junhao
    Chen, Junxi
    Xie, Xiaohua
    Lai, Jianhuang
    Chen, Hao
    ACM COMPUTING SURVEYS, 2025, 57 (03)
  • [44] A comprehensive survey on community detection methods and applications in complex information networks
    Diboune, Abdelhani
    Slimani, Hachem
    Nacer, Hassina
    Bey, Kadda Beghdad
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [45] A Survey on Multimodal Deep Learning for Image Synthesis Applications, methods, datasets, evaluation metrics, and results comparison
    Luo, Sanbi
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 108 - 120
  • [46] A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
    Song, Yisheng
    Wang, Ting
    Cai, Puyu
    Mondal, Subrota K.
    Sahoo, Jyoti Prakash
    ACM COMPUTING SURVEYS, 2023, 55 (13S)
  • [47] 6G: A comprehensive survey on technologies, applications, challenges, and research problems
    Mahmoud, Haitham Hassan H.
    Amer, Amira A.
    Ismail, Tawfik
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (04)
  • [48] Evolutionary computation for feature selection in classification: A comprehensive survey of solutions, applications and challenges
    Song, Xianfang
    Zhang, Yong
    Zhang, Wanqiu
    He, Chunlin
    Hu, Ying
    Wang, Jian
    Gong, Dunwei
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 90
  • [49] A Comprehensive Survey on Deep Facial Expression Recognition: Challenges, Applications, and Future Guidelines
    Sajjad, Muhammad
    Ullah, Fath U. Min
    Ullah, Mohib
    Christodoulou, Georgia
    Cheikh, Faouzi Alaya
    Hijji, Mohammad
    Muhammad, Khan
    Rodrigues, Joel J. P. C.
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 68 : 817 - 840
  • [50] A Comprehensive Survey on Vehicular Networking: Communications, Applications, Challenges, and Upcoming Research Directions
    Hussein, Nehad Hameed
    Yaw, Chong Tak
    Koh, Siaw Paw
    Tiong, Sieh Kiong
    Chong, Kok Hen
    IEEE ACCESS, 2022, 10 : 86127 - 86180