Controllable image synthesis methods, applications and challenges: a comprehensive survey

被引:0
|
作者
Huang, Shanshan [1 ]
Li, Qingsong [1 ]
Liao, Jun [1 ]
Wang, Shu [3 ]
Liu, Li [1 ]
Li, Lian [2 ]
机构
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400000, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci Informat Engn, Hefei 230601, Peoples R China
[3] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Controllable image synthesis; Deep generative model; Causal learning; GAN inversion; Interpretable representation learning; Artificial intelligence-generated content; ADVERSARIAL NETWORKS; GAN INVERSION; TRANSLATION; GENERATION;
D O I
10.1007/s10462-024-10987-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Controllable Image Synthesis (CIS) is a methodology that allows users to generate desired images or manipulate specific attributes of images by providing precise input conditions or modifying latent representations. In recent years, CIS has attracted considerable attention in the field of image processing, with significant advances in consistency, controllability and harmony. However, several challenges still remain, particularly regarding the fine-grained controllability and interpretability of synthesized images. In this paper, we comprehensively and systematically review the CIS from problem definition, taxonomy and evaluation systems to existing challenges and future research directions. First, the definition of CIS is given, and several representative deep generative models are introduced in detail. Second, the existing CIS methods are divided into three categories according to the different control manners used and discuss the typical work in each category critically. Furthermore, we introduce the public datasets and evaluation metrics commonly used in image synthesis and analyze the representative CIS methods. Finally, we present several open issues and discuss the future research direction of CIS.
引用
收藏
页数:46
相关论文
共 50 条
  • [11] A survey on image and video cosegmentation: Methods, challenges and analyses
    Ren, Yan
    Kong, Adams Wai Kin
    Jiao, Licheng
    PATTERN RECOGNITION, 2020, 103 (103)
  • [12] A Comprehensive Survey of Isocontouring Methods: Applications, Limitations and Perspectives
    Buescher, Keno Jann
    Degel, Jan Philipp
    Oellerich, Jan
    ALGORITHMS, 2024, 17 (02)
  • [13] A Comprehensive Survey on the Process, Methods, Evaluation, and Challenges of Feature Selection
    Islam, Md Rashedul
    Lima, Aklima Akter
    Das, Sujoy Chandra
    Mridha, M. F.
    Prodeep, Akibur Rahman
    Watanobe, Yutaka
    IEEE ACCESS, 2022, 10 : 99595 - 99632
  • [14] A comprehensive survey of digital twins: Applications, technologies and security challenges
    Jeremiah, Sekione Reward
    El Azzaoui, Abir
    Xiong, Neal N.
    Park, Jong Hyuk
    JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 151
  • [15] Prescribed performance control approaches, applications and challenges: A comprehensive survey
    Bu, Xiangwei
    ASIAN JOURNAL OF CONTROL, 2023, 25 (01) : 241 - 261
  • [16] A comprehensive survey on Arabic text augmentation: approaches, challenges, and applications
    Ahmed Adel ElSabagh
    Shahira Shaaban Azab
    Hesham Ahmed Hefny
    Neural Computing and Applications, 2025, 37 (10) : 7015 - 7048
  • [17] A survey on neural topic models: methods, applications, and challenges
    Xiaobao Wu
    Thong Nguyen
    Anh Tuan Luu
    Artificial Intelligence Review, 57
  • [18] A survey on neural topic models: methods, applications, and challenges
    Wu, Xiaobao
    Nguyen, Thong
    Luu, Anh Tuan
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (02)
  • [19] Using eDNA to survey amphibians: Methods, applications, and challenges
    Sun, Xiaoxuan
    Guo, Ningning
    Gao, Jianan
    Xiao, Nengwen
    BIOTECHNOLOGY AND BIOENGINEERING, 2024, 121 (02) : 456 - 471
  • [20] A Comprehensive Survey of Optical Remote Sensing Image Segmentation Methods
    Wang, Yongzhi
    Lv, Hua
    Deng, Rui
    Zhuang, Shengbing
    CANADIAN JOURNAL OF REMOTE SENSING, 2020, 46 (05) : 501 - 531