Controllable image synthesis methods, applications and challenges: a comprehensive survey

Cited by: 0
Authors
Huang, Shanshan [1]
Li, Qingsong [1]
Liao, Jun [1]
Wang, Shu [3]
Liu, Li [1]
Li, Lian [2]
Affiliations
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400000, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci Informat Engn, Hefei 230601, Peoples R China
[3] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Controllable image synthesis; Deep generative model; Causal learning; GAN inversion; Interpretable representation learning; Artificial intelligence-generated content; ADVERSARIAL NETWORKS; GAN INVERSION; TRANSLATION; GENERATION;
DOI
10.1007/s10462-024-10987-w
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Controllable Image Synthesis (CIS) is a methodology that allows users to generate desired images or manipulate specific attributes of images by providing precise input conditions or modifying latent representations. In recent years, CIS has attracted considerable attention in the field of image processing, with significant advances in consistency, controllability and harmony. However, several challenges remain, particularly regarding the fine-grained controllability and interpretability of synthesized images. In this paper, we comprehensively and systematically review CIS, from problem definition, taxonomy and evaluation systems to existing challenges and future research directions. First, we define CIS and introduce several representative deep generative models in detail. Second, we divide existing CIS methods into three categories according to the control manner used and critically discuss typical work in each category. Furthermore, we introduce the public datasets and evaluation metrics commonly used in image synthesis and analyze representative CIS methods. Finally, we present several open issues and discuss future research directions for CIS.
Pages: 46
Related papers
50 in total
  • [31] A Comprehensive Survey on Affective Computing: Challenges, Trends, Applications, and Future Directions
    Afzal, Sitara
    Khan, Haseeb Ali
    Piran, Md Jalil
    Lee, Jong Weon
    IEEE ACCESS, 2024, 12 : 96150 - 96168
  • [32] A comprehensive survey on support vector machine classification: Applications, challenges and trends
    Cervantes, Jair
    Garcia-Lamont, Farid
    Rodriguez-Mazahua, Lisbeth
    Lopez, Asdrubal
    NEUROCOMPUTING, 2020, 408 : 189 - 215
  • [34] Underwater image enhancement: a comprehensive review, recent trends, challenges and applications
    Raveendran, Smitha
    Patil, Mukesh D.
    Birajdar, Gajanan K.
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (07) : 5413 - 5467
  • [35] Federated Learning for Predictive Maintenance: A Survey of Methods, Applications, and Challenges
    Purkayastha, Arnab A.
    Aggarwal, Shobhit
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024 : 238 - 242
  • [36] Image synthesis with adversarial networks: A comprehensive survey and case studies
    Shamsolmoali, Pourya
    Zareapoor, Masoumeh
    Granger, Eric
    Zhou, Huiyu
    Wang, Ruili
    Celebi, M. Emre
    Yang, Jie
    INFORMATION FUSION, 2021, 72 : 126 - 146
  • [37] Image Matching in Deep Learning Era: Methods, Applications and Challenges
    Kong, Qing-Qun
    Wu, Fu-Chao
    Fan, Bin
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (07) : 1485 - 1520
  • [38] Conventional to Deep Ensemble Methods for Hyperspectral Image Classification: A Comprehensive Survey
    Ullah, Farhan
    Ullah, Irfan
    Khan, Rehan Ullah
    Khan, Salabat
    Khan, Khalil
    Pau, Giovanni
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3878 - 3916
  • [39] A comprehensive survey to study the utilities of image segmentation methods in clinical routine
    Mohapatra, Rashmita Kumari
    Jolly, Lochan
    Lyngdoh, Dalamchwami Chen
    Mourya, Gajendra Kumar
    Mangalote, Iffa Afsa Changaai
    Alam, Syed Intekhab
    Dakua, Sarada Prasad
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2023, 13 (01)
  • [40] A comprehensive survey of image and video forgery techniques: variants, challenges, and future directions
    Nabi, Syed Tufael
    Kumar, Munish
    Singh, Paramjeet
    Aggarwal, Naveen
    Kumar, Krishan
    MULTIMEDIA SYSTEMS, 2022, 28 (03) : 939 - 992