Comprehensive exploration of diffusion models in image generation: a survey

Cited by: 0
Authors
Chen, Hang [1]
Xiang, Qian [2, 3, 4]
Hu, Jiaxin [1]
Ye, Meilin [1]
Yu, Chao [1]
Cheng, Hao [1]
Zhang, Lei [1]
Affiliations
[1] Hubei Polytech Univ, Sch Elect & Elect Informat Engn, Huangshi 435003, Peoples R China
[2] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China
[3] Gongqing Inst Sci & Technol, Jiujiang 332020, Peoples R China
[4] Wuhan Nanhua Ind Equipments Engn CO LTD, Wuhan 430200, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image generation; Diffusion models; Generative models; Data privacy; Data security; FAKE IMAGES; TEXT; SUPERRESOLUTION;
DOI
10.1007/s10462-025-11110-3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The rapid development of deep learning technology has led to the emergence of diffusion models as a promising generative model with diverse applications. These include image generation, audio and video synthesis, molecular design, and text generation. The distinctive generation mechanism and exceptional generation quality of diffusion models have made them a valuable tool in these diverse fields. However, with the extensive deployment of diffusion models in the domain of image generation, concerns pertaining to data privacy, data security, and artistic ethics have emerged with increasing prominence. Given the accelerated pace of development in the field of diffusion models, the majority of extant surveys are deficient in two respects: firstly, they fail to encompass the latest advances in diffusion-based image synthesis; and secondly, they seldom consider the potential social implications of diffusion models. In order to address these issues, this paper presents a comprehensive survey of the most recent applications of diffusion models in the field of image generation. Furthermore, it provides an in-depth analysis of the potential social impacts that may result from their use. Firstly, this paper presents a systematic survey of the background principles and theoretical foundations of diffusion models. Subsequently, this paper provides a detailed examination of the most recent applications of diffusion models across a range of image generation subfields, including style transfer, image completion, image editing, super-resolution, and beyond. Finally, we present a comprehensive examination of these social issues, addressing data privacy concerns, such as the potential for data leakage and the implementation of protective measures during model training. We also analyse the risk of malicious exploitation of the model and the defensive strategies employed to mitigate such risks. Additionally, we examine the implications of the authenticity and originality of generated images on artistic creativity and copyright protection.
Pages: 49
Related papers
50 in total
  • [31] Conditional Image-to-Video Generation with Latent Flow Diffusion Models
    Ni, Haomiao
    Shi, Changhao
    Li, Kai
    Huang, Sharon X.
    Min, Martin Renqiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18444 - 18455
  • [32] DiffMat: Latent diffusion models for image-guided material generation
    Yuan, Liang
    Yan, Dingkun
    Saito, Suguru
    Fujishiro, Issei
    VISUAL INFORMATICS, 2024, 8 (01): 6 - 14
  • [33] Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning
    Miao, Zichen
    Wang, Jiang
    Wang, Ze
    Yang, Zhengyuan
    Wang, Lijuan
    Qiu, Qiang
    Liu, Zicheng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 10844 - 10853
  • [34] Predictive microstructure image generation using denoising diffusion probabilistic models
    Azqadan, Erfan
    Jahed, Hamid
    Arami, Arash
    ACTA MATERIALIA, 2023, 261
  • [35] A Comprehensive Survey of Image Steganography
    Kalaiarasi, G.
    Sudharani, B.
    Jonnalagadda, Sharon Christiana
    Battula, Harsha Vardhan
    Sanagala, Bhavana
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024: 1225 - 1229
  • [36] Bridging the metrics gap in image style transfer: A comprehensive survey of models and criteria
    Zhou, Xiaotong
    Zheng, Yuhui
    Yang, Junming
    NEUROCOMPUTING, 2025, 624
  • [37] Few-shot biomedical image segmentation using diffusion models: Beyond image generation
    Khosravi, Bardia
    Rouzrokh, Pouria
    Mickley, John P.
    Faghani, Shahriar
    Mulford, Kellen
    Yang, Linjun
    Larson, A. Noelle
    Howe, Benjamin M.
    Erickson, Bradley J.
    Taunton, Michael J.
    Wyles, Cody C.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [38] Comprehensive Survey of OLAP Models
    Kaur, Harkiran
    Kaur, Gursimran
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741: 415 - 422
  • [39] Biomedical Image Segmentation Using Denoising Diffusion Probabilistic Models: A Comprehensive Review and Analysis
    Liu, Zengxin
    Ma, Caiwen
    She, Wenji
    Xie, Meilin
    APPLIED SCIENCES-BASEL, 2024, 14 (02)
  • [40] Denoising diffusion probabilistic models for 3D medical image generation
    Khader, Firas
    Mueller-Franzes, Gustav
    Arasteh, Soroosh Tayebi
    Han, Tianyu
    Haarburger, Christoph
    Schulze-Hagen, Maximilian
    Schad, Philipp
    Engelhardt, Sandy
    Baessler, Bettina
    Foersch, Sebastian
    Stegmaier, Johannes
    Kuhl, Christiane
    Nebelung, Sven
    Kather, Jakob Nikolas
    Truhn, Daniel
    SCIENTIFIC REPORTS, 2023, 13 (01)