Comprehensive exploration of diffusion models in image generation: a survey

被引:0
|
作者
Chen, Hang [1 ]
Xiang, Qian [2 ,3 ,4 ]
Hu, Jiaxin [1 ]
Ye, Meilin [1 ]
Yu, Chao [1 ]
Cheng, Hao [1 ]
Zhang, Lei [1 ]
机构
[1] Hubei Polytech Univ, Sch Elect & Elect Informat Engn, Huangshi 435003, Peoples R China
[2] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China
[3] Gongqing Inst Sci & Technol, Jiujiang 332020, Peoples R China
[4] Wuhan Nanhua Ind Equipments Engn CO LTD, Wuhan 430200, Peoples R China
基金
中国国家自然科学基金;
关键词
Image generation; Diffusion models; Generative models; Data privacy; Data security; FAKE IMAGES; TEXT; SUPERRESOLUTION;
D O I
10.1007/s10462-025-11110-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid development of deep learning technology has led to the emergence of diffusion models as a promising generative model with diverse applications. These include image generation, audio and video synthesis, molecular design, and text generation. The distinctive generation mechanism and exceptional generation quality of diffusion models have made them a valuable tool in these diverse fields. However, with the extensive deployment of diffusion models in the domain of image generation, concerns pertaining to data privacy, data security, and artistic ethics have emerged with increasing prominence. Given the accelerated pace of development in the field of diffusion models, the majority of extant surveys are deficient in two respects: firstly, they fail to encompass the latest advances in diffusion-based image synthesis; and secondly, they seldom consider the potential social implications of diffusion models. In order to address these issues, this paper presents a comprehensive survey of the most recent applications of diffusion models in the field of image generation. Furthermore, it provides an in-depth analysis of the potential social impacts that may result from their use. Firstly, this paper presents a systematic survey of the background principles and theoretical foundations of diffusion models. Subsequently, this paper provides a detailed examination of the most recent applications of diffusion models across a range of image generation subfields, including style transfer, image completion, image editing, super-resolution, and beyond. Finally, we present a comprehensive examination of these social issues, addressing data privacy concerns, such as the potential for data leakage and the implementation of protective measures during model training. We also analyse the risk of malicious exploitation of the model and the defensive strategies employed to mitigate such risks. Additionally, we examine the implications of the authenticity and originality of generated images on artistic creativity and copyright protection.
引用
收藏
页数:49
相关论文
共 50 条
  • [41] Denoising diffusion probabilistic models for 3D medical image generation
    Firas Khader
    Gustav Müller-Franzes
    Soroosh Tayebi Arasteh
    Tianyu Han
    Christoph Haarburger
    Maximilian Schulze-Hagen
    Philipp Schad
    Sandy Engelhardt
    Bettina Baeßler
    Sebastian Foersch
    Johannes Stegmaier
    Christiane Kuhl
    Sven Nebelung
    Jakob Nikolas Kather
    Daniel Truhn
    Scientific Reports, 13 (1)
  • [42] CartoonDiff: Training-free Cartoon Image Generation with Diffusion Transformer Models
    He, Feihong
    Li, Gang
    Si, Lingyu
    Yan, Leilei
    Hou, Shimeng
    Dong, Hongwei
    Li, Fanzhang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3825 - 3829
  • [43] Comparative Review of Text-to-Image Generation Techniques Based on Diffusion Models
    Gao, Xinyu
    Du, Fang
    Song, Lijuan
    Computer Engineering and Applications, 2024, 60 (24) : 44 - 64
  • [44] MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
    Zhao, Jing
    Zheng, Heliang
    Wang, Chaoyue
    Lan, Long
    Yang, Wenjing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22535 - 22545
  • [45] A Survey on Generative Diffusion Models
    Cao, Hanqun
    Tan, Cheng
    Gao, Zhangyang
    Xu, Yilun
    Chen, Guangyong
    Heng, Pheng-Ann
    Li, Stan Z.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2814 - 2830
  • [46] Diffusion Models in Vision: A Survey
    Croitoru, Florinel-Alin
    Hondru, Vlad
    Ionescu, Radu Tudor
    Shah, Mubarak
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 10850 - 10869
  • [47] A Survey on Video Diffusion Models
    Xing, Zhen
    Feng, Qijun
    Chen, Haoran
    Dai, Qi
    Hu, Hang
    Xu, Hang
    Wu, Zuxuan
    Jiang, Yu-gang
    ACM COMPUTING SURVEYS, 2025, 57 (02)
  • [48] Echo from Noise: Synthetic Ultrasound Image Generation Using Diffusion Models for Real Image Segmentation
    Stojanovski, David
    Hermida, Uxio
    Lamata, Pablo
    Beqiri, Arian
    Gomez, Alberto
    SIMPLIFYING MEDICAL ULTRASOUND, ASMUS 2023, 2023, 14337 : 34 - 43
  • [49] Lightweight diffusion models: a survey
    Song, Wei
    Ma, Wen
    Zhang, Ming
    Zhang, Yanghao
    Zhao, Xiaobing
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)
  • [50] Hyperspectral image segmentation: a comprehensive survey
    Grewal, Reaya
    Kasana, Singara Singh
    Kasana, Geeta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) : 20819 - 20872