Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

被引:0
|
作者
Shan, Shawn [1 ]
Ding, Wenxin [1 ]
Passananti, Josephine [1 ]
Wu, Stanley [1 ]
Zheng, Haitao [1 ]
Zhao, Ben Y. [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
D O I
10.1109/SP54263.2024.00207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trained on billions of images, diffusion-based text-to-image models seem impervious to traditional data poisoning attacks, which typically require poison samples approaching 20% of the training set. In this paper, we show that state-of-the-art text-to-image generative models are in fact highly vulnerable to poisoning attacks. Our work is driven by two key insights. First, while diffusion models are trained on billions of samples, the number of training samples associated with a specific concept or prompt is generally on the order of thousands. This suggests that these models will be vulnerable to prompt-specific poisoning attacks that corrupt a model's ability to respond to specific targeted prompts. Second, poison samples can be carefully crafted to maximize poison potency to ensure success with very few samples. We introduce Nightshade, a prompt-specific poisoning attack optimized for potency that can completely control the output of a prompt in Stable Diffusion's newest model (SDXL) with less than 100 poisoned training samples. Nightshade also generates stealthy poison images that look visually identical to their benign counterparts, and produces poison effects that "bleed through" to related concepts. More importantly, a moderate number of Nightshade attacks on independent prompts can destabilize a model and disable its ability to generate images for any and all prompts. Finally, we propose the use of Nightshade and similar tools as a defense for content owners against web scrapers that ignore opt-out/do-not-crawl directives, and discuss potential implications for both model trainers and content owners.
引用
收藏
页码:807 / 825
页数:19
相关论文
共 50 条
  • [41] How Text-to-Image Generative AI Is Transforming Mediated Action
    Vartiainen, Henriikka
    Tedre, Matti
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2024, 44 (02) : 12 - 22
  • [42] A survey of generative adversarial networks and their application in text-to-image synthesis
    Zeng, Wu
    Zhu, Heng-liang
    Lin, Chuan
    Xiao, Zheng-ying
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (12): : 7142 - 7181
  • [43] TextControlGAN: Text-to-Image Synthesis with Controllable Generative Adversarial Networks
    Ku, Hyeeun
    Lee, Minhyeok
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [44] Sequential Semantic Generative Communication for Progressive Text-to-Image Generation
    Nam, Hyelin
    Park, Jihong
    Choi, Jinho
    Kim, Seong-Lyun
    2023 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON, 2023,
  • [45] Navigating Text-to-Image Generative Bias Across Indic Languages
    Mittall, Surbhi
    Sudan, Arnav
    Vatsa, Mayank
    Singh, Richa
    Glaser, Tamar
    Hassner, Tal
    COMPUTER VISION - ECCV 2024, PT LXXXVIII, 2025, 15146 : 53 - 67
  • [46] Text-to-Image Synthesis With Generative Models: Methods, Datasets, Performance Metrics, Challenges, and Future Direction
    Alhabeeb, Sarah K.
    Al-Shargabi, Amal A.
    IEEE ACCESS, 2024, 12 : 24412 - 24427
  • [47] Survey About Generative Adversarial Network and Text-to-Image Synthesis
    Lai, Lina
    Mi, Yu
    Zhou, Longlong
    Rao, Jiyong
    Xu, Tianyang
    Song, Xiaoning
    Computer Engineering and Applications, 2023, 59 (19): : 21 - 39
  • [48] Muse: Text-To-Image Generation via Masked Generative Transformers
    Chang, Huiwen
    Zhang, Han
    Barber, Jarred
    Maschinot, A. J.
    Lezama, Jose
    Jiang, Lu
    Yang, Ming-Hsuan
    Murphy, Kevin
    Freeman, William T.
    Rubinstein, Michael
    Li, Yuanzhen
    Krishnan, Dilip
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [49] Decoupling Control in Text-to-Image Diffusion Models
    Cao, Shitong
    Zhang, Xuejie
    Wang, Jin
    Zhou, Xiaobing
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 312 - 322
  • [50] Enhanced Text-to-Image Synthesis Conditional Generative Adversarial Networks
    Tan, Yong Xuan
    Lee, Chin Poo
    Neo, Mai
    Lim, Kian Ming
    Lim, Jit Yan
    IAENG International Journal of Computer Science, 2022, 49 (01) : 1 - 7