Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

被引:0
|
作者
Shan, Shawn [1 ]
Ding, Wenxin [1 ]
Passananti, Josephine [1 ]
Wu, Stanley [1 ]
Zheng, Haitao [1 ]
Zhao, Ben Y. [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
D O I
10.1109/SP54263.2024.00207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trained on billions of images, diffusion-based text-to-image models seem impervious to traditional data poisoning attacks, which typically require poison samples approaching 20% of the training set. In this paper, we show that state-of-the-art text-to-image generative models are in fact highly vulnerable to poisoning attacks. Our work is driven by two key insights. First, while diffusion models are trained on billions of samples, the number of training samples associated with a specific concept or prompt is generally on the order of thousands. This suggests that these models will be vulnerable to prompt-specific poisoning attacks that corrupt a model's ability to respond to specific targeted prompts. Second, poison samples can be carefully crafted to maximize poison potency to ensure success with very few samples. We introduce Nightshade, a prompt-specific poisoning attack optimized for potency that can completely control the output of a prompt in Stable Diffusion's newest model (SDXL) with less than 100 poisoned training samples. Nightshade also generates stealthy poison images that look visually identical to their benign counterparts, and produces poison effects that "bleed through" to related concepts. More importantly, a moderate number of Nightshade attacks on independent prompts can destabilize a model and disable its ability to generate images for any and all prompts. Finally, we propose the use of Nightshade and similar tools as a defense for content owners against web scrapers that ignore opt-out/do-not-crawl directives, and discuss potential implications for both model trainers and content owners.
引用
收藏
页码:807 / 825
页数:19
相关论文
共 50 条
  • [21] Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
    Lee, Jaewoong
    Jang, Sangwon
    Jo, Jaehyeong
    Yoon, Jaehong
    Kim, Yunji
    Kim, Jin-Hwa
    Ha, Jung-Woo
    Hwang, Sung Ju
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23195 - 23205
  • [22] Examining the Text-to-Image Community of Practice: Why and How do People Prompt Generative AIs?
    Sanchez, Teo
    2023 PROCEEDINGS OF THE 15TH CONFERENCE ON CREATIVITY AND COGNITION, C&C 2023, 2023, : 43 - 61
  • [23] PROMPTIST: Automated Prompt Optimization for Text-to-Image Synthesis
    Li, WeiJie
    Wane, Jin
    Zhang, Xuejie
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 295 - 306
  • [24] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
    D'Inca, Moreno
    Peruzzo, Elia
    Mancini, Massimiliano
    Xu, Dejia
    Goel, Vidit
    Xu, Xinggian
    Wang, Zhangyang
    Shi, Humphrey
    Sebe, Nicu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 12225 - 12235
  • [25] Using artificial intelligence in craft education: crafting with text-to-image generative models
    Vartiainen, Henriikka
    Tedre, Matti
    DIGITAL CREATIVITY, 2023, 34 (01) : 1 - 21
  • [26] FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models
    Zhao, Lin
    Zhao, Tianchen
    Lin, Zinan
    Ning, Xuefei
    Dai, Guohao
    Yang, Huazhong
    Wan, Yu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16122 - 16131
  • [27] PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation
    Feng, Yingchaojie
    Wang, Xingbo
    Wong, Kam Kwai
    Wang, Sijia
    Lu, Yuhong
    Zhu, Minfeng
    Wang, Baicheng
    Chen, Wei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 295 - 305
  • [28] Generative adversarial text-to-image generation with style image constraint
    Zekang Wang
    Li Liu
    Huaxiang Zhang
    Dongmei Liu
    Yu Song
    Multimedia Systems, 2023, 29 : 3291 - 3303
  • [29] Generative adversarial text-to-image generation with style image constraint
    Wang, Zekang
    Liu, Li
    Zhang, Huaxiang
    Liu, Dongmei
    Song, Yu
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3291 - 3303
  • [30] Semantic Object Accuracy for Generative Text-to-Image Synthesis
    Hinz, Tobias
    Heinrich, Stefan
    Wermter, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1552 - 1565