RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation with Natural Prompts

Cited: 9
Authors
Liu, Han [1 ]
Wu, Yuhao [1 ]
Zhai, Shixuan [1 ]
Yuan, Bo [2 ]
Zhang, Ning [1 ]
Affiliations
[1] Washington Univ, St Louis, MO 63110 USA
[2] Rutgers State Univ, Piscataway, NJ USA
Keywords
DOI
10.1109/CVPR52729.2023.01972
CLC number
TP18 [Artificial Intelligence Theory];
Discipline codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The field of text-to-image generation has made remarkable strides in creating high-fidelity and photorealistic images. As this technology gains popularity, there is growing concern about its potential security risks. However, there has been limited exploration into the robustness of these models from an adversarial perspective. Existing research has primarily focused on untargeted settings and lacks holistic consideration of reliability (attack success rate) and stealthiness (imperceptibility). In this paper, we propose RIATIG, a reliable and imperceptible adversarial attack against text-to-image models via inconspicuous examples. By formulating the example crafting as an optimization process and solving it with a genetic-based method, our proposed attack can reliably generate imperceptible prompts for text-to-image generation models. Evaluation on six popular text-to-image generation models demonstrates the efficiency and stealthiness of our attack in both white-box and black-box settings. To allow the community to build on top of our findings, we have made the artifacts available(1).
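The genetic-based optimization the abstract describes can be sketched as a standard evolutionary loop over candidate prompts: mutate and recombine a population, score each candidate with a fitness function, and keep the best. The sketch below is illustrative only; the toy `fitness` (character overlap with a hidden target string) stands in for the paper's actual objective, which scores semantic similarity between the generated image and the attacker's target, and all names and parameters here are assumptions rather than the authors' implementation.

```python
import random

random.seed(0)

ALPHABET = "abcdefghijklmnopqrstuvwxyz "


def fitness(candidate: str, target: str) -> float:
    # Toy stand-in objective: fraction of positions matching a hidden target
    # prompt. RIATIG's real objective would involve querying the text-to-image
    # model and measuring similarity to the target image, omitted here.
    return sum(a == b for a, b in zip(candidate, target)) / len(target)


def mutate(s: str, rate: float = 0.1) -> str:
    # Replace each character with a random one with probability `rate`.
    return "".join(random.choice(ALPHABET) if random.random() < rate else c
                   for c in s)


def crossover(a: str, b: str) -> str:
    # Single-point crossover between two parent candidates.
    cut = random.randrange(1, len(a))
    return a[:cut] + b[cut:]


def genetic_search(target: str, pop_size: int = 50, generations: int = 200) -> str:
    # Initialize a random population of candidate strings.
    pop = ["".join(random.choice(ALPHABET) for _ in target)
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda s: fitness(s, target), reverse=True)
        if fitness(pop[0], target) == 1.0:
            break
        elite = pop[: pop_size // 5]  # keep the best 20% unchanged (elitism)
        children = [mutate(crossover(random.choice(elite), random.choice(elite)))
                    for _ in range(pop_size - len(elite))]
        pop = elite + children
    return max(pop, key=lambda s: fitness(s, target))


best = genetic_search("a red car")
```

Because the elite survive unmodified each generation, the best fitness in the population is non-decreasing, which is what makes this style of search "reliable" in the sense the abstract uses.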
Pages: 20585-20594
Page count: 10
Related papers
50 in total
  • [31] GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
    Tao, Ming
    Bao, Bing-Kun
    Tang, Hao
    Xu, Changsheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14214 - 14223
  • [32] Advancements in adversarial generative text-to-image models: a review
    Zaghloul, Rawan
    Rawashdeh, Enas
    Bani-Ata, Tomader
    IMAGING SCIENCE JOURNAL, 2024,
  • [33] Adversarial Robustification via Text-to-Image Diffusion Models
    Choi, Daewon
    Jeong, Jongheon
    Jang, Huiwon
    Shin, Jinwoo
    COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 158 - 177
  • [34] Semantic Distance Adversarial Learning for Text-to-Image Synthesis
    Yuan, Bowen
    Sheng, Yefei
    Bao, Bing-Kun
    Chen, Yi-Ping Phoebe
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1255 - 1266
  • [35] ADVERSARIAL NETS WITH PERCEPTUAL LOSSES FOR TEXT-TO-IMAGE SYNTHESIS
    Cha, Miriam
    Gwon, Youngjune
    Kung, H. T.
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [36] Prompt Refinement with Image Pivot for Text-to-Image Generation
    Zhan, Jingtao
    Ai, Qingyao
    Liu, Yiqun
    Pan, Yingwei
    Yao, Ting
    Mao, Jiaxin
    Ma, Shaoping
    Mei, Tao
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 941 - 954
  • [37] Development and Classification of Image Dataset for Text-to-Image Generation
    Kumar M.
    Mittal M.
    Singh S.
    Journal of The Institution of Engineers (India): Series B, 2024, 105 (04) : 787 - 796
  • [38] SAW-GAN: Multi-granularity Text Fusion Generative Adversarial Networks for text-to-image generation
    Jin, Dehu
    Yu, Qi
    Yu, Lan
    Qi, Meng
    KNOWLEDGE-BASED SYSTEMS, 2024, 294
  • [39] DAC-GAN: Dual Auxiliary Consistency Generative Adversarial Network for Text-to-Image Generation
    Wang, Zhiwei
    Yang, Jing
    Cui, Jiajun
    Liu, Jiawei
    Wang, Jiahao
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 3 - 19
  • [40] DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
    Huang, Mengqi
    Mao, Zhendong
    Wang, Penghui
    Wang, Quan
    Zhang, Yongdong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4345 - 4354