PAT: Geometry-Aware Hard-Label Black-Box Adversarial Attacks on Text

Cited by: 3

Authors:

Ye, Muchao [1]

Chen, Jinghui [1]

Miao, Chenglin [2]

Liu, Han [3]

Wang, Ting [1]

Ma, Fenglong [1]

Affiliations:

[1] Penn State Univ, University Pk, PA 16802 USA

[2] Iowa State Univ, Ames, IA USA

[3] Dalian Univ Technol, Dalian, Liaoning, Peoples R China

Source:

PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023

Funding:

National Science Foundation (USA);

Keywords:

hard-label adversarial attack; robustness of language model;

DOI:

10.1145/3580305.3599461

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Despite a plethora of prior explorations, conducting text adversarial attacks in practical settings is still challenging with the following constraints: black box - the inner structure of the victim model is unknown; hard label - the attacker only has access to the top-1 prediction results; and semantic preservation - the perturbation needs to preserve the original semantics. In this paper, we present PAT, a novel adversarial attack method employed under all these constraints. Specifically, PAT explicitly models the adversarial and non-adversarial prototypes and incorporates them to measure semantic changes for replacement selection in the hard-label black-box setting to generate high-quality samples. In each iteration, PAT finds original words that can be replaced back and selects better candidate words for perturbed positions in a geometry-aware manner guided by this estimation, which maximally improves the perturbation construction and minimally impacts the original semantics. Extensive evaluation with benchmark datasets and state-of-the-art models shows that PAT outperforms existing text adversarial attacks in terms of both attack effectiveness and semantic preservation. Moreover, we validate the efficacy of PAT against industry-leading natural language processing platforms in real-world settings.
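
The hard-label constraint and the "replace back" step mentioned in the abstract can be illustrated with a small sketch. This is not the PAT algorithm: it omits the prototype modeling and the geometry-aware candidate selection entirely and only shows a plain greedy pass that restores original words while the victim's top-1 label stays changed. The names `restore_original_words` and `toy_predict`, the word list, and the example sentences are all hypothetical and chosen only for illustration.

```python
from typing import Callable, List

# A hard-label oracle returns only the top-1 predicted label (no scores).
HardLabelOracle = Callable[[List[str]], str]


def restore_original_words(
    original: List[str],
    adversarial: List[str],
    predict: HardLabelOracle,
    true_label: str,
) -> List[str]:
    """Greedily put original words back while the text stays adversarial.

    Simplified illustration only; assumes both token lists are aligned
    position by position (word substitutions, no insertions or deletions).
    """
    if len(original) != len(adversarial):
        raise ValueError("token lists must have the same length")

    current = list(adversarial)
    for i, (orig_word, adv_word) in enumerate(zip(original, adversarial)):
        if orig_word == adv_word:
            continue  # position was never perturbed
        current[i] = orig_word  # tentatively restore the original word
        if predict(current) == true_label:
            # The attack no longer succeeds, so keep the perturbation.
            current[i] = adv_word
    return current


# Hypothetical toy victim: "pos" if any positive word appears, else "neg".
POSITIVE_WORDS = {"good", "fun", "great"}


def toy_predict(tokens: List[str]) -> str:
    return "pos" if any(t in POSITIVE_WORDS for t in tokens) else "neg"


original = "the movie was good and fun".split()
adversarial = "the film was fine and dull".split()
reduced = restore_original_words(original, adversarial, toy_predict, "pos")
print(" ".join(reduced))  # the movie was fine and dull
```

In this toy run, "film" can be restored to "movie" because the predicted label stays "neg", while "fine" and "dull" must remain because restoring either original word flips the prediction back to "pos". Each tentative restoration costs one query to the victim model, which is the only feedback available in the hard-label setting.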


Pages: 3093-3104

Page count: 12

