Unmasking the Vulnerabilities of Deep Learning Models: A Multi-Dimensional Analysis of Adversarial Attacks and Defenses

Cited: 0
Authors
Juraev, Firuz [1 ]
Abuhamad, Mohammed [2 ]
Chan-Tin, Eric [2 ]
Thiruvathukal, George K. [2 ]
Abuhmed, Tamer [1 ]
Affiliations
[1] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon, South Korea
[2] Loyola Univ, Dept Comp Sci, Chicago, IL USA
Funding
National Research Foundation of Singapore
Keywords
Threat Analysis; Deep Learning; Black-box Attacks; Adversarial Perturbations; Defensive Techniques
DOI
10.1109/SVCC61185.2024.10637364
CLC Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Deep Learning (DL) is rapidly maturing to the point that it can be used in safety- and security-critical applications, such as self-driving vehicles, surveillance, drones, and robots. However, adversarial samples, which are imperceptible to the human eye, pose a serious threat that can cause a model to misbehave and compromise the performance of such applications. Addressing the robustness of DL models has therefore become crucial to understanding and defending against adversarial attacks. In this study, we perform comprehensive experiments to examine the effect of adversarial attacks and defenses on various model architectures across well-known datasets. Our research focuses on black-box attacks such as SimBA, HopSkipJump, MGAAttack, and the boundary attack, as well as preprocessor-based defensive mechanisms, including bit squeezing, median smoothing, and JPEG filtering. Experimenting with various models, our results demonstrate that the level of noise needed for a successful attack increases with the number of layers, while the attack success rate decreases, indicating a significant relationship between model complexity and robustness. Investigating the relationship between diversity and robustness, our experiments with diverse models show that having a large number of parameters does not imply higher robustness. Our experiments also examine the effect of the training dataset on model robustness: various datasets, such as ImageNet-1000, CIFAR-100, and CIFAR-10, are used to evaluate the black-box attacks. Considering the multiple dimensions of our analysis, e.g., model complexity and training dataset, we examine the behavior of black-box attacks when models apply defenses. Our results show that applying defense strategies can significantly reduce attack effectiveness. This research provides in-depth analysis of, and insight into, the robustness of DL models against various attacks and the effectiveness of different defenses.
Pages: 8
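
Since the abstract names several query-based black-box attacks, a minimal sketch may help clarify how such an attack operates with access only to a model's output probabilities. The sketch below follows the SimBA idea (Guo et al., 2019): greedily step along random pixel-basis directions and keep any step that lowers the true-class probability. The `model_probs` linear-softmax classifier is a hypothetical stand-in so the sketch is runnable, and `eps` and the query budget are illustrative choices, not the paper's settings.

```python
# A minimal, score-based black-box attack sketch in the spirit of SimBA.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for a deployed model that only exposes probabilities.
W = rng.normal(size=(10, 32 * 32 * 3))

def model_probs(x: np.ndarray) -> np.ndarray:
    logits = W @ x.ravel()
    e = np.exp(logits - logits.max())
    return e / e.sum()

def simba(x: np.ndarray, true_label: int, eps: float = 0.2, budget: int = 2000) -> np.ndarray:
    """Greedy coordinate-wise descent on the true-class probability."""
    x_adv = x.copy()
    p_true = model_probs(x_adv)[true_label]
    # Visit pixel-basis coordinates in random order, up to the query budget.
    for idx in rng.permutation(x.size)[:budget]:
        for step in (eps, -eps):
            candidate = x_adv.copy()
            candidate.flat[idx] = np.clip(candidate.flat[idx] + step, 0.0, 1.0)
            p_new = model_probs(candidate)[true_label]
            if p_new < p_true:  # keep the step only if the true class loses mass
                x_adv, p_true = candidate, p_new
                break
        if model_probs(x_adv).argmax() != true_label:
            break  # stop once the model misclassifies the image
    return x_adv

x = rng.uniform(0.0, 1.0, size=(32, 32, 3))
label = int(model_probs(x).argmax())
x_adv = simba(x, label)
print("label flipped:", int(model_probs(x_adv).argmax()) != label)
print("L2 perturbation:", float(np.linalg.norm(x_adv - x)))
```

The L2 norm reported at the end corresponds to the "level of noise" the abstract relates to model depth: deeper models tend to require a larger accumulated perturbation before the label flips.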
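
The three preprocessor-based defenses named in the abstract can likewise be sketched as input transformations applied before inference. The functions below are minimal illustrations of bit squeezing (bit-depth reduction), median smoothing, and JPEG filtering; Pillow and SciPy are used for convenience, and the parameters (4 bits, a 3x3 window, JPEG quality 75) are illustrative assumptions rather than the paper's configuration.

```python
# Minimal sketches of three preprocessor defenses: bit squeezing,
# median smoothing, and a JPEG compression round-trip.
import io

import numpy as np
from PIL import Image
from scipy.ndimage import median_filter

def bit_squeeze(x: np.ndarray, bits: int = 4) -> np.ndarray:
    """Quantize pixels in [0, 1] to 2**bits levels, erasing fine-grained noise."""
    levels = 2 ** bits - 1
    return np.round(x * levels) / levels

def median_smooth(x: np.ndarray, size: int = 3) -> np.ndarray:
    """Apply a spatial median filter independently to each color channel."""
    return median_filter(x, size=(size, size, 1))

def jpeg_filter(x: np.ndarray, quality: int = 75) -> np.ndarray:
    """Round-trip the image through lossy JPEG compression."""
    buf = io.BytesIO()
    Image.fromarray((x * 255).astype(np.uint8)).save(buf, format="JPEG", quality=quality)
    buf.seek(0)
    return np.asarray(Image.open(buf), dtype=np.float32) / 255.0

# Apply each defense to a (possibly adversarial) image and report the change.
x_adv = np.random.default_rng(1).uniform(0.0, 1.0, (32, 32, 3)).astype(np.float32)
for defense in (bit_squeeze, median_smooth, jpeg_filter):
    x_def = defense(x_adv)
    print(f"{defense.__name__}: mean |change| = {np.abs(x_def - x_adv).mean():.4f}")
```

In a preprocessor defense, the classifier is evaluated on defense(x) rather than x; because the transformation is independent of the model and the attack, it can blunt query-based attacks of the kind examined above without retraining.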