Synthetic attack data generation model applying generative adversarial network for intrusion detection

被引：33

作者：

Kumar, Vikash ^{[1
,2
]}

Sinha, Ditipriya ^{[2
]}

机构：

[1] Siksha O Anusandhan Deemed be Univ, Dept Comp Sci & Engn, Bhubaneswar, India

[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India

来源：

COMPUTERS & SECURITY | 2023年 / 125卷

关键词：

Intrusion detection system; Cyber-attack; Generative adversarial networks; Data synthetization; Data imbalance; DEEP LEARNING APPROACH; INTERNET;

D O I：

10.1016/j.cose.2022.103054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Detecting a large number of attack classes accurately applying machine learning (ML) and deep learn-ing (DL) techniques depends on the number of representative samples available for each attack class. In most cases, the data samples are highly imbalanced that results in a biased intrusion detection model towards the majority classes. Under-sampling, over-sampling and SMOTE are some techniques among the solutions that turn the imbalanced dataset to balanced one. These techniques have not had much impact on the improvement of detection accuracy. To deal with this problem, this paper proposes a Wasser-stein Conditional Generative Adversarial Network (WCGAN) combined with an XGBoost Classifier. Gra-dient penalty along with the WCGAN is used for stable learning of the model. The proposed model is evaluated with some other GAN models (i.e., standard/vanilla GAN, Conditional GAN) which shows the significance of applying WCGAN in this paper. The loss on generated and real data shows a similar pat-tern and is lower for the Wasserstein variants of GAN compared to the other variants of the GAN model. The performance is benchmarked on three datasets NSL-KDD, UNSW-NB15 and BoT-IoT. The comparison of performance metrics before and after using the proposed framework with XGBoost classifier shows im-provement in terms of higher precision, recall and F-1 score. However, comparatively less improvement is observed in FAR compared to other classifiers such as Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM). The proposed work is also compared with a recent similar technique called DGM, which uses conditional GAN along with different ML classification models. The performance of the pro-posed model outperforms DGM. The proposed model creates a significant footprint (or, attack signatures) to tackle with the problem of data-imbalance during the design of the Intrusion Detection System (IDS).(c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：15

共 50 条

[1] Network Intrusion Detection System based on Generative Adversarial Network for Attack Detection
Das, Abhijit
Balakrishnan, S. G.
Pramod
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 757 - 766
[2] IDSGAN: Generative Adversarial Networks for Attack Generation Against Intrusion Detection
Lin, Zilong
Shi, Yong
Xue, Zhi
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT III, 2022, 13282 : 79 - 91
[3] Empirical Evaluation on Synthetic Data Generation with Generative Adversarial Network
Lu, Pei-Hsuan
Wang, Pang-Chieh
Yu, Chia-Mu
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, MINING AND SEMANTICS (WIMS 2019), 2019,
[4] A Recombination Generative Adversarial Network for Intrusion Detection
Luo, Haoqi
Wan, Liang
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2024, 34 (02) : 323 - 334
[5] Addressing Imbalanced Data Problem with Generative Adversarial Network For Intrusion Detection
Yilmaz, Ibrahim
Masum, Rahat
Siraj, Ambareen
2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 25 - 30
[6] Evaluation of a Fintech Sales Synthetic Data Generation Model Using a Generative Adversarial Network
Lopez, Felipe A.
Duran-Riveros, Marcia
Maldonado-Duran, Sebastian
Ruete, David
Costa, Giannina
Coronado-Hernandez, Jairo R.
Gatica, Gustavo
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT VI, 2024, 14820 : 56 - 70
[7] Synthetic Intrusion Alert Generation through Generative Adversarial Networks
Sweet, Christopher
Moskal, Stephen
Yang, Shanchieh Jay
MILCOM 2019 - 2019 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2019,
[8] Synthetic Dynamic PMU Data Generation: A Generative Adversarial Network Approach
Zheng, Xiangtian
Wang, Bin
Xie, Le
2019 INTERNATIONAL CONFERENCE ON SMART GRID SYNCHRONIZED MEASUREMENTS AND ANALYTICS (SGSMA), 2019,
[9] An Enhanced Generative Adversarial Network Model for Fingerprint Presentation Attack Detection
Anshul, Ashutosh
Jha, Ashwini
Jain, Prayag
Rai, Anuj
Sharma, Ram Prakash
Dey, Somnath
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 376 - 386
[10] An Enhanced Generative Adversarial Network Model for Fingerprint Presentation Attack Detection
Anshul A.
Jha A.
Jain P.
Rai A.
Sharma R.P.
Dey S.
SN Computer Science, 4 (5)

← 1 2 3 4 5 →