Synthetic attack data generation model applying generative adversarial network for intrusion detection

被引：33

作者：

Kumar, Vikash ^{[1
,2
]}

Sinha, Ditipriya ^{[2
]}

机构：

[1] Siksha O Anusandhan Deemed be Univ, Dept Comp Sci & Engn, Bhubaneswar, India

[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India

来源：

COMPUTERS & SECURITY | 2023年 / 125卷

关键词：

Intrusion detection system; Cyber-attack; Generative adversarial networks; Data synthetization; Data imbalance; DEEP LEARNING APPROACH; INTERNET;

D O I：

10.1016/j.cose.2022.103054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Detecting a large number of attack classes accurately applying machine learning (ML) and deep learn-ing (DL) techniques depends on the number of representative samples available for each attack class. In most cases, the data samples are highly imbalanced that results in a biased intrusion detection model towards the majority classes. Under-sampling, over-sampling and SMOTE are some techniques among the solutions that turn the imbalanced dataset to balanced one. These techniques have not had much impact on the improvement of detection accuracy. To deal with this problem, this paper proposes a Wasser-stein Conditional Generative Adversarial Network (WCGAN) combined with an XGBoost Classifier. Gra-dient penalty along with the WCGAN is used for stable learning of the model. The proposed model is evaluated with some other GAN models (i.e., standard/vanilla GAN, Conditional GAN) which shows the significance of applying WCGAN in this paper. The loss on generated and real data shows a similar pat-tern and is lower for the Wasserstein variants of GAN compared to the other variants of the GAN model. The performance is benchmarked on three datasets NSL-KDD, UNSW-NB15 and BoT-IoT. The comparison of performance metrics before and after using the proposed framework with XGBoost classifier shows im-provement in terms of higher precision, recall and F-1 score. However, comparatively less improvement is observed in FAR compared to other classifiers such as Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM). The proposed work is also compared with a recent similar technique called DGM, which uses conditional GAN along with different ML classification models. The performance of the pro-posed model outperforms DGM. The proposed model creates a significant footprint (or, attack signatures) to tackle with the problem of data-imbalance during the design of the Intrusion Detection System (IDS).(c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：15

共 50 条

[31] Application of Optimized Bidirectional Generative Adversarial Network in ICS Intrusion Detection
Liu, Huipeng
Zhou, Zhiping
Zhang, Min
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3009 - 3014
[32] Enhancing network intrusion detection performance using generative adversarial networks
Zhao, Xinxing
Fok, Kar Wai
Thing, Vrizlynn L. L.
COMPUTERS & SECURITY, 2024, 145
[33] GAAINet: A Generative Adversarial Artificial Immune Network Model for Intrusion Detection in Industrial IoT Systems
Sithungu, Siphesihle P.
Ehlers, Elizabeth M.
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (05) : 456 - 461
[34] PGAN:A Generative Adversarial Network based Anomaly Detection Method for Network Intrusion Detection System
Li, Zeyi
Wang, Yun
Wang, Pan
Su, Haorui
2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 734 - 741
[35] Variational data generative model for intrusion detection
Lopez-Martin, Manuel
Carro, Belen
Sanchez-Esguevillas, Antonio
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (01) : 569 - 590
[36] Insufficient Data Generative Model for Pipeline Network Leak Detection Using Generative Adversarial Networks
Zhang, Huaguang
Hu, Xuguang
Ma, Dazhong
Wang, Rui
Xie, Xiangpeng
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 7107 - 7120
[37] Variational data generative model for intrusion detection
Manuel Lopez-Martin
Belen Carro
Antonio Sanchez-Esguevillas
Knowledge and Information Systems, 2019, 60 : 569 - 590
[38] Synthetic data generation using generative adversarial network for tokamak plasma current quench experiments
Dave, Bhrugu
Patel, Sarthak
Shivani, Rishi
Purohit, Shishir
Chaudhury, Bhaskar
CONTRIBUTIONS TO PLASMA PHYSICS, 2023, 63 (5-6)
[39] Generate medical synthetic data based on generative adversarial network
Xiang X.
Wang J.
Wang Z.
Duan S.
Pan H.
Zhuang R.
Han P.
Liu C.
Tongxin Xuebao/Journal on Communications, 2022, 43 (03): : 211 - 224
[40] Generative Adversarial Network-based Approach for Automated Generation of Adversarial Attacks Against a Deep-Learning based XSS Attack Detection Model
Alaoui, Rokia Lamrani
Nfaoui, El Habib
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 892 - 897

← 1 2 3 4 5 →