Synthetic attack data generation model applying generative adversarial network for intrusion detection

被引:33
|
作者
Kumar, Vikash [1 ,2 ]
Sinha, Ditipriya [2 ]
机构
[1] Siksha O Anusandhan Deemed be Univ, Dept Comp Sci & Engn, Bhubaneswar, India
[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
关键词
Intrusion detection system; Cyber-attack; Generative adversarial networks; Data synthetization; Data imbalance; DEEP LEARNING APPROACH; INTERNET;
D O I
10.1016/j.cose.2022.103054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting a large number of attack classes accurately applying machine learning (ML) and deep learn-ing (DL) techniques depends on the number of representative samples available for each attack class. In most cases, the data samples are highly imbalanced that results in a biased intrusion detection model towards the majority classes. Under-sampling, over-sampling and SMOTE are some techniques among the solutions that turn the imbalanced dataset to balanced one. These techniques have not had much impact on the improvement of detection accuracy. To deal with this problem, this paper proposes a Wasser-stein Conditional Generative Adversarial Network (WCGAN) combined with an XGBoost Classifier. Gra-dient penalty along with the WCGAN is used for stable learning of the model. The proposed model is evaluated with some other GAN models (i.e., standard/vanilla GAN, Conditional GAN) which shows the significance of applying WCGAN in this paper. The loss on generated and real data shows a similar pat-tern and is lower for the Wasserstein variants of GAN compared to the other variants of the GAN model. The performance is benchmarked on three datasets NSL-KDD, UNSW-NB15 and BoT-IoT. The comparison of performance metrics before and after using the proposed framework with XGBoost classifier shows im-provement in terms of higher precision, recall and F-1 score. However, comparatively less improvement is observed in FAR compared to other classifiers such as Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM). The proposed work is also compared with a recent similar technique called DGM, which uses conditional GAN along with different ML classification models. The performance of the pro-posed model outperforms DGM. The proposed model creates a significant footprint (or, attack signatures) to tackle with the problem of data-imbalance during the design of the Intrusion Detection System (IDS).(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Application of Optimized Bidirectional Generative Adversarial Network in ICS Intrusion Detection
    Liu, Huipeng
    Zhou, Zhiping
    Zhang, Min
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3009 - 3014
  • [32] Enhancing network intrusion detection performance using generative adversarial networks
    Zhao, Xinxing
    Fok, Kar Wai
    Thing, Vrizlynn L. L.
    COMPUTERS & SECURITY, 2024, 145
  • [33] GAAINet: A Generative Adversarial Artificial Immune Network Model for Intrusion Detection in Industrial IoT Systems
    Sithungu, Siphesihle P.
    Ehlers, Elizabeth M.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (05) : 456 - 461
  • [34] PGAN:A Generative Adversarial Network based Anomaly Detection Method for Network Intrusion Detection System
    Li, Zeyi
    Wang, Yun
    Wang, Pan
    Su, Haorui
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 734 - 741
  • [35] Variational data generative model for intrusion detection
    Lopez-Martin, Manuel
    Carro, Belen
    Sanchez-Esguevillas, Antonio
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (01) : 569 - 590
  • [36] Insufficient Data Generative Model for Pipeline Network Leak Detection Using Generative Adversarial Networks
    Zhang, Huaguang
    Hu, Xuguang
    Ma, Dazhong
    Wang, Rui
    Xie, Xiangpeng
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 7107 - 7120
  • [37] Variational data generative model for intrusion detection
    Manuel Lopez-Martin
    Belen Carro
    Antonio Sanchez-Esguevillas
    Knowledge and Information Systems, 2019, 60 : 569 - 590
  • [38] Synthetic data generation using generative adversarial network for tokamak plasma current quench experiments
    Dave, Bhrugu
    Patel, Sarthak
    Shivani, Rishi
    Purohit, Shishir
    Chaudhury, Bhaskar
    CONTRIBUTIONS TO PLASMA PHYSICS, 2023, 63 (5-6)
  • [39] Generate medical synthetic data based on generative adversarial network
    Xiang X.
    Wang J.
    Wang Z.
    Duan S.
    Pan H.
    Zhuang R.
    Han P.
    Liu C.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (03): : 211 - 224
  • [40] Generative Adversarial Network-based Approach for Automated Generation of Adversarial Attacks Against a Deep-Learning based XSS Attack Detection Model
    Alaoui, Rokia Lamrani
    Nfaoui, El Habib
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 892 - 897