Synthetic attack data generation model applying generative adversarial network for intrusion detection

被引:33
|
作者
Kumar, Vikash [1 ,2 ]
Sinha, Ditipriya [2 ]
机构
[1] Siksha O Anusandhan Deemed be Univ, Dept Comp Sci & Engn, Bhubaneswar, India
[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
关键词
Intrusion detection system; Cyber-attack; Generative adversarial networks; Data synthetization; Data imbalance; DEEP LEARNING APPROACH; INTERNET;
D O I
10.1016/j.cose.2022.103054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting a large number of attack classes accurately applying machine learning (ML) and deep learn-ing (DL) techniques depends on the number of representative samples available for each attack class. In most cases, the data samples are highly imbalanced that results in a biased intrusion detection model towards the majority classes. Under-sampling, over-sampling and SMOTE are some techniques among the solutions that turn the imbalanced dataset to balanced one. These techniques have not had much impact on the improvement of detection accuracy. To deal with this problem, this paper proposes a Wasser-stein Conditional Generative Adversarial Network (WCGAN) combined with an XGBoost Classifier. Gra-dient penalty along with the WCGAN is used for stable learning of the model. The proposed model is evaluated with some other GAN models (i.e., standard/vanilla GAN, Conditional GAN) which shows the significance of applying WCGAN in this paper. The loss on generated and real data shows a similar pat-tern and is lower for the Wasserstein variants of GAN compared to the other variants of the GAN model. The performance is benchmarked on three datasets NSL-KDD, UNSW-NB15 and BoT-IoT. The comparison of performance metrics before and after using the proposed framework with XGBoost classifier shows im-provement in terms of higher precision, recall and F-1 score. However, comparatively less improvement is observed in FAR compared to other classifiers such as Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM). The proposed work is also compared with a recent similar technique called DGM, which uses conditional GAN along with different ML classification models. The performance of the pro-posed model outperforms DGM. The proposed model creates a significant footprint (or, attack signatures) to tackle with the problem of data-imbalance during the design of the Intrusion Detection System (IDS).(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Network Intrusion Detection System based on Generative Adversarial Network for Attack Detection
    Das, Abhijit
    Balakrishnan, S. G.
    Pramod
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 757 - 766
  • [2] IDSGAN: Generative Adversarial Networks for Attack Generation Against Intrusion Detection
    Lin, Zilong
    Shi, Yong
    Xue, Zhi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT III, 2022, 13282 : 79 - 91
  • [3] Empirical Evaluation on Synthetic Data Generation with Generative Adversarial Network
    Lu, Pei-Hsuan
    Wang, Pang-Chieh
    Yu, Chia-Mu
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, MINING AND SEMANTICS (WIMS 2019), 2019,
  • [4] A Recombination Generative Adversarial Network for Intrusion Detection
    Luo, Haoqi
    Wan, Liang
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2024, 34 (02) : 323 - 334
  • [5] Addressing Imbalanced Data Problem with Generative Adversarial Network For Intrusion Detection
    Yilmaz, Ibrahim
    Masum, Rahat
    Siraj, Ambareen
    2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 25 - 30
  • [6] Evaluation of a Fintech Sales Synthetic Data Generation Model Using a Generative Adversarial Network
    Lopez, Felipe A.
    Duran-Riveros, Marcia
    Maldonado-Duran, Sebastian
    Ruete, David
    Costa, Giannina
    Coronado-Hernandez, Jairo R.
    Gatica, Gustavo
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT VI, 2024, 14820 : 56 - 70
  • [7] Synthetic Intrusion Alert Generation through Generative Adversarial Networks
    Sweet, Christopher
    Moskal, Stephen
    Yang, Shanchieh Jay
    MILCOM 2019 - 2019 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2019,
  • [8] Synthetic Dynamic PMU Data Generation: A Generative Adversarial Network Approach
    Zheng, Xiangtian
    Wang, Bin
    Xie, Le
    2019 INTERNATIONAL CONFERENCE ON SMART GRID SYNCHRONIZED MEASUREMENTS AND ANALYTICS (SGSMA), 2019,
  • [9] An Enhanced Generative Adversarial Network Model for Fingerprint Presentation Attack Detection
    Anshul, Ashutosh
    Jha, Ashwini
    Jain, Prayag
    Rai, Anuj
    Sharma, Ram Prakash
    Dey, Somnath
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 376 - 386
  • [10] An Enhanced Generative Adversarial Network Model for Fingerprint Presentation Attack Detection
    Anshul A.
    Jha A.
    Jain P.
    Rai A.
    Sharma R.P.
    Dey S.
    SN Computer Science, 4 (5)