Evaluation of Machine Learning Algorithms in Network-Based Intrusion Detection Using Progressive Dataset

被引:5
|
作者
Chua, Tuan-Hong [1 ]
Salam, Iftekhar [1 ]
机构
[1] Xiamen Univ Malaysia, Sch Comp & Data Sci, Sepang 43900, Malaysia
来源
SYMMETRY-BASEL | 2023年 / 15卷 / 06期
关键词
intrusion detection; machine learning; deep learning; cybersecurity; DETECTION SYSTEM;
D O I
10.3390/sym15061251
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cybersecurity has become one of the focuses of organisations. The number of cyberattacks keeps increasing as Internet usage continues to grow. As new types of cyberattacks continue to emerge, researchers focus on developing machine learning (ML)-based intrusion detection systems (IDS) to detect zero-day attacks. They usually remove some or all attack samples from the training dataset and only include them in the testing dataset when evaluating the performance. This method may detect unknown attacks; however, it does not reflect the long-term performance of the IDS as it only shows the changes in the type of attacks. In this work, we focused on evaluating the long-term performance of ML-based IDS. To achieve this goal, we proposed evaluating the ML-based IDS using a dataset created later than the training dataset. The proposed method can better assess the long-term performance as the testing dataset reflects the changes in the attack type and network infrastructure changes over time. We have implemented six of the most popular ML models, including decision tree (DT), random forest (RF), support vector machine (SVM), naive Bayes (NB), artificial neural network (ANN), and deep neural network (DNN). These models are trained and tested with a pair of datasets with symmetrical classes. Our experiments using the CIC-IDS2017 and the CSE-CIC-IDS2018 datasets show that SVM and ANN are most resistant to overfitting. Our experiments also indicate that DT and RF suffer the most from overfitting, although they perform well on the training dataset. On the other hand, our experiments using the LUFlow dataset have shown that all models can perform well when the difference between the training and testing datasets is small.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] A Neural Network-Based Learning Algorithm for Intrusion Detection Systems
    Ahmed, Hassan I.
    Elfeshawy, Nawal A.
    Elzoghdy, S. F.
    El-sayed, Hala S.
    Faragallah, Osama S.
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 97 (02) : 3097 - 3112
  • [42] A Neural Network-Based Learning Algorithm for Intrusion Detection Systems
    Hassan I. Ahmed
    Nawal A. Elfeshawy
    S. F. Elzoghdy
    Hala S. El-sayed
    Osama S. Faragallah
    Wireless Personal Communications, 2017, 97 : 3097 - 3112
  • [43] Intrusion Detection System Based on Machine Learning Algorithms: A Review
    Amanoul, Sandy Victor
    Abdulazeez, Adnan Mohsin
    2022 IEEE 18TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & APPLICATIONS (CSPA 2022), 2022, : 79 - 84
  • [44] Intrusion Detection in Computer Networks based on Machine Learning Algorithms
    Osareh, Alireza
    Shadgar, Bita
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (11): : 15 - 23
  • [45] Performance Evaluation of Some Machine Learning Algorithms for Security Intrusion Detection
    Elaeraj, Ouafae
    Leghris, Cherkaoui
    Renault, Eric
    MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 154 - 166
  • [46] Performance Evaluation of Machine Learning Algorithms in Apache Spark for Intrusion Detection
    Dobson, Anthony
    Roy, Kaushik
    Yuan, Xiaohong
    Xu, Jinsheng
    2018 28TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2018, : 374 - 379
  • [47] Dataset of attacks on a live enterprise VoIP network for machine learning based intrusion detection and prevention systems
    Alvares, Christabelle
    Dinesh, Dristi
    Alvi, Syed
    Gautam, Tannish
    Hasib, Maheen
    Raza, Ali
    COMPUTER NETWORKS, 2021, 197
  • [48] Intrusion Detection System for CAN Bus In-Vehicle Network based on Machine Learning Algorithms
    Alfardus, Asma
    Rawat, Danda B.
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 944 - 949
  • [49] Machine Learning Techniques for Intrusion Detection on Public Dataset
    Thanthrige, Udaya Sampath K. Perera Miriya
    Samarabandu, Jagath
    Wang, Xianbin
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,
  • [50] Ensemble of Machine Learning Algorithms for Intrusion Detection
    Chou, Te-Shun
    Fan, Jeffrey
    Fan, Sharon
    Makki, Kia
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3976 - +