Evaluation of Machine Learning Algorithms in Network-Based Intrusion Detection Using Progressive Dataset

被引：5

作者：

Chua, Tuan-Hong ^{[1
]}

Salam, Iftekhar ^{[1
]}

机构：

[1] Xiamen Univ Malaysia, Sch Comp & Data Sci, Sepang 43900, Malaysia

来源：

SYMMETRY-BASEL | 2023年 / 15卷 / 06期

关键词：

intrusion detection; machine learning; deep learning; cybersecurity; DETECTION SYSTEM;

D O I：

10.3390/sym15061251

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Cybersecurity has become one of the focuses of organisations. The number of cyberattacks keeps increasing as Internet usage continues to grow. As new types of cyberattacks continue to emerge, researchers focus on developing machine learning (ML)-based intrusion detection systems (IDS) to detect zero-day attacks. They usually remove some or all attack samples from the training dataset and only include them in the testing dataset when evaluating the performance. This method may detect unknown attacks; however, it does not reflect the long-term performance of the IDS as it only shows the changes in the type of attacks. In this work, we focused on evaluating the long-term performance of ML-based IDS. To achieve this goal, we proposed evaluating the ML-based IDS using a dataset created later than the training dataset. The proposed method can better assess the long-term performance as the testing dataset reflects the changes in the attack type and network infrastructure changes over time. We have implemented six of the most popular ML models, including decision tree (DT), random forest (RF), support vector machine (SVM), naive Bayes (NB), artificial neural network (ANN), and deep neural network (DNN). These models are trained and tested with a pair of datasets with symmetrical classes. Our experiments using the CIC-IDS2017 and the CSE-CIC-IDS2018 datasets show that SVM and ANN are most resistant to overfitting. Our experiments also indicate that DT and RF suffer the most from overfitting, although they perform well on the training dataset. On the other hand, our experiments using the LUFlow dataset have shown that all models can perform well when the difference between the training and testing datasets is small.

引用

页数：31

共 50 条

[41] A Neural Network-Based Learning Algorithm for Intrusion Detection Systems
Ahmed, Hassan I.
Elfeshawy, Nawal A.
Elzoghdy, S. F.
El-sayed, Hala S.
Faragallah, Osama S.
WIRELESS PERSONAL COMMUNICATIONS, 2017, 97 (02) : 3097 - 3112
[42] A Neural Network-Based Learning Algorithm for Intrusion Detection Systems
Hassan I. Ahmed
Nawal A. Elfeshawy
S. F. Elzoghdy
Hala S. El-sayed
Osama S. Faragallah
Wireless Personal Communications, 2017, 97 : 3097 - 3112
[43] Intrusion Detection System Based on Machine Learning Algorithms: A Review
Amanoul, Sandy Victor
Abdulazeez, Adnan Mohsin
2022 IEEE 18TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & APPLICATIONS (CSPA 2022), 2022, : 79 - 84
[44] Intrusion Detection in Computer Networks based on Machine Learning Algorithms
Osareh, Alireza
Shadgar, Bita
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (11): : 15 - 23
[45] Performance Evaluation of Some Machine Learning Algorithms for Security Intrusion Detection
Elaeraj, Ouafae
Leghris, Cherkaoui
Renault, Eric
MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 154 - 166
[46] Performance Evaluation of Machine Learning Algorithms in Apache Spark for Intrusion Detection
Dobson, Anthony
Roy, Kaushik
Yuan, Xiaohong
Xu, Jinsheng
2018 28TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2018, : 374 - 379
[47] Dataset of attacks on a live enterprise VoIP network for machine learning based intrusion detection and prevention systems
Alvares, Christabelle
Dinesh, Dristi
Alvi, Syed
Gautam, Tannish
Hasib, Maheen
Raza, Ali
COMPUTER NETWORKS, 2021, 197
[48] Intrusion Detection System for CAN Bus In-Vehicle Network based on Machine Learning Algorithms
Alfardus, Asma
Rawat, Danda B.
2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 944 - 949
[49] Machine Learning Techniques for Intrusion Detection on Public Dataset
Thanthrige, Udaya Sampath K. Perera Miriya
Samarabandu, Jagath
Wang, Xianbin
2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,
[50] Ensemble of Machine Learning Algorithms for Intrusion Detection
Chou, Te-Shun
Fan, Jeffrey
Fan, Sharon
Makki, Kia
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3976 - +

← 1 2 3 4 5 →