Balanced Multi-Class Network Intrusion Detection Using Machine Learning

被引:1
|
作者
Khan, Faraz Ahmad [1 ]
Shah, Asghar Ali [2 ]
Alshammry, Nizal [3 ]
Saif, Saifullah [1 ]
Khan, Wasim [1 ]
Malik, Muhammad Osama [4 ]
Ullah, Zahid [5 ]
机构
[1] Univ Engn & Technol Mardan, Mardan 23200, Pakistan
[2] Beaconhouse Int Coll, Dept Comp Sci, Islamabad 46000, Pakistan
[3] Northern Border Univ, Fac Comp & Informat Technol, Dept Comp Sci, Ar Ar 91431, Saudi Arabia
[4] Univ Tulsa, Collin Coll Business, Tulsa, OK 74104 USA
[5] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Random forests; Classification algorithms; Telecommunication traffic; Security; Organizations; Feature extraction; Machine learning algorithms; Firewalls (computing); Biological system modeling; AdaBoost; anomaly detection; binary classification; decision trees; deep learning; intrusion detection; K-nearest neighbor (KNN); logistic regression; machine learning; naive Bayes classifiers; network security; random forests;
D O I
10.1109/ACCESS.2024.3503497
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cybersecurity is gaining a high position in the IT sector. Connecting more devices to the internet smooths the way for hackers. It is hard for signature-based security tools to detect new attacks that emerge and evolve with slight changes. Researchers are trying to build a Network Intrusion Detection System (NIDS) that can accurately detect the zero-day attacks evolved through minor changes. An anomaly-based NIDS has attracted researchers to develop a system to detect malign traffic in a network using Machine Learning (ML) models. Therefore, in recent years, the designs of modern NIDS for higher detection rates and lower false alarms have been refined by utilizing advanced ML and Deep Learning (DL) approaches. However, it is still a problem for the supervised and unsupervised algorithms to achieve high performance, absolute accuracy, and minimal false alarm rate. This work aims to design an effective NIDS that addresses the current limitation using machine learning models trained on reliable flow-based data (CICIDS-2017). The system will improve the detection accuracy and reduce false alarms in high-speed network environments. To achieve results, the dataset has been balanced using the SMOTE-Tomek Links technique. After cleaning and organizing the dataset, the trained algorithms are Decision Tree, Random Forest, XGBoost, K-Nearest Neighbor, Naive Bayes, Logistic Regression, and AdaBoost algorithm. These algorithms are pulled from literature studies because of their exceptional performance on old datasets. This work has achieved a Decision Tree model with 96.37% accuracy and 96.33% F1-score and the AdaBoost model with 96.37% accuracy and 96.33% F1-score for multiclass classification. For binary classification, the Decision Tree (DT) model has exhibited the highest test accuracy of 99.96%, followed by Random Forest (99.84%), Adaboost (99.77%), and Xgboost (99.57), with the highest average precision of 100% and ROC-AUC of 99.96%. We have also found that binary classification performs better when it takes more time to train each classifier than multiclass classification. This research study incorporates proper validation of the models and achieves high accuracy and exact results compared to the literature. The results show that a balanced CICIDS-2017 dataset improves the performance of decision trees and AdaBoost classifiers. The emplacement of NIDS in networks and their underlying technology are equally significant for detecting real-time attacks.
引用
收藏
页码:178222 / 178236
页数:15
相关论文
共 50 条
  • [1] Multi-Class Network Anomaly Detection Using Machine Learning Techniques
    Gunupusala, Satyanarayana
    Kaila, Shahu Chatrapathi
    CONTEMPORARY MATHEMATICS, 2024, 5 (02): : 2335 - 2352
  • [2] Network Intrusion Detection Based on Multi-Class Support Vector Machine
    Anh Vu Le
    Hoai An Le Thi
    Manh Cuong Nguyen
    Zidna, Ahmed
    COMPUTATIONAL COLLECTIVE INTELLIGENCE - TECHNOLOGIES AND APPLICATIONS, PT I, 2012, 7653 : 536 - 543
  • [3] A Multi-class Classification for Detection of IoT Network Attacks Using Machine Learning Models
    Ashok, Gadde
    Serath, Kommula
    Kumar, T. Gireesh
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2024, 2024, 14501 : 167 - 178
  • [4] A Multi-Class Intrusion Detection System Based on Continual Learning
    Oikonomou, Chrysoula
    Iliopoulos, Ilias
    Ioannidis, Dimosthenis
    Tzovaras, Dimitrios
    2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 86 - 91
  • [5] Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem
    Bacevicius, Mantas
    Paulauskaite-Taraseviciene, Agne
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [6] Dealing with Imbalanced Data in Multi-class Network Intrusion Detection Systems Using XGBoost
    AL-Essa, Malik
    Appice, Annalisa
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2021, 1525 : 5 - 21
  • [7] Two Layers Multi-class Detection Method for Network Intrusion Detection System
    Yuan, Yali
    Huo, Liuwei
    Hogrefe, Dieter
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 767 - 772
  • [8] ECG Multi-Class Classification using Neural Network as Machine Learning Model
    Lassoued, Hela
    Ketata, Raouf
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 473 - 478
  • [9] Enhanced GraphSAGE for Multi-Class Intrusion Detection
    Le, Hong-Dang
    Park, Minho
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 39 - 41
  • [10] An Approach for the Application of a Dynamic Multi-Class Classifier for Network Intrusion Detection Systems
    Larriva-Novo, Xavier
    Sanchez-Zas, Carmen
    Villagra, Victor A.
    Vega-Barbas, Mario
    Rivera, Diego
    ELECTRONICS, 2020, 9 (11) : 1 - 18