Towards Data-Driven Network Intrusion Detection Systems: Features Dimensionality Reduction and Machine Learning

被引:0
|
作者
Maabreh M. [1 ]
Obeidat I. [1 ]
Elsoud E.A. [2 ]
Alnajjai A. [2 ]
Alzyoud R. [2 ]
Darwish O. [3 ]
机构
[1] Department of Information Technology, Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology, The Hashemite University, P.O. Box 330127, Zarqa
[2] Department of Computer Information Systems, Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology, The Hashemite University, Zarqa
[3] Information Security and Applied Computing Department, Eastern Michigan University, MI
关键词
big data; deep learning; feature selection; intrusion detection; machine learning; network security;
D O I
10.3991/ijim.v16i14.30197
中图分类号
学科分类号
摘要
Cyber attacks have increased in tandem with the exponential expansion of computer networks and network applications throughout the world. Fortunately, various machine/deep learning models have demonstrated excellent accuracy in predicting network attacks in the literature; nonetheless, having simple and understandable models might be a big benefit in network monitoring systems. In this study, we evaluate four feature selection algorithms to find the minimal set of predictive features of network attacks, seven classical machine learning algorithms, and the deep learning algorithm on one million random instances of the CSE-CIC-IDS2018 big data set for network intrusions. The feature selection algorithms highlighted the importance of features related to forwarding direction (FWD) and two flow measures (FLOW) in predicting the binary traffic type; benign or attack. Furthermore, the results revealed that not all features are required to build efficient ML/DL in detecting network attacks, four features unanimously selected by the feature selection algorithms were enough to build comparable ML models to those trained on all features. This might lead to models that are more suitable for deployment in terms of complexity, explainability, and scalability. Moreover, by selecting four unanimity features instead of all traffic features, the training time may be decreased by 10% to 50%. © 2022. All Rights Reserved.
引用
收藏
页码:123 / 135
页数:12
相关论文
共 50 条
  • [31] Data-driven Anomaly Detection with Timing Features for Embedded Systems
    Lu, Sixing
    Lysecky, Roman
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2019, 24 (03)
  • [32] Machine Learning Techniques for feature Reduction in Intrusion Detection Systems: A Comparison
    Bahrololum, M.
    Salahi, E.
    Khaleghi, M.
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 1091 - 1095
  • [33] Comparison of Machine Learning and Deep Learning Models for Network Intrusion Detection Systems
    Thapa, Niraj
    Liu, Zhipeng
    Kc, Dukka B.
    Gokaraju, Balakrishna
    Roy, Kaushik
    FUTURE INTERNET, 2020, 12 (10) : 1 - 16
  • [34] Unsupervised Machine Learning Techniques for Network Intrusion Detection on Modern Data
    Verkerken, Miel
    D'hooge, Laurens
    Wauters, Tim
    Volckaert, Bruno
    De Turck, Filip
    2020 FOURTH CYBER SECURITY IN NETWORKING CONFERENCE (CSNET), 2020,
  • [35] Application of deep extreme learning machine in network intrusion detection systems
    Wuke, Li
    Guangluan, Yin
    Xiaoxiao, Chen
    IAENG International Journal of Computer Science, 2020, 47 (02) : 136 - 143
  • [36] Adversarial Machine Learning for Network Intrusion Detection Systems: A Comprehensive Survey
    He, Ke
    Kim, Dan Dongseong
    Asghar, Muhammad Rizwan
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2023, 25 (01): : 538 - 566
  • [37] A machine learning approach for improving the performance of network intrusion detection systems
    Azizan A.H.
    Mostafa S.A.
    Mustapha A.
    Mohd Foozy C.F.
    Abd Wahab M.H.
    Mohammed M.A.
    Khalaf B.A.
    Annals of Emerging Technologies in Computing, 2021, 5 (Special issue 5) : 201 - 208
  • [38] Dimensionality reduction for regularization of sparse data-driven RANS simulations
    Piroozmand, Pasha
    Brenner, Oliver
    Jenny, Patrick
    JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 492
  • [39] Towards Data-Driven Machine Translation for Lumasaaba
    Nabende, Peter
    DIGITAL SCIENCE, 2019, 850 : 3 - 11
  • [40] Towards optimized machine-learning-driven intrusion detection for Internet of Things applications
    Alemerien K.
    Al-suhemat S.
    Almahadin M.
    International Journal of Information Technology, 2024, 16 (8) : 4981 - 4994