Towards Data-Driven Network Intrusion Detection Systems: Features Dimensionality Reduction and Machine Learning

Cited by: 0
Authors
Maabreh M. [1]
Obeidat I. [1]
Elsoud E.A. [2]
Alnajjai A. [2]
Alzyoud R. [2]
Darwish O. [3]
Affiliations
[1] Department of Information Technology, Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology, The Hashemite University, P.O. Box 330127, Zarqa
[2] Department of Computer Information Systems, Faculty of Prince Al-Hussein Bin Abdallah II for Information Technology, The Hashemite University, Zarqa
[3] Information Security and Applied Computing Department, Eastern Michigan University, MI
Keywords
big data; deep learning; feature selection; intrusion detection; machine learning; network security
DOI
10.3991/ijim.v16i14.30197
Abstract
Cyber attacks have increased in tandem with the exponential expansion of computer networks and network applications throughout the world. Various machine learning and deep learning (ML/DL) models in the literature have demonstrated excellent accuracy in predicting network attacks; nonetheless, simple and understandable models would be a significant benefit in network monitoring systems. In this study, we evaluate four feature selection algorithms, used to find the minimal set of features predictive of network attacks, together with seven classical machine learning algorithms and a deep learning algorithm, on one million random instances of the CSE-CIC-IDS2018 big data set for network intrusions. The feature selection algorithms highlighted the importance of features related to the forward direction (FWD) and two flow measures (FLOW) in predicting the binary traffic type: benign or attack. Furthermore, the results revealed that not all features are required to build efficient ML/DL models for detecting network attacks: four features unanimously selected by the feature selection algorithms were enough to build ML models comparable to those trained on all features. This might lead to models that are more suitable for deployment in terms of complexity, explainability, and scalability. Moreover, selecting the four unanimous features instead of all traffic features may decrease training time by 10% to 50%. © 2022. All Rights Reserved.
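The idea of keeping only the features that every selection algorithm agrees on can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name the four feature selection algorithms, so three simple filter scores (absolute Pearson correlation, a rank correlation, and the accuracy of a single-threshold rule) stand in for them, the data is synthetic rather than drawn from CSE-CIC-IDS2018, and the feature names are hypothetical.

```python
import random

def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return 0.0 if vx == 0 or vy == 0 else cov / (vx * vy) ** 0.5

def ranks(xs):
    """Rank positions 0..n-1 (ties broken by order; fine for continuous data)."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    out = [0] * len(xs)
    for r, i in enumerate(order):
        out[i] = r
    return out

def score_pearson(xs, ys):
    return abs(pearson(xs, ys))

def score_rank(xs, ys):
    # Correlation between the ranks of x and the binary label y.
    return abs(pearson(ranks(xs), ys))

def score_stump(xs, ys):
    """Accuracy of the best single-threshold rule on this feature alone."""
    pairs = sorted(zip(xs, ys))
    n, total_pos = len(pairs), sum(ys)
    best, left_pos = 0, 0
    for i in range(n + 1):
        # Split after the i smallest values: predict 0 on the left, 1 on the right.
        correct = (i - left_pos) + (total_pos - left_pos)
        best = max(best, correct, n - correct)  # also allow the flipped rule
        if i < n:
            left_pos += pairs[i][1]
    return best / n

def top_k(scorer, features, labels, k):
    """Names of the k highest-scoring features under one scorer."""
    ordered = sorted(features, key=lambda name: scorer(features[name], labels),
                     reverse=True)
    return set(ordered[:k])

# Synthetic binary traffic labels: 0 = benign, 1 = attack (illustrative only).
rng = random.Random(0)
y = [rng.randint(0, 1) for _ in range(400)]
features = {
    "signal_a": [2.0 * t + rng.gauss(0, 1) for t in y],   # informative
    "signal_b": [-1.5 * t + rng.gauss(0, 1) for t in y],  # informative
    "noise_1": [rng.gauss(0, 1) for _ in y],
    "noise_2": [rng.gauss(0, 1) for _ in y],
    "noise_3": [rng.gauss(0, 1) for _ in y],
    "noise_4": [rng.gauss(0, 1) for _ in y],
}

scorers = [score_pearson, score_rank, score_stump]
selections = [top_k(s, features, y, k=2) for s in scorers]

# Unanimity: keep only the features that every scorer placed in its top k.
unanimous = set.intersection(*selections)
print(sorted(unanimous))
```

A classifier would then be trained on the `unanimous` columns only and compared against one trained on all columns, which is the comparison the abstract reports.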
Pages: 123-135
Page count: 12