A network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing

被引:3
|
作者
Zhang, Hao [1 ,2 ]
Xiao, Zude [1 ,2 ]
Gu, Jason [3 ]
Liu, Yanhua [1 ,2 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
[2] Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informat, Fuzhou 350116, Peoples R China
[3] Dalhousie Univ, Dept Elect & Comp Engn, Halifax, NS B3J 1Z1, Canada
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 18期
基金
中国国家自然科学基金;
关键词
Network intrusion detection; Anomaly detection; Semi-supervised learning; Ensemble learning; Class imbalance; INTRUSION DETECTION; SYSTEMS; FRAMEWORK; FOREST;
D O I
10.1007/s11227-023-05474-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of network technology, the Internet has brought significant convenience to various sectors of society, holding a prominent position. Due to the unpredictable and severe consequences resulting from malicious attacks, the detection of anomalous network traffic has garnered considerable attention from researchers over the past few decades. Accurately labeling a sufficient amount of network traffic data as a training dataset within a short period of time is a challenging task, given the rapid and massive generation of network traffic data. Furthermore, the proportion of malicious attack traffic is relatively small compared to the overall traffic data, and the distribution of traffic data across different types of malicious attacks also varies significantly. To address the aforementioned challenges, this paper presents a novel network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing. Building upon the assumption of consistent distribution between labeled and unlabeled data, this paper introduces the multiclass split balancing strategy and the adaptive confidence threshold function. These innovative approaches aim to tackle the issue of the multiclass imbalanced in traffic data. By leveraging the mutually beneficial relationship between semi-supervised learning and ensemble learning, this paper presents the collaborative rotation forest algorithm. This algorithm is specifically designed to enhance performance of anomaly detection in an environment with label inadequacy. Several comparative experiments conducted on the NSL-KDD, UNSW-NB15, and ToN-IoT demonstrate that the proposed algorithm achieves significant improvements in performance. Specifically, it enhances precision by 1.5-5.7%, recall by 1.5-5.7%, and F-Measure by 1.4-4.3% compared to the state-of-the-art algorithms.
引用
收藏
页码:20445 / 20480
页数:36
相关论文
共 50 条
  • [31] Anomaly Intrusion Detection for Evolving Data Stream Based on Semi-supervised Learning
    Yu, Yan
    Guo, Shanqing
    Lan, Shaohua
    Ban, Tao
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT I, 2009, 5506 : 571 - +
  • [32] Semi-supervised visual anomaly detection based on convolutional autoencoder and transfer learning
    Saeedi, Jamal
    Giusti, Alessandro
    MACHINE LEARNING WITH APPLICATIONS, 2023, 11
  • [33] SSCL: Semi-supervised Contrastive Learning for Industrial Anomaly Detection
    Cai, Wei
    Gao, Jiechao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 100 - 112
  • [34] Topological Learning for Semi-Supervised Anomaly Detection in Hyperspectral Imagery
    Ramirez, Juan, Jr.
    Armitage, Tristan
    Bihl, Trevor
    Kramer, Ryan
    PROCEEDINGS OF THE 2019 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2019, : 560 - 564
  • [35] Semi-Supervised Machine Learning for Spacecraft Anomaly Detection & Diagnosis
    Ramachandran, Sowmya
    Rosengarten, Maia
    Belardi, Christian
    2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
  • [36] The Network Representation Learning Algorithm Based on Semi-Supervised Random Walk
    Liu, Dong
    Li, Qinpeng
    Ru, Yan
    Zhang, Jun
    IEEE ACCESS, 2020, 8 : 222956 - 222965
  • [37] A semi-supervised clustering algorithm for network intrusion detection
    Wei X.-T.
    Huang H.-K.
    Tian S.-F.
    Tiedao Xuebao/Journal of the China Railway Society, 2010, 32 (01): : 49 - 53
  • [38] Semi-Supervised Learning Methods for Network Intrusion Detection
    Chen, Chuanliang
    Gong, Yunchao
    Tian, Yingjie
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2602 - +
  • [39] MSSBoost: A new multiclass boosting to semi-supervised learning
    Tanha, Jafar
    NEUROCOMPUTING, 2018, 314 : 251 - 266
  • [40] Multiclass Semi-Supervised Boosting Using Similarity Learning
    Tanha, Jafar
    Saberian, Mohammad Javad
    van Someren, Maarten
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1205 - 1210