A network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing

被引:3
|
作者
Zhang, Hao [1 ,2 ]
Xiao, Zude [1 ,2 ]
Gu, Jason [3 ]
Liu, Yanhua [1 ,2 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350116, Peoples R China
[2] Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informat, Fuzhou 350116, Peoples R China
[3] Dalhousie Univ, Dept Elect & Comp Engn, Halifax, NS B3J 1Z1, Canada
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 18期
基金
中国国家自然科学基金;
关键词
Network intrusion detection; Anomaly detection; Semi-supervised learning; Ensemble learning; Class imbalance; INTRUSION DETECTION; SYSTEMS; FRAMEWORK; FOREST;
D O I
10.1007/s11227-023-05474-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of network technology, the Internet has brought significant convenience to various sectors of society, holding a prominent position. Due to the unpredictable and severe consequences resulting from malicious attacks, the detection of anomalous network traffic has garnered considerable attention from researchers over the past few decades. Accurately labeling a sufficient amount of network traffic data as a training dataset within a short period of time is a challenging task, given the rapid and massive generation of network traffic data. Furthermore, the proportion of malicious attack traffic is relatively small compared to the overall traffic data, and the distribution of traffic data across different types of malicious attacks also varies significantly. To address the aforementioned challenges, this paper presents a novel network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing. Building upon the assumption of consistent distribution between labeled and unlabeled data, this paper introduces the multiclass split balancing strategy and the adaptive confidence threshold function. These innovative approaches aim to tackle the issue of the multiclass imbalanced in traffic data. By leveraging the mutually beneficial relationship between semi-supervised learning and ensemble learning, this paper presents the collaborative rotation forest algorithm. This algorithm is specifically designed to enhance performance of anomaly detection in an environment with label inadequacy. Several comparative experiments conducted on the NSL-KDD, UNSW-NB15, and ToN-IoT demonstrate that the proposed algorithm achieves significant improvements in performance. Specifically, it enhances precision by 1.5-5.7%, recall by 1.5-5.7%, and F-Measure by 1.4-4.3% compared to the state-of-the-art algorithms.
引用
收藏
页码:20445 / 20480
页数:36
相关论文
共 50 条
  • [21] Semi-supervised Machine Learning for Textual Anomaly Detection
    Steyn, Carl
    de Waal, Alta
    2016 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2016,
  • [22] Deceptive reviews detection based on semi-supervised learning algorithm
    Ren, Yafeng
    Ji, Donghong
    Yin, Lan
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2014, 46 (03): : 62 - 69
  • [23] Network Intrusion Detection Based on Active Semi-supervised Learning
    Zhang, Yong
    Niu, Jie
    He, Guojian
    Zhu, Lin
    Guo, Da
    51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN-W 2021), 2021, : 129 - 135
  • [24] An anomaly intrusion detection algorithm based on minimal diversity semi-supervised clustering
    Wang, Juan
    Zhang, Ke
    Ren, Da-sen
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 525 - 528
  • [25] Extreme semi-supervised learning for multiclass classification
    Chen, Chuangquan
    Gan, Yanfen
    Vong, Chi-Man
    NEUROCOMPUTING, 2020, 376 : 103 - 118
  • [26] PUNet: A Semi-Supervised Anomaly Detection Model for Network Anomaly Detection Based on Positive Unlabeled Data
    Long, Gang
    Zhang, Zhaoxin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (01): : 327 - 343
  • [27] Semi-supervised log anomaly detection based on bidirectional temporal convolution network
    Yin, Zhichao
    Kong, Xian
    Yin, Chunyong
    COMPUTERS & SECURITY, 2024, 140
  • [28] A Deep-Convolutional-Neural-Network-Based Semi-Supervised Learning Method for Anomaly Crack Detection
    Gao, Xingjun
    Huang, Chuansheng
    Teng, Shuai
    Chen, Gongfa
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [29] Semi-supervised Graph Edge Convolutional Network for Anomaly Detection
    Lun, Zhicheng
    Gu, Xiaoyan
    Fan, Haihui
    Li, Bo
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 141 - 152
  • [30] Semi-Supervised Time Series Anomaly Detection Based on Statistics and Deep Learning
    Jiang, Jehn-Ruey
    Kao, Jian-Bin
    Li, Yu-Lin
    APPLIED SCIENCES-BASEL, 2021, 11 (15):