A novel semi-supervised approach for network traffic clustering

被引:29
|
作者
Wang Y. [1 ]
Xiang Y. [1 ]
Zhang J. [1 ]
Yu S. [2 ]
机构
[1] School of Information Technology, Deakin University, Melbourne
[2] Department of Electronic and Communication Engineering, Sun Yat-Sen University, Guangzhou
关键词
constrained clustering; constraints; machine learning; semi-supervised learning; traffic classification;
D O I
10.1109/ICNSS.2011.6059997
中图分类号
学科分类号
摘要
Network traffic classification is an essential component for network management and security systems. To address the limitations of traditional port-based and payload-based methods, recent studies have been focusing on alternative approaches. One promising direction is applying machine learning techniques to classify traffic flows based on packet and flow level statistics. In particular, previous papers have illustrated that clustering can achieve high accuracy and discover unknown application classes. In this work, we present a novel semi-supervised learning method using constrained clustering algorithms. The motivation is that in network domain a lot of background information is available in addition to the data instances themselves. For example, we might know that flow f1 and f2 are using the same application protocol because they are visiting the same host address at the same port simultaneously. In this case, f1 and f2 shall be grouped into the same cluster ideally. Therefore, we describe these correlations in the form of pair-wise must-link constraints and incorporate them in the process of clustering. We have applied three constrained variants of the K-Means algorithm, which perform hard or soft constraint satisfaction and metric learning from constraints. A number of real-world traffic traces have been used to show the availability of constraints and to test the proposed approach. The experimental results indicate that by incorporating constraints in the course of clustering, the overall accuracy and cluster purity can be significantly improved. © 2011 IEEE.
引用
收藏
页码:169 / 175
页数:6
相关论文
共 50 条
  • [1] A Novel Approach for Semi-Supervised Network Traffic Classification
    Huo, Yonghua
    Song, Chunxiao
    Zhou, Meichao
    Lv, Rui
    Yang, Yang
    2022 IEEE 14TH INTERNATIONAL CONFERENCE ON ADVANCED INFOCOMM TECHNOLOGY (ICAIT 2022), 2022, : 64 - 69
  • [2] Clustering Network Traffic Using Semi-Supervised Learning
    Krajewska, Antonina
    Niewiadomska-Szynkiewicz, Ewa
    ELECTRONICS, 2024, 13 (14)
  • [3] Network traffic classification based on semi-supervised clustering
    Information Security Center, State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    不详
    不详
    不详
    J. China Univ. Post Telecom., SUPPL. 2 (84-88):
  • [4] Semi-Supervised Network Traffic Classification
    Erman, Jeffrey
    Mahanti, Anirban
    Arlitt, Martin
    Cohen, Ira
    Williamson, Carey
    SIGMETRICS'07: PROCEEDINGS OF THE 2007 INTERNATIONAL CONFERENCE ON MEASUREMENT & MODELING OF COMPUTER SYSTEMS, 2007, 35 (01): : 369 - 370
  • [5] A federated semi-supervised learning approach for network traffic classification
    Jin, Zhiping
    Liang, Zhibiao
    He, Meirong
    Peng, Yao
    Xue, Hanxiao
    Wang, Yu
    INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2023, 33 (03)
  • [6] A Semi-supervised Stacked Autoencoder Approach for Network Traffic Classification
    Aouedi, Ons
    Piamrat, Kandaraj
    Bagadthey, Dhruvjyoti
    2020 IEEE 28TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP 2020), 2020,
  • [7] Spectral clustering: A semi-supervised approach
    Chen, Weifu
    Feng, Guocan
    NEUROCOMPUTING, 2012, 77 (01) : 229 - 242
  • [8] A SUPERVISORY APPROACH TO SEMI-SUPERVISED CLUSTERING
    Conroy, Bryan
    Xi, Yongxin Taylor
    Ramadge, Peter
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1858 - 1861
  • [9] A Novel Multiple Kernel Learning Approach for Semi-Supervised Clustering
    Zare, T.
    Sadeghi, M. T.
    Abutalebi, H. R.
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 451 - 456
  • [10] SSSNET: Semi-Supervised Signed Network Clustering
    He, Yixuan
    Reinert, Gesine
    Wang, Songchao
    Cucuringu, Mihai
    PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2022, : 244 - 252