A novel semi-supervised approach for network traffic clustering

Cited by: 29
Authors
Wang Y. [1]
Xiang Y. [1]
Zhang J. [1]
Yu S. [2]
Affiliations
[1] School of Information Technology, Deakin University, Melbourne
[2] Department of Electronic and Communication Engineering, Sun Yat-Sen University, Guangzhou
Keywords
constrained clustering; constraints; machine learning; semi-supervised learning; traffic classification
DOI
10.1109/ICNSS.2011.6059997
Abstract
Network traffic classification is an essential component of network management and security systems. To address the limitations of traditional port-based and payload-based methods, recent studies have focused on alternative approaches. One promising direction is to apply machine learning techniques that classify traffic flows based on packet-level and flow-level statistics. In particular, previous papers have shown that clustering can achieve high accuracy and discover unknown application classes. In this work, we present a novel semi-supervised learning method using constrained clustering algorithms. The motivation is that in the network domain, a lot of background information is available in addition to the data instances themselves. For example, we might know that flows f1 and f2 use the same application protocol because they visit the same host address at the same port simultaneously. In this case, f1 and f2 should ideally be grouped into the same cluster. Therefore, we describe these correlations in the form of pair-wise must-link constraints and incorporate them into the clustering process. We have applied three constrained variants of the K-Means algorithm, which perform hard constraint satisfaction, soft constraint satisfaction, and metric learning from constraints. A number of real-world traffic traces have been used to show the availability of constraints and to test the proposed approach. The experimental results indicate that incorporating constraints during clustering significantly improves the overall accuracy and cluster purity. © 2011 IEEE.
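The must-link idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the flow tuple layout, the `window` parameter, and the function names `must_link_pairs`, `link_groups`, and `constrained_kmeans` are all assumptions made for illustration. The clustering step is a simple hard-constraint variant of K-Means that assigns each must-link group as a unit, not any of the three variants the paper evaluates.

```python
from collections import defaultdict
from itertools import combinations
import math


def must_link_pairs(flows, window=1.0):
    """Derive must-link pairs from flows that hit the same endpoint close in time.

    flows: list of (start_time, dst_ip, dst_port, proto) tuples (assumed layout).
    Returns a set of (i, j) index pairs with i < j.
    """
    by_endpoint = defaultdict(list)
    for idx, (start, dst_ip, dst_port, proto) in enumerate(flows):
        by_endpoint[(dst_ip, dst_port, proto)].append((start, idx))
    pairs = set()
    for members in by_endpoint.values():
        for (t1, i), (t2, j) in combinations(sorted(members), 2):
            if abs(t1 - t2) <= window:
                pairs.add((min(i, j), max(i, j)))
    return pairs


def link_groups(n, pairs):
    """Merge must-link pairs into groups (transitive closure) with union-find."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in pairs:
        parent[find(i)] = find(j)
    groups = defaultdict(list)
    for i in range(n):
        groups[find(i)].append(i)
    return list(groups.values())


def constrained_kmeans(points, groups, k, iters=20):
    """K-Means where every must-link group is assigned to one cluster as a unit.

    Assigning a group to the centroid nearest its mean minimises the group's
    total squared distance, so the constraints are always satisfied.
    Assumes len(groups) >= k; centroids start at the first k group means.
    """
    dim = len(points[0])

    def mean(idxs):
        return [sum(points[i][d] for i in idxs) / len(idxs) for d in range(dim)]

    group_means = [mean(g) for g in groups]
    centroids = [list(m) for m in group_means[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: one decision per must-link group.
        for g, m in zip(groups, group_means):
            best = min(range(k), key=lambda c: math.dist(m, centroids[c]))
            for i in g:
                labels[i] = best
        # Update step: recompute each non-empty cluster's centroid.
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if members:
                centroids[c] = mean(members)
    return labels
```

A typical call chain would be `pairs = must_link_pairs(flows)`, then `groups = link_groups(len(flows), pairs)`, then `constrained_kmeans(features, groups, k)`, where `features` holds one statistical feature vector per flow. Treating constraints as hard keeps the sketch short; a soft variant would instead add a penalty for each violated pair.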
Pages: 169-175
Number of pages: 6