Feature selection for optimizing traffic classification

被引:96
|
作者
Zhang, Hongli [1 ]
Lu, Gang [1 ]
Qassrawi, Mahmoud T. [1 ]
Zhang, Yu [1 ]
Yu, Xiangzhan [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Traffic classification; Class imbalance; Robust features; IDENTIFICATION;
D O I
10.1016/j.comcom.2012.04.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) algorithms have been widely applied in recent traffic classification. However, due to the imbalance in the number of traffic flows, ML based classifiers are prone to misclassify flows as the traffic type that occupies the majority of flows on the Internet. To address the problem, a novel feature selection metric named Weighted Symmetrical Uncertainty (WSU) is proposed. We design a hybrid feature selection algorithm named WSU_AUC, which prefilters most of features with WSU metric and further uses a wrapper method to select features for a specific classifier with Area Under roc Curve (AUC) metric. Additionally, to overcome the impacts of dynamic traffic flows on feature selection, we propose an algorithm named SRSF that Selects the Robust and Stable Features from the results achieved by WSU_AUC. We evaluate our approaches using three classifiers on the traces captured from entirely different networks. Experimental results obtained by our algorithms are promising in terms of true positive rate (TPR) and false positive rate (FPR). Moreover, our algorithms can achieve >94% flow accuracy and >80% byte accuracy on average. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1457 / 1471
页数:15
相关论文
共 50 条
  • [41] Optimizing Feature Selection and Oversampling Using Metaheuristic Algorithms for Binary Fraud Detection Classification
    Biltawi, Mariam M.
    Qaddoura, Raneem
    Faris, Hossam
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 452 - 462
  • [42] Optimizing Endotracheal Suctioning Classification: Leveraging Prompt Engineering in Machine Learning for Feature Selection
    Islam, Mahera Roksana
    Ferdous, Anik Mahmud
    Hossain, Shahera
    Ahad, Md Atiqur Rahman
    Alnajjar, Fady
    2024 INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING, ABC 2024, 2024,
  • [43] Optimizing Urban Traffic Incident Prediction With Vertical Federated Learning: A Feature Selection Based Approach
    Hussain, Basharat
    Afzal, Muhammad Khalil
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (01): : 145 - 155
  • [44] A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark
    Wang, Yong
    Ke, Wenlong
    Tao, Xiaoling
    INFORMATION, 2016, 7 (01)
  • [45] Consistency measure based simultaneous feature selection and instance purification for multimedia traffic classification
    Wu, Zheng
    Dong, Yu-ning
    Wei, Hua-Liang
    Tian, Wei
    COMPUTER NETWORKS, 2020, 173 (173)
  • [46] Enhancing The Performance of Network Traffic Classification Methods Using Efficient Feature Selection Models
    Alam, Farzana
    Kashef, Rasha
    Jaseemuddin, Muhammad
    2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
  • [47] On feature selection for traffic congestion prediction
    Yang, Su
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2013, 26 : 160 - 169
  • [48] Feature Selection to Traffic Flow Forecasting
    Sun, Zhanquan
    Wang, Yinglong
    Pan, Jingshan
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 2864 - 2869
  • [49] Feature Selection for Classification with QAOA
    Turati, Gloria
    Dacrema, Maurizio Ferrari
    Cremonesi, Paolo
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2022), 2022, : 782 - 785
  • [50] Feature Selection for Collective Classification
    Senliol, Baris
    Aral, Atakan
    Cataltepe, Zehra
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 285 - 290