Studying cost-sensitive learning for multi-class imbalance in Internet traffic classification

被引:0
|
作者
LIU Zhen [1 ]
LIU Qiong [2 ]
机构
[1] School of Soft Engineering,South China University of Technology
[2] School of Computer Science and Engineering,South China University of
关键词
D O I
暂无
中图分类号
TP393.06 [];
学科分类号
摘要
Cost-sensitive learning has been applied to resolve the multi-class imbalance problem in Internet traffic classification and it has achieved considerable results.But the classification performance on the minority classes with a few bytes is still unhopeful because the existing research only focuses on the classes with a large amount of bytes.Therefore,the class-dependent misclassification cost is studied.Firstly,the flow rate based cost matrix(FCM) is investigated.Secondly,a new cost matrix named weighted cost matrix(WCM) is proposed,which calculates a reasonable weight for each cost of FCM by regarding the data imbalance degree and classification accuracy of each class.It is able to further improve the classification performance on the difficult minority class(the class with more flows but worse classification accuracy).Experimental results on twelve real traffic datasets show that FCM and WCM obtain more than 92% flow g-mean and 80% byte g-mean on average;on the test set collected one year later,WCM outperforms FCM in terms of stability.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [41] Cost-sensitive collaborative representation based classification via probability estimation with addressing the class imbalance
    Liu, Zhenbing
    Ma, Chao
    Gao, Chunyang
    Yang, Huihua
    Lan, Rushi
    Luo, Xiaonan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (09) : 10835 - 10851
  • [42] Two-Stage Cost-Sensitive Learning for Data Streams With Concept Drift and Class Imbalance
    Sun, Yange
    Sun, Yi
    Dai, Honghua
    IEEE ACCESS, 2020, 8 : 191942 - 191955
  • [43] Class probability estimation and cost-sensitive classification decisions
    Margineantu, DD
    MACHINE LEARNING: ECML 2002, 2002, 2430 : 270 - 281
  • [44] Multi-view cost-sensitive kernel learning for imbalanced classification problem
    Tang, Jingjing
    Hou, Zhaojie
    Yu, Xiaotong
    Fu, Saiji
    Tian, Yingjie
    NEUROCOMPUTING, 2023, 552
  • [45] Cost-sensitive ensemble learning algorithm for multi-label classification problems
    Fu, Z.-L. (fzliang@netease.com), 1600, Science Press (40):
  • [46] Effect of Feature Selection on Performance of Internet Traffic Classification on NIMS Multi-Class dataset
    Oluranti, Jonathan
    Omoregbe, Nicholas
    Misra, Sanjay
    3RD INTERNATIONAL CONFERENCE ON SCIENCE AND SUSTAINABLE DEVELOPMENT (ICSSD 2019): SCIENCE, TECHNOLOGY AND RESEARCH: KEYS TO SUSTAINABLE DEVELOPMENT, 2019, 1299
  • [47] ARConvL: Adaptive Region-Based Convolutional Learning for Multi-class Imbalance Classification
    Li, Shuxian
    Song, Liyan
    Wu, Xiaoyu
    Hu, Zheng
    Cheung, Yiu-ming
    Yao, Xin
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 103 - 120
  • [48] Multi-label thresholding for cost-sensitive classification
    Alotaibi, Reem
    Flach, Peter
    NEUROCOMPUTING, 2021, 436 : 232 - 247
  • [49] Inter-class margin climbing with cost-sensitive learning in neural network classification
    Zhang, Siyuan
    Xie, Linbo
    Chen, Ying
    Zhang, Shanxin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1993 - 2016
  • [50] Cost-sensitive decision tree learning for forensic classification
    Davis, Jason V.
    Ha, Jungwoo
    Rossbach, Christopher J.
    Ramadan, Hany E.
    Witchel, Emmett
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 622 - 629