Mining anomalies using traffic feature distributions

被引:416
|
作者
Lakhina, A
Crovella, M
Diot, C
机构
[1] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[2] Intel Res, Cambridge, England
关键词
anomaly detection; anomaly classification; network-wide traffic analysis;
D O I
10.1145/1090191.1080118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing practicality of large-scale flow capture makes it possible to conceive of traffic analysis methods that detect and identify a large and diverse set of anomalies. However the challenge of effectively analyzing this massive data source for anomaly diagnosis is as yet unmet. We argue that the distributions of packet features (IP addresses and ports) observed in flow traces reveals both the presence and the structure of a wide range of anomalies. Using entropy as a summarization tool, we show that the analysis of feature distributions leads to significant advances on two fronts: (1) it enables highly sensitive detection of a wide range of anomalies, augmenting detections by volume-based methods, and (2) it enables automatic classification of anomalies via unsupervised learning. We show that using feature distributions, anomalies naturally fall into distinct and meaningful clusters. These clusters can be used to automatically classify anomalies and to uncover new anomaly types. We validate our claims on data from two backbone networks (Abilene and Geant) and conclude that feature distributions show promise as a key element of a fairly general network anomaly diagnosis framework.
引用
收藏
页码:217 / 228
页数:12
相关论文
共 50 条
  • [1] A Flow Analysis For Mining Traffic Anomalies
    Kanda, Yoshiki
    Fukuda, Kensuke
    Sugawara, Toshiharu
    2010 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS - ICC 2010, 2010,
  • [2] Poster: Learning distributions to detect anomalies using all the network traffic
    Dietmuller, Alexander
    Fragkouli, Georgia
    Vanbever, Laurent
    PROCEEDINGS OF THE 2023 ACM SIGCOMM 2023 CONFERENCE, SIGCOMM 2023, 2023, : 1108 - 1110
  • [3] Using Entropy Spaces and Mixtures of Gaussian Distributions to Characterize Traffic Anomalies
    Velarde-Alvarado, Pablo
    Martinez-Herrera, Alberto F.
    Iriarte-Solis, Adalberto
    2012 IBEROAMERICAN CONFERENCE ON ELECTRONICS ENGINEERING AND COMPUTER SCIENCE, 2012, 3 : 97 - 108
  • [4] Uncovering network traffic anomalies based on their sparse distributions
    CHENG GuoZhen
    CHEN HongChang
    CHENG DongNian
    ZHANG Zhen
    LAN JuLong
    Science China(Information Sciences), 2014, 57 (09) : 256 - 266
  • [5] Uncovering network traffic anomalies based on their sparse distributions
    GuoZhen Cheng
    HongChang Chen
    DongNian Cheng
    Zhen Zhang
    JuLong Lan
    Science China Information Sciences, 2014, 57 : 1 - 11
  • [6] Uncovering network traffic anomalies based on their sparse distributions
    Cheng GuoZhen
    Chen HongChang
    Cheng DongNian
    Zhang Zhen
    Lan JuLong
    SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (09) : 1 - 11
  • [7] Feature Mining for Internet Video Traffic Classification
    Yang, Ling-yun
    Dong, Yu-ning
    Wu, Zheng
    Tang, Pingping
    Feng, You-hong
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 441 - 444
  • [8] An early stage convolutional feature extracting method using for mining traffic detection
    Sun, Peifa
    Lyu, Mengda
    Li, Hui
    Yang, Bo
    Peng, Lizhi
    COMPUTER COMMUNICATIONS, 2022, 193 : 346 - 354
  • [9] Detecting semantic anomalies in truck weigh-in-motion traffic data using data mining
    Raz, O
    Buchheit, R
    Shaw, M
    Koopman, P
    Faloutsos, C
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2004, 18 (04) : 291 - 300
  • [10] Online mining of risk level of traffic anomalies with user's feedbacks
    Meng, Yu
    Dunham, Margaret H.
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 176 - +