Benchmarking the benchmark - Comparing synthetic and real-world Network IDS datasets

被引:2
|
作者
Layeghy, Siamak [1 ]
Gallagher, Marcus [1 ]
Marius, Portmann [1 ]
机构
[1] Univ Queensland, Sch ITEE, Brisbane, Qld 4072, Australia
关键词
Network traffic characteristics; Feature distribution; Network Intrusion System (NIDS) dataset; Real-world NIDS dataset; Synthetic NIDS dataset; Machine learning benchmark dataset; ANOMALY DETECTION;
D O I
10.1016/j.jisa.2023.103689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network Intrusion Detection Systems (NIDSs) are an increasingly important tool for the prevention and mitigation of cyber attacks. Over the past years, a lot of research efforts have aimed at leveraging the increasingly powerful models of Machine Learning (ML) for this purpose. A number of labelled synthetic datasets have been generated and made publicly available by researchers, and they have become the benchmarks via which new ML -based NIDS classifiers are being evaluated. Recently published results show excellent classification performance with these datasets, increasingly approaching 100 percent performance across key evaluation metrics such as Accuracy, F1 score, AUC, etc. Unfortunately, we have not yet seen these excellent academic research results translated into practical NIDS systems with such near -perfect performance. This motivated our research presented in this paper, where we analyse the statistical properties of the benign traffic in three of the more recent and relevant NIDS datasets, (CIC_IDS, UNSW_NB15, TON_IOT), by converting them into a common flow format. As a comparison, we consider two datasets obtained from real -world production networks, one from a university network and one from a medium size Internet Service Provider (ISP). Our results show that the two real -world datasets are quite similar among themselves in regards to most of the considered statistical features. Equally, the three synthetic datasets are also relatively similar within their group. However, and most importantly, our results show a distinct difference of most of the considered statistical features between the three synthetic datasets and the two real -world datasets. Since ML relies on the basic assumption of training and test datasets being sampled from the same distribution, this raises the question of how well the performance results of ML -classifiers trained on the considered synthetic datasets can translate and generalise to real -world networks. We believe this is an interesting and relevant question which provides motivation for further research in this space.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Investigating the optimisation of real-world and synthetic object detection training datasets through the consideration of environmental and simulation factors
    Newman, Callum
    Petzing, Jon
    Goh, Yee Mey
    Justham, Laura
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
  • [42] Large language models generating synthetic clinical datasets: a feasibility and comparative analysis with real-world perioperative data
    Barr, Austin A.
    Quan, Joshua
    Guo, Eddie
    Sezgin, Emre
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2025, 8
  • [43] Advancing Real-World Burst Denoising: A New Benchmark and Dual-Branch Burst Denoising Network
    Wu, Huilei
    Zhao, Qing
    Song, Zhiyuan
    Wei, Pengxu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 266 - 278
  • [44] Neuropsychiatric Adverse Events of Montelukast: An Analysis of Real-World Datasets and drug-gene Interaction Network
    Umetsu, Ryogo
    Tanaka, Mizuki
    Nakayama, Yoko
    Kato, Yamato
    Ueda, Natsumi
    Nishibata, Yuri
    Hasegawa, Shiori
    Matsumoto, Kiyoka
    Takeyama, Noriaki
    Iguchi, Kazuhiro
    Tanaka, Hiroyuki
    Hinoi, Eiichi
    Inagaki, Naoki
    Inden, Masatoshi
    Muto, Yoshinori
    Nakamura, Mitsuhiro
    FRONTIERS IN PHARMACOLOGY, 2021, 12
  • [45] EFFECTIVENESS OF BIOLOGICS IN ASTHMA: COMPARING REAL-WORLD EVIDENCE
    Adams, Sandra G.
    Caceres, Diego J. Maselli
    Durg, Sharanbasappa
    Dubucq, Hugo
    Ritter, Janet
    Pandit-Abid, Nami
    Ledanois, Olivier
    Wang, Zhixiao
    Cheng, Wei-Han
    CHEST, 2024, 166 (04) : 77A - 78A
  • [46] Comparing ICP variants on real-world data sets
    Pomerleau, Francois
    Colas, Francis
    Siegwart, Roland
    Magnenat, Stephane
    AUTONOMOUS ROBOTS, 2013, 34 (03) : 133 - 148
  • [47] Statement of the network Real-world Labs of Sustainability on the real-world labs law initiative in Germany
    Parodi, Oliver
    Schwichtenberg, Roy
    Stelzer, Franziska
    Rhodius, Regina
    Schreider, Claudia
    von Wirth, Timo
    Lang, Daniel J.
    Marg, Oskar
    Wagner, Felix
    Egermann, Markus
    Bauknecht, Dierk
    Wanner, Matthias
    GAIA-ECOLOGICAL PERSPECTIVES FOR SCIENCE AND SOCIETY, 2023, 32 (04): : 399 - 401
  • [48] TaintBench: Automatic real-world malware benchmarking of Android taint analyses
    Luo, Linghui
    Pauck, Felix
    Piskachev, Goran
    Benz, Manuel
    Pashchenko, Ivan
    Mory, Martin
    Bodden, Eric
    Hermann, Ben
    Massacci, Fabio
    EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (01)
  • [49] Validation and Benchmarking of a Wearable EEG Acquisition Platform for Real-World Applications
    Valentin, Olivier
    Ducharme, Mikael
    Cretot-Richert, Gabrielle
    Monsarrat-Chanon, Hami
    Viallet, Guilhem
    Delnavaz, Aidin
    Voix, Jeremie
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2019, 13 (01) : 103 - 111
  • [50] TaintBench: Automatic real-world malware benchmarking of Android taint analyses
    Linghui Luo
    Felix Pauck
    Goran Piskachev
    Manuel Benz
    Ivan Pashchenko
    Martin Mory
    Eric Bodden
    Ben Hermann
    Fabio Massacci
    Empirical Software Engineering, 2022, 27