The Effect of Hyperparameter Optimization on the Estimation of Performance Metrics in Network Traffic Prediction using the Gradient Boosting Machine Model

被引:2
|
作者
Mwita, Machoke [1 ]
Mbelwa, Jimmy [2 ]
Agbinya, Johnson [3 ]
Sam, Anael Elikana [4 ]
机构
[1] Nelson Mandela African Inst Sci & Technol, Dept Informat Technol Dev & Management ITDM, Sch Computat & Commun Sci & Engn, Arusha, Tanzania
[2] Univ Dar Es Salaam, Dar Es Salaam, Tanzania
[3] Melbourne Inst Technol, Sch Informat Technol & Engn, Melbourne, Australia
[4] Nelson Mandela African Inst Sci & Technol, Dept Commun Sci & Engn CoSE, Sch Computat & Commun Sci & Engn CoCSE, Arusha, Tanzania
关键词
network traffic; machine learning; big data; data loggers; feature selection; gradient boosting machine prediction; ALGORITHMS;
D O I
10.48084/etasr.5548
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Information and Communication Technology (ICT) has changed the way we communicate and access information, resulting in the high generation of heterogeneous data. The amount of network traffic generated constantly increases in velocity, veracity, and volume as we enter the era of big data. Network traffic classification and intrusion detection are very important for the early detection and identification of unnecessary network traffic. The Machine Learning (ML) approach has recently entered the center stage in network traffic accurate classification. However, in most cases, it does not apply model hyperparameter optimization. In this study, gradient boosting machine prediction was used with different hyperparameter optimization configurations, such as interaction depth, tree number, learning rate, and sampling. Data were collected through an experimental setup by using the Sophos firewall and Cisco router data loggers. Data analysis was conducted with R software version 4.2.0 with Rstudio Integrated Development Environment. The dataset was split into two partitions, where 70% was used for training the model and 30% for testing. At a learning rate of 0.1, interaction depth of 14, and tree number of 2500, the model estimated the highest performance metrics with an accuracy of 0.93 and R of 0.87 compared to 0.90 and 0.85 before model optimization. The same configuration attained the minimum classification error of 0.07 than 0.10 before model optimization. After model tweaking, a method was developed for achieving improved accuracy, R square, mean decrease in Gini coefficients for more than 8 features, lower classification error, root mean square error, logarithmic loss, and mean square error in the model.
引用
收藏
页码:10714 / 10720
页数:7
相关论文
共 50 条
  • [21] Maximum Latency Prediction Based on Random Forests and Gradient Boosting Machine for AVB Traffic in TSN
    Zhang, Xiaodi
    Li, Dong
    Piao, Jinnan
    IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) : 264 - 268
  • [22] Performance prediction and multi-objective optimization for the Atkinson cycle engine using eXtreme Gradient Boosting
    Sun, Xilei
    Fu, Jianqin
    Zhou, Feng
    Luo, Baojun
    Liu, Jingping
    THERMAL SCIENCE AND ENGINEERING PROGRESS, 2024, 48
  • [23] Network Traffic Prediction Performance Using LSTM
    Yalda, Khirota
    Hamad, Diyar Jamal
    Tapus, Nicolae
    Okumus, Ibrahim Taner
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2024, 27 (3-4): : 336 - 347
  • [24] Prediction of Cardiotoxicity for Breast Cancer Patients Using Light Gradient Boosting Machine
    Jiang, Z.
    Diao, P.
    Liang, Y.
    Dai, K.
    Li, H.
    Wang, H.
    Chen, Y.
    Lu, M.
    Kuang, Y.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [25] Using social network analysis and gradient boosting to develop a soccer win-lose prediction model
    Cho, Yoonjae
    Yoon, Jaewoong
    Lee, Sukjun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 72 : 228 - 240
  • [26] Prediction performance of linear models and gradient boosting machine on complex phenotypes in outbred mice
    Perez, Bruno C.
    Bink, Marco C. A. M.
    Svenson, Karen L.
    Churchill, Gary A.
    Calus, Mario P. L.
    G3-GENES GENOMES GENETICS, 2022, 12 (04):
  • [27] Prediction interval estimation of sinter drum index based on light gradient boosting machine and kernel density estimation
    Xia, Guanglei
    Wu, Zhaoxia
    Liu, Mengyuan
    Jiang, Yushan
    IRONMAKING & STEELMAKING, 2023, 50 (08) : 909 - 920
  • [28] In-Vehicle Network Anomaly Detection Using Extreme Gradient Boosting Machine
    Anjum, Afia
    Agbaje, Paul
    Hounsinou, Sena
    Olufowobi, Habeeb
    2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 100 - 105
  • [29] Length of Stay Prediction Model of Indoor Patients Based on Light Gradient Boosting Machine
    Zeng, Xiangrui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [30] Hyperparameter optimization: a comparative machine learning model analysis for enhanced heart disease prediction accuracy
    Rimal, Yagyanath
    Sharma, Navneet
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55091 - 55107