The Effect of Hyperparameter Optimization on the Estimation of Performance Metrics in Network Traffic Prediction using the Gradient Boosting Machine Model

被引:2
|
作者
Mwita, Machoke [1 ]
Mbelwa, Jimmy [2 ]
Agbinya, Johnson [3 ]
Sam, Anael Elikana [4 ]
机构
[1] Nelson Mandela African Inst Sci & Technol, Dept Informat Technol Dev & Management ITDM, Sch Computat & Commun Sci & Engn, Arusha, Tanzania
[2] Univ Dar Es Salaam, Dar Es Salaam, Tanzania
[3] Melbourne Inst Technol, Sch Informat Technol & Engn, Melbourne, Australia
[4] Nelson Mandela African Inst Sci & Technol, Dept Commun Sci & Engn CoSE, Sch Computat & Commun Sci & Engn CoCSE, Arusha, Tanzania
关键词
network traffic; machine learning; big data; data loggers; feature selection; gradient boosting machine prediction; ALGORITHMS;
D O I
10.48084/etasr.5548
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Information and Communication Technology (ICT) has changed the way we communicate and access information, resulting in the high generation of heterogeneous data. The amount of network traffic generated constantly increases in velocity, veracity, and volume as we enter the era of big data. Network traffic classification and intrusion detection are very important for the early detection and identification of unnecessary network traffic. The Machine Learning (ML) approach has recently entered the center stage in network traffic accurate classification. However, in most cases, it does not apply model hyperparameter optimization. In this study, gradient boosting machine prediction was used with different hyperparameter optimization configurations, such as interaction depth, tree number, learning rate, and sampling. Data were collected through an experimental setup by using the Sophos firewall and Cisco router data loggers. Data analysis was conducted with R software version 4.2.0 with Rstudio Integrated Development Environment. The dataset was split into two partitions, where 70% was used for training the model and 30% for testing. At a learning rate of 0.1, interaction depth of 14, and tree number of 2500, the model estimated the highest performance metrics with an accuracy of 0.93 and R of 0.87 compared to 0.90 and 0.85 before model optimization. The same configuration attained the minimum classification error of 0.07 than 0.10 before model optimization. After model tweaking, a method was developed for achieving improved accuracy, R square, mean decrease in Gini coefficients for more than 8 features, lower classification error, root mean square error, logarithmic loss, and mean square error in the model.
引用
收藏
页码:10714 / 10720
页数:7
相关论文
共 50 条
  • [1] An efficient churn prediction model using gradient boosting machine and metaheuristic optimization
    Alshourbaji, Ibrahim
    Helian, Na
    Sun, Yi
    Hussien, Abdelazim G.
    Abualigah, Laith
    Elnaim, Bushra
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] An efficient churn prediction model using gradient boosting machine and metaheuristic optimization
    Ibrahim AlShourbaji
    Na Helian
    Yi Sun
    Abdelazim G. Hussien
    Laith Abualigah
    Bushra Elnaim
    Scientific Reports, 13
  • [3] Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)
    Taskin Kavzoglu
    Alihan Teke
    Bulletin of Engineering Geology and the Environment, 2022, 81
  • [4] Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)
    Kavzoglu, Taskin
    Teke, Alihan
    BULLETIN OF ENGINEERING GEOLOGY AND THE ENVIRONMENT, 2022, 81 (05)
  • [5] On using eXtreme Gradient Boosting (XGBoost) Machine Learning algorithm for Home Network Traffic Classification
    Cherif, Iyad Lahsen
    Kortebi, Abdesselem
    2019 WIRELESS DAYS (WD), 2019,
  • [6] Neural network architecture based on gradient boosting for IoT traffic prediction
    Lopez-Martin, Manuel
    Carro, Belen
    Sanchez-Esguevillas, Antonio
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 100 : 656 - 673
  • [7] Hyperparameter estimation of a variational model using a stochastic gradient method
    Zerubia, J
    Blanc-Féraud, L
    BAYESIAN INFERENCE FOR INVERSE PROBLEMS, 1998, 3459 : 349 - 356
  • [8] Prediction of Ecofriendly Concrete Compressive Strength Using Gradient Boosting Regression Tree Combined with GridSearchCV Hyperparameter-Optimization Techniques
    Alhakeem, Zaineb M.
    Jebur, Yasir Mohammed
    Henedy, Sadiq N.
    Imran, Hamza
    Bernardo, Luis F. A.
    Hussein, Hussein M.
    MATERIALS, 2022, 15 (21)
  • [9] PM2.5 concentration estimation using convolutional neural network and gradient boosting machine
    Zhenyu Luo
    Feifan Huang
    Huan Liu
    Journal of Environmental Sciences, 2020, 98 (12) : 85 - 93
  • [10] PM2.5 concentration estimation using convolutional neural network and gradient boosting machine
    Luo, Zhenyu
    Huang, Feifan
    Liu, Huan
    JOURNAL OF ENVIRONMENTAL SCIENCES, 2020, 98 : 85 - 93