A comparative analysis of gradient boosting algorithms

Cited by: 0
Authors
Candice Bentéjac
Anna Csörgő
Gonzalo Martínez-Muñoz
Affiliations
[1] University of Bordeaux, College of Science and Technology
[2] Pázmány Péter Catholic University, Faculty of Information Technology and Bionics
[3] Universidad Autónoma de Madrid, Escuela Politécnica Superior
Keywords
XGBoost; LightGBM; CatBoost; Gradient boosting; Random forest; Ensembles of classifiers
DOI
Not available
Abstract
The family of gradient boosting algorithms has recently been extended with several notable proposals (namely XGBoost, LightGBM and CatBoost) that focus on both speed and accuracy. XGBoost is a scalable ensemble technique that has proven to be a reliable and efficient solver of machine learning challenges. LightGBM is an accurate model focused on extremely fast training, achieved through selective sampling of high-gradient instances. CatBoost modifies the computation of gradients to avoid prediction shift and thereby improve the accuracy of the model. This work presents a practical analysis of how these novel variants of gradient boosting work in terms of training speed, generalization performance and hyper-parameter setup. In addition, a comprehensive comparison between XGBoost, LightGBM, CatBoost, random forests and gradient boosting has been performed using carefully tuned models as well as their default settings. The results of this comparison indicate that CatBoost obtains the best generalization accuracy and AUC on the studied datasets, although the differences are small. LightGBM is the fastest of all methods but not the most accurate. XGBoost places second in both accuracy and training speed. Finally, an extensive analysis of the effect of hyper-parameter tuning in XGBoost, LightGBM and CatBoost is carried out using two newly proposed tools.
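The "selective sampling of high-gradient instances" attributed to LightGBM above is usually described as gradient-based one-side sampling (GOSS). The following is a minimal, dependency-free sketch of that idea, not LightGBM's actual implementation: it keeps the instances with the largest absolute gradients, draws a random subset of the remainder, and up-weights that subset to compensate for the subsampling. The function name `goss_sample`, the argument names `top_rate` and `other_rate`, and the toy gradient values are assumptions chosen for illustration.

```python
import random


def goss_sample(gradients, top_rate=0.2, other_rate=0.1, seed=0):
    """Pick training instances with a GOSS-style rule (illustrative sketch).

    Returns (indices, weights): the selected instance indices and the
    weight each one would carry when fitting the next tree.
    """
    n = len(gradients)
    n_top = int(n * top_rate)      # instances kept for their large gradients
    n_other = int(n * other_rate)  # instances drawn at random from the rest

    # Rank instances by absolute gradient, largest first.
    order = sorted(range(n), key=lambda i: abs(gradients[i]), reverse=True)
    top, rest = order[:n_top], order[n_top:]

    # Randomly sample from the small-gradient remainder.
    rng = random.Random(seed)
    sampled = rng.sample(rest, min(n_other, len(rest)))

    # Up-weight the sampled small-gradient instances so that they stand in
    # for the whole small-gradient population.
    amplify = (1.0 - top_rate) / other_rate if other_rate > 0 else 0.0

    indices = top + sampled
    weights = [1.0] * len(top) + [amplify] * len(sampled)
    return indices, weights


# Toy gradients (hypothetical values, one per training instance).
grads = [0.05, -2.0, 0.1, 1.5, -0.02, 0.3, -0.9, 0.01, 0.04, -0.06]
idx, w = goss_sample(grads, top_rate=0.2, other_rate=0.25)
print(idx, w)
```

With ten instances, `top_rate=0.2` keeps the two largest-gradient instances at weight 1, and `other_rate=0.25` draws two of the remaining eight at weight (1 - 0.2) / 0.25 = 3.2, so each tree is fitted on four instances instead of ten.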
Pages: 1937-1967
Page count: 30