Using extreme gradient boosting (XGBoost) machine learning to predict construction cost overruns

被引:2
|
作者
Coffie, G. H. [1 ]
Cudjoe, S. K. F. [2 ]
机构
[1] Ho Tech Univ, Dept Bldg Technol, Ho, Volta Region, Ghana
[2] Univ Witwatersrand, Wits Business Sch, Johannesburg, South Africa
关键词
Cost overruns; machine learning; construction; extreme gradient boost; modelling; PROJECTS; DELAYS; SUCCESS;
D O I
10.1080/15623599.2023.2289754
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Continuously, cost overruns in construction projects, as a leading cause of project failure, have been attracting more and more attention among construction stakeholders. Notably, cost overrun prediction model development can help identify factors that lead to cost overruns, thereby substantially improving cost estimates. Meanwhile, a machine learning application on archival data to estimate construction cost overrun is still in development. Motivated by this, we applied an Extreme Gradient Boosting (XGBoost) machine to analyze historical data of construction projects in Ghana completed between 2016 and 2018. The comparison between the actual and predicted cost yielded a good model prediction. The RMSE, MSE, MAE, and MAPE values are 0.202, 0.041, 0.069, and 0.306, respectively. To visually explain the importance of each feature for cost overrun prediction, we used SHAP values to illustrate the effect of each feature for model interpretability. According to SHAP ranking, we discover that the initial contract amount, the number of storeys, scope changes, and the initial duration are the variables that most accurately predict project completion costs and cost overruns. This research explores an innovative way to understand and evaluate essential variables that can help develop a prediction model of cost overruns that could aid the construction industry's cost estimation.
引用
收藏
页码:1742 / 1750
页数:9
相关论文
共 50 条
  • [31] Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)
    Kavzoglu, Taskin
    Teke, Alihan
    BULLETIN OF ENGINEERING GEOLOGY AND THE ENVIRONMENT, 2022, 81 (05)
  • [33] Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)
    Taskin Kavzoglu
    Alihan Teke
    Bulletin of Engineering Geology and the Environment, 2022, 81
  • [34] A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers
    Rahman, Md Siddikur
    Chowdhury, Arman Hossain
    PLOS ONE, 2022, 17 (09):
  • [35] Dealing with construction cost overruns using data mining
    Ahiaga-Dagbui, Dominic D.
    Smith, Simon D.
    CONSTRUCTION MANAGEMENT AND ECONOMICS, 2014, 32 (7-8) : 682 - 694
  • [36] In-Vehicle Network Anomaly Detection Using Extreme Gradient Boosting Machine
    Anjum, Afia
    Agbaje, Paul
    Hounsinou, Sena
    Olufowobi, Habeeb
    2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 100 - 105
  • [37] Lithium-Ion Battery Estimation in Online Framework Using Extreme Gradient Boosting Machine Learning Approach
    Jafari, Sadiqa
    Shahbazi, Zeinab
    Byun, Yung-Cheol
    Lee, Sang-Joon
    MATHEMATICS, 2022, 10 (06)
  • [38] Time Series Analysis and Forecasting of Solar Generation in Spain Using eXtreme Gradient Boosting: A Machine Learning Approach
    Saigustia, Candra
    Pijarski, Pawel
    ENERGIES, 2023, 16 (22)
  • [39] Machine learning-based model for accurate identification of druggable proteins using light extreme gradient boosting
    Alghushairy, Omar
    Ali, Farman
    Alghamdi, Wajdi
    Khalid, Majdi
    Alsini, Raed
    Asiry, Othman
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024, 42 (22): : 12330 - 12341
  • [40] Explainable AI to Predict Male Fertility Using Extreme Gradient Boosting Algorithm with SMOTE
    GhoshRoy, Debasmita
    Alvi, Parvez Ahmad
    Santosh, K. C.
    ELECTRONICS, 2023, 12 (01)