Machine learning models and bankruptcy prediction

被引:415
|
作者
Barboza, Flavio [1 ]
Kimura, Herbert [2 ]
Altman, Edward [3 ]
机构
[1] Univ Fed Uberlandia, BR-38408100 Uberlandia, MG, Brazil
[2] Univ Brasilia, Dept Management, Campus Darcy Ribeiro, BR-70910900 Brasilia, DF, Brazil
[3] NYU, Leonard N Stern Sch Business, New York, NY 10012 USA
关键词
Bankruptcy prediction; Machine learning; Support vector machines; Boosting; Bagging; Random forest; SUPPORT VECTOR MACHINES; CREDIT RISK; BOOSTING ALGORITHM; DEFAULT PREDICTION; FINANCIAL RATIOS; RANDOM FORESTS; CLASSIFICATION; SELECTION; ENSEMBLE; PROBABILITY;
D O I
10.1016/j.eswa.2017.04.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been intensive research from academics and practitioners regarding models for predicting bankruptcy and default events, for credit risk management. Seminal academic research has evaluated bankruptcy using traditional statistics techniques (e.g. discriminant analysis and logistic regression) and early artificial intelligence models (e.g. artificial neural networks). In this study, we test machine learning models (support vector machines, bagging, boosting, and random forest) to predict bankruptcy one year prior to the event, and compare their performance with results from discriminant analysis, logistic regression, and neural networks. We use data from 1985 to 2013 on North American firms, integrating information from the Salomon Center database and Compustat, analysing more than 10,000 firm-year observations. The key insight of the study is a substantial improvement in prediction accuracy using machine learning techniques especially when, in addition to the original Altman's Z-score variables, we include six complementary financial indicators. Based on Carton and Hofer (2006), we use new variables, such as the operating margin, change in return-on-equity, change in price-to-book, and growth measures related to assets, sales, and number of employees, as predictive variables. Machine learning models show, on average, approximately 10% more accuracy in relation to traditional models. Comparing the best models, with all predictive variables, the machine learning technique related to random forest led to 87% accuracy, whereas logistic regression and linear discriminant analysis led to 69% and 50% accuracy, respectively, in the testing sample. We find that bagging, boosting, and random forest models outperform the others techniques, and that all prediction accuracy in the testing sample improves when the additional variables are included. Our research adds to the discussion of the continuing debate about superiority of computational methods over statistical techniques such as in Tsai, Hsu, and Yen (2014) and Yeh, Chi, and Lin (2014). In particular, for machine learning mechanisms, we do not find SVM to lead to higher accuracy rates than other models. This result contradicts outcomes from Danenas and Garsva (2015) and Cleofas-Sanchez, Garcia, Marques, and Senchez (2016), but corroborates, for instance, Wang, Ma, and Yang (2014), Liang, Lu, Tsai, and Shih (2016), and Cano et al. (2017). Our study supports the applicability of the expert systems by practitioners as in Heo and Yang (2014), Kim, Kang, and Kim (2015) and Xiao, Xiao, and Wang (2016). (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:405 / 417
页数:13
相关论文
共 50 条
  • [1] Explainability of Machine Learning Models for Bankruptcy Prediction
    Park, Min Sue
    Son, Hwijae
    Hyun, Chongseok
    Hwang, Hyung Ju
    IEEE ACCESS, 2021, 9 : 124887 - 124899
  • [2] Machine-learning models for bankruptcy prediction: do industrial variables matter?
    Bragoli, Daniela
    Ferretti, Camilla
    Ganugi, Piero
    Marseguerra, Giovanni
    Mezzogori, Davide
    Zammori, Francesco
    SPATIAL ECONOMIC ANALYSIS, 2022, 17 (02) : 156 - 177
  • [3] Comparative Analysis of Machine Learning Models for Bankruptcy Prediction in the Context of Pakistani Companies
    Mate, Domician
    Raza, Hassan
    Ahmad, Ishtiaq
    RISKS, 2023, 11 (10)
  • [4] Bankruptcy Prediction Using Machine Learning Techniques
    Shetty, Shekar
    Musa, Mohamed
    Bredart, Xavier
    JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2022, 15 (01)
  • [5] Comprehensive Evaluation of Bankruptcy Prediction in Taiwanese Firms Using Multiple Machine Learning Models
    Pham, Hung, V
    Chu, Tuan
    Le, Tuan M.
    Tran, Hieu M.
    Tran, Huong T. K.
    Yen, Khanh N.
    Dao, Son V. T.
    INTERNATIONAL JOURNAL OF TECHNOLOGY, 2025, 16 (01) : 289 - 309
  • [6] An Improved Kernel Extreme Learning Machine for Bankruptcy Prediction
    Wang, Ming-Jing
    Chen, Hui-Ling
    Zhu, Bin-Lei
    Li, Qiang
    Wang, Ke-Jie
    Shen, Li-Ming
    FUZZY SYSTEMS AND DATA MINING II, 2016, 293 : 282 - 289
  • [7] Personal bankruptcy prediction using machine learning techniques
    Brygala, Magdalena
    Korol, Tomasz
    ECONOMICS AND BUSINESS REVIEW, 2024, 10 (02) : 118 - 142
  • [8] Automated Machine Learning in Bankruptcy Prediction of Manufacturing Companies
    Papik, Mario
    Papikova, Lenka
    5TH INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING, ISM 2023, 2024, 232 : 1428 - 1436
  • [9] Benchmarking machine learning models to predict corporate bankruptcy
    Alanis, Emmanuel
    Chava, Sudheer
    Shah, Agam
    JOURNAL OF CREDIT RISK, 2023, 19 (02): : 77 - 110
  • [10] Review of bankruptcy prediction using machine learning and deep learning techniques
    Qu, Yi
    Quan, Pei
    Lei, Minglong
    Shi, Yong
    7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 : 895 - 899