Credit Scoring of Small and Micro Enterprises Based on Sample-Dependent Cost Matrix

被引:0
|
作者
Zhang T. [1 ,2 ]
Wang Y. [1 ]
Li K. [1 ]
Zhang Y. [3 ,4 ]
机构
[1] School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai
[2] Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai
[3] School of Computer Science, Fudan University, Shanghai
[4] Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai
来源
关键词
Cost sensitive learning; Credit scoring; Minimum Bayes risk; Sample-dependent; XGBoost model;
D O I
10.11908/j.issn.0253-374x.19017
中图分类号
学科分类号
摘要
Because the credit history data of small and micro enterprises are small and the problem of class imbalance is more serious, this paper proposes a Smote XGboost-Bayes Minimum Risk (SXG-BMR) model based on the sample-dependent cost matrix. The whole sample is oversampled at a low rate to weaken the problem of class imbalance and reduce the risk of model overfitting. The model combines the integrated learning model with the minimum risk Bayes decision to realize the cost sensitivity. At the same time, this paper introduces the sample-dependent cost matrix into the model. The cost matrix is related not only to the category, but also to the attributes of the sample.Therefore,it can characterize the cost more accurately. In the empirical study,this paper uses a standard credit dataset and a real credit dataset of small and micro enterprises in Shanghai. Besides,it compares and analzes of various algorithms. The results show that the SXG-BMR model proposed in this paper has a good performance. © 2020, Editorial Department of Journal of Tongji University. All right reserved.
引用
收藏
页码:149 / 158
页数:9
相关论文
共 33 条
  • [1] West D., Neural network credit scoring models, Computers & Operations Research, 27, 11, (2000)
  • [2] Xiao W., Fei Q., A study of personal credit scoring models on support vector machine with optimal choice of kernel function parameters, Systems Engineering -Theory & Practice, 26, 10, (2006)
  • [3] Bhattacharyya S., Jha S., Tharakunnel K., Et al., Data mining for credit card fraud: a comparative study, Decision Support Systems, 50, 3, (2011)
  • [4] Deng C., Hu M., Zeng W., Small business credit scoring model based on Bayesian inference using bound and collapse, Journal of Industrial Engineering and Engineering Management, 29, 4, (2015)
  • [5] Lessmann S., Baesens B., Seow H.V., Et al., Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research, European Journal of Operational Research, 247, 1, (2015)
  • [6] Xiao B., Yang Y., Li X., Et al., Research on the credit rating of small and micro enterprises based on fuzzy neural network, Journal of Management Sciences in China, 19, 11, (2016)
  • [7] Xiong Z., Research on feature selection method in credit evaluation, The Jouranal of Quantitative & Technical Economics, 33, 1, (2016)
  • [8] Vlasselaer V.V., Bravo C., Caelen O., Et al., APATE: a novel approach for automated credit card transaction fraud detection using network-based extensions, Decision Support Systems, 75, (2015)
  • [9] Dahiya S., Handa S.S., Singh N.P., A feature selection enabled hybrid-bagging algorithm for credit risk evaluation, Expert Systems, 34, 9, (2017)
  • [10] Chen F.L., Li F.C., Combination of feature selection approaches with SVM in credit scoring, Expert Systems with Applications, 37, 7, (2010)