Integrated framework for profit-based feature selection and SVM classification in credit scoring

被引:79
|
作者
Maldonado, Sebastian [1 ]
Bravo, Cristian [2 ]
Lopez, Julio [3 ]
Perez, Juan [1 ]
机构
[1] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Monsenor Alvaro del Portillo 12455, Santiago, Chile
[2] Univ Southampton, Dept Decis Analyt & Risk, Univ Rd, Southampton SO17 1BJ, Hants, England
[3] Univ Diego Portales, Fac Ingn & Ciencias, Ejercito 441, Santiago, Chile
关键词
Profit measure; Group penalty; Credit scoring; Support Vector Machines; Analytics; SUPPORT VECTOR MACHINES; NETWORKS;
D O I
10.1016/j.dss.2017.10.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a profit-driven approach for classifier construction and simultaneous variable selection based on linear Support Vector Machines. The main goal is to incorporate business-related information such as the variable acquisition costs, the Types I and II error costs, and the profit generated by correctly classified instances, into the modeling process. Our proposal incorporates a group penalty function in the SVM formulation in order to penalize the variables simultaneously that belong to the same group, assuming that companies often acquire groups of related variables for a given cost rather than acquiring them individually. The proposed framework was studied in a credit scoring problem for a Chilean bank, and led to superior performance with respect to business-related goals. 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 121
页数:9
相关论文
共 50 条
  • [21] Combined SVM-based feature selection and classification
    Neumann, J
    Schnörr, C
    Steidl, G
    MACHINE LEARNING, 2005, 61 (1-3) : 129 - 150
  • [22] Comparison of Feature Selection Approaches based on the SVM Classification
    Li, F. C.
    Chen, F. L.
    Wang, G. E.
    IEEM: 2008 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1-3, 2008, : 400 - +
  • [23] Combined SVM-Based Feature Selection and Classification
    Julia Neumann
    Christoph Schnörr
    Gabriele Steidl
    Machine Learning, 2005, 61 : 129 - 150
  • [24] Rough set and Tabu search based feature selection for credit scoring
    Wang, Jue
    Guo, Kun
    Wang, Shouyang
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 2419 - 2426
  • [25] Feature Selection Based on the SVM Weight Vector for Classification of Dementia
    Bron, Esther E.
    Smits, Marion
    Niessen, Wiro J.
    Klein, Stefan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (05) : 1617 - 1626
  • [26] Feature Selection Based on SVM Significance Maps for Classification of Dementia
    Bron, Esther
    Smits, Marion
    van Swieten, John
    Niessen, Wiro
    Klein, Stefan
    MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2014), 2014, 8679 : 272 - 279
  • [27] Neighborhood based sample and feature selection for SVM classification learning
    He, Qiang
    Xie, Zongxia
    Hu, Qinghua
    Wu, Congxin
    NEUROCOMPUTING, 2011, 74 (10) : 1585 - 1594
  • [28] Fusion of feature selection methods for pairwise scoring SVM
    Mak, Man-Wai
    Kung, Sun-Yuan
    NEUROCOMPUTING, 2008, 71 (16-18) : 3104 - 3113
  • [29] Data mining feature selection for credit scoring models
    Liu, Y
    Schumann, M
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (09) : 1099 - 1108
  • [30] Filter- versus wrapper-based feature selection for credit scoring
    Somol, P
    Baesens, B
    Pudil, P
    Vanthienen, J
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2005, 20 (10) : 985 - 999