Integrated framework for profit-based feature selection and SVM classification in credit scoring

被引:79
|
作者
Maldonado, Sebastian [1 ]
Bravo, Cristian [2 ]
Lopez, Julio [3 ]
Perez, Juan [1 ]
机构
[1] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Monsenor Alvaro del Portillo 12455, Santiago, Chile
[2] Univ Southampton, Dept Decis Analyt & Risk, Univ Rd, Southampton SO17 1BJ, Hants, England
[3] Univ Diego Portales, Fac Ingn & Ciencias, Ejercito 441, Santiago, Chile
关键词
Profit measure; Group penalty; Credit scoring; Support Vector Machines; Analytics; SUPPORT VECTOR MACHINES; NETWORKS;
D O I
10.1016/j.dss.2017.10.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a profit-driven approach for classifier construction and simultaneous variable selection based on linear Support Vector Machines. The main goal is to incorporate business-related information such as the variable acquisition costs, the Types I and II error costs, and the profit generated by correctly classified instances, into the modeling process. Our proposal incorporates a group penalty function in the SVM formulation in order to penalize the variables simultaneously that belong to the same group, assuming that companies often acquire groups of related variables for a given cost rather than acquiring them individually. The proposed framework was studied in a credit scoring problem for a Chilean bank, and led to superior performance with respect to business-related goals. 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 121
页数:9
相关论文
共 50 条
  • [41] Fuzzy Preference Based Feature Selection and Semisupervised SVM for Cancer Classification
    Maulik, Ujjwal
    Chakraborty, Debasis
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (02) : 152 - 160
  • [42] FS-SVM based intrusion detection feature selection and classification
    Zhang, Xueqin
    Gu, Chunhua
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 1084 - 1086
  • [43] Deep Learning for Classification of Profit-Based Operating Regions in Industrial Processes
    Agarwal, Piyush
    Tamer, Melih
    Sahraei, M. Hossein
    Budman, Hector
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2020, 59 (06) : 2378 - 2395
  • [44] A profit-based model allowing for quality achievement and manufacturing process selection
    Liebhold, V.S.
    Kimbler, D.L.
    Gramopadhye, A.K.
    Quality Engineering, 2002, 14 (01) : 25 - 32
  • [45] Pattern filtering and classification for market basket analysis with profit-based measures
    Chen, Mu-Chen
    Chao, Chuang-Min
    Wu, Kuan-Ting
    EXPERT SYSTEMS, 2012, 29 (02) : 170 - 182
  • [46] An improved SVM based on 1-norm for selection of personal credit scoring index system
    Xue, Xin
    He, Guoping
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 3, PROCEEDINGS, 2007, 4493 : 441 - +
  • [47] Credit Risk Evaluation Based on Data Mining and Integrated Feature Selection
    Deng, Yuanjie
    Wei, Ying
    Li, Yujun
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [48] k-Top Scoring Pair Algorithm for feature selection in SVM with applications to microarray data classification
    Yoon, Sejong
    Kim, Saejoon
    SOFT COMPUTING, 2010, 14 (02) : 151 - 159
  • [49] k-Top Scoring Pair Algorithm for feature selection in SVM with applications to microarray data classification
    Sejong Yoon
    Saejoon Kim
    Soft Computing, 2010, 14 : 151 - 159
  • [50] Application of the VNS heuristic for feature selection in credit scoring problems
    Helder, Victor Gomes
    Filomena, Tiago Pascoal
    Ferreira, Luciano
    Kirch, Guilherme
    MACHINE LEARNING WITH APPLICATIONS, 2022, 9