ANALYSIS OF FEATURE SELECTION TECHNIQUES IN CREDIT RISK ASSESSMENT

被引:0
|
作者
Ramya, R. S. [1 ]
Kumaresan, S. [1 ]
机构
[1] Govt Coll Technol, Dept CSE, Coimbatore, Tamil Nadu, India
关键词
Data Mining; Credit risk assessment; Feature selection; Information gain; Gain ratio; Chi square correlation; GENETIC ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data Mining is an automated extraction of hidden knowledge from large amount of data. The computational complexity of the data mining algorithms increases rapidly as the number of features in the dataset increases. Real world credit datasets have accumulated large quantities of information about clients and their financial and payment history. Feature selection techniques are used on such high dimensional data to reduce the dimensionality by removing irrelevant and redundant features to improve the predictive accuracy of data mining algorithms. The objective of this work is study the information gain, gain ratio and chi square correlation based feature selection method to reduce the feature dimensionality. Information gain measure identifies the entropy value of each specific feature. The amount of information gain or entropy is used to decide whether the feature is selected or deleted. Gain ratio applies normalization technique to information gain using spilt information value. The correlation based feature selection uses heuristic search strategies to estimate how the features are correlated with the class attribute and how they are important of each other. Experiments were conducted on the German credit dataset available at UCI Machine Learning Repository to reduce the feature dimensionality using these feature selection methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Hybridisation of Feature Selection and Classification Techniques in Credit Risk Assessment Modelling
    Sakri, Sapiah
    Othman, Jaizah
    Halid, Noreha
    KNOWLEDGE INNOVATION THROUGH INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_20), 2020, 327 : 367 - 380
  • [2] Hybridisation of feature selection and classification techniques in credit risk assessment modelling
    Sakri, Sapiah
    Othman, Jaizah
    Halid, Noreha
    Frontiers in Artificial Intelligence and Applications, 2020, 327 : 367 - 380
  • [3] Credit Risk Assessment Using Learning Algorithms for Feature Selection
    Hassani, Zeinab
    Meybodi, Mohsen Alambardar
    Hajihashemi, Vahid
    FUZZY INFORMATION AND ENGINEERING, 2020, 12 (04) : 529 - 544
  • [4] Feature Selection Engineering for Credit Risk Assessment in Retail Banking
    Jemai, Jaber
    Zarrad, Anis
    INFORMATION, 2023, 14 (03)
  • [5] Association Rule-based Feature Selection for Credit Risk Assessment
    Mei, Xueyan
    Jiang, Yilin
    2016 IEEE INTERNATIONAL CONFERENCE OF ONLINE ANALYSIS AND COMPUTING SCIENCE (ICOACS), 2016, : 301 - 305
  • [6] The Most Effective Strategy for Incorporating Feature Selection into Credit Risk Assessment
    Atif D.
    Salmi M.
    SN Computer Science, 4 (2)
  • [7] MSEs Credit Risk Assessment Model Based on Federated Learning and Feature Selection
    Xu, Zhanyang
    Cheng, Jianchun
    Cheng, Luofei
    Xu, Xiaolong
    Bilal, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 5573 - 5595
  • [8] Genetic algorithm-based heuristic for feature selection in credit risk assessment
    Oreski, Stjepan
    Oreski, Goran
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 2052 - 2064
  • [9] AI techniques in credit risk assessment
    Berka, Petr
    Neural Network World, 1995, 5 (06): : 851 - 859
  • [10] Enterprise Credit Risk Assessment Using Feature Selection Approach and Ensemble Learning Technique
    Wang, Di
    Zhang, Zuoquan
    2020 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2020), 2020, : 228 - 233