ANALYSIS OF FEATURE SELECTION TECHNIQUES IN CREDIT RISK ASSESSMENT

Cited by: 0
Authors
Ramya, R. S. [1]
Kumaresan, S. [1]
Affiliations
[1] Govt Coll Technol, Dept CSE, Coimbatore, Tamil Nadu, India
Keywords
Data mining; Credit risk assessment; Feature selection; Information gain; Gain ratio; Chi-square correlation; Genetic algorithm
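DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract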
Data mining is the automated extraction of hidden knowledge from large amounts of data. The computational complexity of data mining algorithms increases rapidly as the number of features in the dataset increases. Real-world credit datasets have accumulated large quantities of information about clients and their financial and payment history. Feature selection techniques are applied to such high-dimensional data to reduce the dimensionality by removing irrelevant and redundant features, which improves the predictive accuracy of data mining algorithms. The objective of this work is to study the information gain, gain ratio, and chi-square correlation based feature selection methods for reducing feature dimensionality. The information gain measure identifies the entropy value of each feature; the amount of information gain, or entropy, is used to decide whether the feature is selected or deleted. Gain ratio normalizes information gain using the split information value. Correlation-based feature selection uses heuristic search strategies to estimate how the features are correlated with the class attribute and how they are correlated with each other. Experiments were conducted on the German credit dataset, available at the UCI Machine Learning Repository, to reduce the feature dimensionality using these feature selection methods.
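The three measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy `housing` feature, and the `risk` labels are hypothetical, and the code assumes discrete (categorical) features and the standard textbook definitions of entropy, split information, and the Pearson chi-square statistic.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(values).values())


def information_gain(feature, labels):
    """Class entropy minus the class entropy remaining after splitting on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def gain_ratio(feature, labels):
    """Information gain normalized by split information (the entropy of the feature)."""
    split_info = entropy(feature)
    if split_info == 0:  # a constant feature carries no information
        return 0.0
    return information_gain(feature, labels) / split_info


def chi_square(feature, labels):
    """Pearson chi-square statistic of the feature-by-class contingency table."""
    total = len(labels)
    observed = Counter(zip(feature, labels))
    feature_counts = Counter(feature)
    class_counts = Counter(labels)
    stat = 0.0
    for f, nf in feature_counts.items():
        for c, nc in class_counts.items():
            expected = nf * nc / total
            stat += (observed[(f, c)] - expected) ** 2 / expected
    return stat


# Hypothetical toy data, not taken from the German credit dataset.
housing = ["own", "own", "rent", "rent", "own", "rent", "own", "rent"]
risk = ["good", "good", "bad", "bad", "good", "bad", "good", "good"]

scores = {
    "information_gain": information_gain(housing, risk),
    "gain_ratio": gain_ratio(housing, risk),
    "chi_square": chi_square(housing, risk),
}
print(scores)
```

In a filter-style selection, each feature would be scored this way and those below a chosen threshold dropped; the threshold itself is a design choice that the abstract does not specify.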
Pages: 6