A feature selection algorithm based on Hoeffding inequality and mutual information

被引:0
|
作者
Yin, Chunyong [1 ]
Feng, Lu [1 ]
Ma, Luyu [1 ]
Yin, Zhichao [2 ]
Wang, Jin [1 ]
机构
[1] School of Computer and Software, Jiangsu Key Laboratory of Meteorological Observation and Information Processing, Jiangsu Engineering Center of Network Monitoring Nanjing University of Information Science and Technology, Nanjing, China
[2] Nanjing No.1 Middle School, Nanjing, China
关键词
Classification (of information) - Data mining;
D O I
10.14257/ijsip.2015.8.11.39
中图分类号
学科分类号
摘要
With the rapid development of the Internet, the application of data mining in the Internet is becoming more and more extensive. However, the data source’s complex feature redundancy leads that data mining process becomes very inefficient and complex. So feature selection research is essential to make data mining more efficient and simple. In this paper, we propose a new way to measure the correlation degree of internal features of dataset which is a mutation of mutual information. Additionally we also introduce Hoeffding inequality as constraint of constructing algorithm. During the experiments, we use C4.5 classification algorithm as test algorithm and compare HSF with BIF(feature selection algorithm based on mutual information). Experiments results show that HSF performances better than BIF[1] in TP and FP rate, what’s more the feature subset obtained by HSF can significantly improve the TP, FP and memory usage of C4.5 classification algorithm. © 2015 SERSC.
引用
收藏
页码:433 / 444
相关论文
共 50 条
  • [31] Feature selection using a mutual information based measure
    Al-Ani, A
    Deriche, M
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITON, VOL IV, PROCEEDINGS, 2002, : 82 - 85
  • [32] Feature selection based on mutual information with correlation coefficient
    Hongfang Zhou
    Xiqian Wang
    Rourou Zhu
    Applied Intelligence, 2022, 52 : 5457 - 5474
  • [33] FEATURE SELECTION BASED ON STATISTICAL ESTIMATION OF MUTUAL INFORMATION
    Kozhevin, A. A.
    SIBERIAN ELECTRONIC MATHEMATICAL REPORTS-SIBIRSKIE ELEKTRONNYE MATEMATICHESKIE IZVESTIYA, 2021, 18 : 720 - 728
  • [34] Mutual information-based feature selection for radiomics
    Oubel, Estanislao
    Beaumont, Hubert
    Iannessi, Antoine
    MEDICAL IMAGING 2016: PACS AND IMAGING INFORMATICS: NEXT GENERATION AND INNOVATIONS, 2016, 9789
  • [35] A review of feature selection methods based on mutual information
    Vergara, Jorge R.
    Estevez, Pablo A.
    NEURAL COMPUTING & APPLICATIONS, 2014, 24 (01): : 175 - 186
  • [36] Early Stopping for Mutual Information Based Feature Selection
    Beinrucker, Andre
    Dogan, Ueruen
    Blanchard, Gilles
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 975 - 978
  • [37] Mutual Information Based Feature Selection for Fingerprint Identification
    Adjimi, Ahlem
    Hacine-Gharbi, Abdenour
    Ravier, Philippe
    Mostefai, Messaoud
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2019, 43 (02): : 187 - 198
  • [38] Feature selection based on weighted conditional mutual information
    Zhou, Hongfang
    Wang, Xiqian
    Zhang, Yao
    APPLIED COMPUTING AND INFORMATICS, 2024, 20 (1/2) : 55 - 68
  • [39] Feature Selection Based on Mutual Information for Language Recognition
    Deng, Yan
    Liu, Jia
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4319 - 4322
  • [40] Feature Selection by Computing Mutual Information Based on Partitions
    Yin, Chengxiang
    Zhang, Hongjun
    Zhang, Rui
    Zeng, Zilin
    Qi, Xiuli
    Feng, Yuntian
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (02): : 437 - 446