Constrained class-wise feature selection (CCFS)

被引:2
|
作者
Hussain, Syed Fawad [1 ,2 ]
Shahzadi, Fatima [1 ,2 ]
Munir, Badre [1 ]
机构
[1] GIK Inst Engn Sci & Technol, Topi 23460, Khyber Pakhtunk, Pakistan
[2] GIK Inst, Machine Learning & Data Sci Lab MDS, Topi, Pakistan
关键词
Feature selection; Information theory; Classification; Class-wise feature selection; MUTUAL INFORMATION; TEXT CLASSIFICATION; MACHINE;
D O I
10.1007/s13042-022-01589-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection plays a vital role as a preprocessing step for high dimensional data in machine learning. The basic purpose of feature selection is to avoid "curse of dimensionality" and reduce time and space complexity of training data. Several techniques, including those that use information theory, have been proposed in the literature as a means to measure the information content of a feature. Most of them incrementally select features with max dependency with the category but minimum redundancy with already selected features. A key missing idea in these techniques is the fair representation of features with max dependency among the different categories, i.e., skewed selection of features having high mutual information (MI) with a particular class. This can result in a biased classification in favor of that particular class while other classes have low matching scores during classification. We propose a novel approach based on information theory that selects features in a class-wise fashion rather than based on their global max dependency. In addition, a constrained search is used instead of a global sequential forward search. We prove that our proposed approach enhances Maximum Relevance while keeping Minimum Redundancy under a constrained search. Results on multiple benchmark datasets show that our proposed method improves accuracy as compared to other state-of-the-art feature selection algorithms while having a lower time complexity.
引用
收藏
页码:3211 / 3224
页数:14
相关论文
共 50 条
  • [31] PCA: Progressive class-wise attention for skin lesions diagnosis
    Naveed, Asim
    Naqvi, Syed S.
    Khan, Tariq M.
    Razzak, Imran
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [32] Discovering Class-wise Trends of Max-pooling in Subspace
    Zheng, Yuchen
    Iwana, Brian Kenji
    Uchida, Seiichi
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 98 - 103
  • [33] GeoHard: Towards Measuring Class-wise Hardness through Modelling Class Semantics
    Cai, Fengyu
    Zhao, Xinran
    Zhang, Hongming
    Gurevych, Iryna
    Koeppl, Heinz
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 5571 - 5597
  • [34] Neural Networks Classify through the Class-Wise Means of Their Representations
    Seddik, Mohamed El Amine
    Tamaazousti, Mohamed
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8204 - 8211
  • [35] Class-Wise Feature Alignment Based Transfer Network for Multi-Temporal Remote Sensing Image Classification
    Guo Y.
    Song J.
    Ma L.
    Yang M.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2021, 46 (10): : 3730 - 3739
  • [36] Unsupervised Domain Adaptation Using Robust Class-Wise Matching
    Zhang, Lei
    Wang, Peng
    Wei, Wei
    Lu, Hao
    Shen, Chunhua
    van den Hengel, Anton
    Zhang, Yanning
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (05) : 1339 - 1349
  • [37] Extensions of LDA by PCA mixture model and class-wise features
    Kim, HC
    Kim, D
    Bang, SY
    8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, : 387 - 392
  • [38] Alleviating Class-Wise Gradient Imbalance for Pulmonary Airway Segmentation
    Zheng, Hao
    Qin, Yulei
    Gu, Yun
    Xie, Fangfang
    Yang, Jie
    Sun, Jiayuan
    Yang, Guang-Zhong
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (09) : 2452 - 2462
  • [39] SeNPIS: Sequential Network Pruning by class-wise Importance Score
    Pachon, Cesar G.
    Ballesteros, Dora M.
    Renza, Diego
    APPLIED SOFT COMPUTING, 2022, 129
  • [40] Extensions of LDA by PCA mixture model and class-wise features
    Kim, HC
    Kim, D
    Bang, SY
    PATTERN RECOGNITION, 2003, 36 (05) : 1095 - 1105