Cascaded Two-Stage Feature Clustering and Selection via Separability and Consistency in Fuzzy Decision Systems

被引:0
|
作者
Chen, Yuepeng [1 ]
Ding, Weiping [2 ,3 ]
Ju, Hengrong [2 ]
Huang, Jiashuang
Yin, Tao [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Artificial Intelligence & Comp Sci, Nantong 226019, Peoples R China
[3] City Univ Macau, Fac Data Sci, Macau 999078, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Rough sets; Feature extraction; Uncertainty; Fuzzy systems; Partitioning algorithms; Classification algorithms; Feature selection; fuzzy decision systems; fuzzy neighborhood rough set (FNRS); granular computing; MUTUAL INFORMATION; DEPENDENCY; SETS;
D O I
10.1109/TFUZZ.2024.3420963
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a vital technique in machine learning, as it can reduce computational complexity, improve model performance, and mitigate the risk of overfitting. However, the increasing complexity and dimensionality of datasets pose significant challenges in the selection of features. Focusing on these challenges, this article proposes a cascaded two-stage feature clustering and selection algorithm for fuzzy decision systems. In the first stage, we reduce the search space by clustering relevant features and addressing interfeature redundancy. In the second stage, a clustering-based sequentially forward selection method that explores the global and local structure of data is presented. We propose a novel metric for assessing the significance of features, which considers both global separability and local consistency. Global separability measures the degree of intraclass cohesion and interclass separation based on fuzzy membership, providing a comprehensive understanding of data separability. Meanwhile, local consistency leverages the fuzzy neighborhood rough set (FNRS) model to capture uncertainty and fuzziness in the data. The effectiveness of our proposed algorithm is evaluated through experiments conducted on 18 public datasets and a real-world schizophrenia dataset. The experiment results demonstrate our algorithm's superiority over benchmarking algorithms in both classification accuracy and the number of selected features.
引用
收藏
页码:5320 / 5333
页数:14
相关论文
共 50 条
  • [1] A clustering-based feature selection via feature separability
    Jiang, Shengyi
    Wang, Lianxi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (02) : 927 - 937
  • [2] Feature selection via fuzzy clustering
    Sun, Hao-Jun
    Sun, Mei
    Mei, Zhen
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1400 - +
  • [3] Ranking and selection with two-stage decision
    Wang, Tianxiang
    Xu, Jie
    Branke, Juergen
    Hu, Jian-Qiang
    Chen, Chun-Hung
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2025, 322 (01) : 121 - 132
  • [4] Research on two-stage fuzzy clustering method
    School of Computer, Taiyuan University of Science and Technology, Taiyuan 030024, China
    不详
    Harbin Gongcheng Daxue Xuebao, 2006, SUPPL. (552-557):
  • [5] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Arai, Hiromasa
    Maung, Crystal
    Schweitzer, Haim
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 153 - 159
  • [6] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Maung, Crystal
    Arai, Hiromasa
    Schweitzer, Haim
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (07)
  • [7] Two-Stage Feature Selection for Text Classification
    Ozgur, Levent
    Gungor, Tunga
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 329 - 337
  • [8] Learning for Efficient Supervised Query Expansion via Two-stage Feature Selection
    Zhang, Zhiwei
    Wang, Qifan
    Si, Luo
    Gao, Jianfeng
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 265 - 274
  • [9] Two-stage clustering via neural networks
    Wang, JH
    Rau, JD
    Liu, WJ
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (03): : 606 - 615
  • [10] Two-stage Fuzzy Clustering Approach for Load Profiling
    Zakaria, Z.
    Lo, K. L.
    UPEC: 2009 44TH INTERNATIONAL UNIVERSITIES POWER ENGINEERING CONFERENCE, 2009, : 976 - +