Feature Selection With Local Density-Based Fuzzy Rough Set Model for Noisy Data

被引:10
|
作者
Yang, Xiaoling [1 ,2 ,3 ]
Chen, Hongmei [1 ,2 ,3 ]
Wang, Hao [4 ]
Li, Tianrui [1 ,2 ,3 ]
Yu, Zeng [1 ,2 ,3 ]
Wang, Zhihong [1 ,2 ,3 ]
Luo, Chuan [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence 611756, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Inst Artificial Intelligence, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Peoples R China
[4] Zhejiang Lab, Res Inst Artificial Intelligence, Hangzhou 311000, Peoples R China
[5] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Data uncertainty; density function; feature selection; fuzzy rough set (FRS); mutual information; noisy data; ATTRIBUTE REDUCTION; MUTUAL INFORMATION; MAX-RELEVANCE;
D O I
10.1109/TFUZZ.2022.3206508
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy rough set theory canmodel uncertainty in data and has been applied to feature selection for machine learning tasks. The existence of noise in data is one of the reasons for data uncertainty. However, most classical fuzzy rough set models are often sensitive to the noise in data, which somewhat degrades their applicability to process uncertainty of data. Furthermore, a robust feature evaluation function is nontrivial in a fuzzy rough set model as a nonoptimal feature subsets may be selected due to the perturbations from redundant features. In this article, we delve into local density and indispensable features for fuzzy rough feature selection to address these challenges. We first propose a local density-based fuzzy rough set (LDFRS) model to tackle noisy data. Mutual information is then plugged into the proposed LDFRS model to evaluate uncertainty in data. A joint feature evaluation function on the indispensability and relevance of features is constructed to evaluate the significance of features. On this basis, a fuzzy rough feature selection algorithm is built upon the LDFRS model. Experimental results using four typical classifiers demonstrate the robustness and effectiveness of the proposed model including our feature selection algorithm and its superiority against baseline methods.
引用
收藏
页码:1614 / 1627
页数:14
相关论文
共 50 条
  • [41] Predictive Maintenance Model Based on Multisensor Data Fusion of Hybrid Fuzzy Rough Set Theory Feature Selection and Stacked Ensemble for Fault Classification
    Buabeng, Albert
    Simons, Anthony
    Frempong, Nana Kena
    Ziggah, Yao Yevenyo
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [42] A noise-aware fuzzy rough set approach for feature selection
    Yang, Xiaoling
    Chen, Hongmei
    Li, Tianrui
    Luo, Chuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [43] Feature Selection Based on PSO and Decision-Theoretic Rough Set Model
    Stevanovic, Aneta
    Xue, Bing
    Zhang, Mengjie
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 2840 - 2847
  • [44] A model based on ant colony system and rough set theory to feature selection
    Bello, R.
    Nowe, A.
    Caballero, Y.
    Gomez, Y.
    Vrancx, P.
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 275 - 276
  • [45] Rough set-based feature selection method
    Zhan, YM
    Zeng, XY
    Sun, JC
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2005, 15 (03) : 280 - 284
  • [46] Feature selection based on rough set and information entropy
    Han, JC
    2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 153 - 158
  • [47] A Rough Set Based Hybrid Method to Feature Selection
    Ming, He
    KAM: 2008 INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING, PROCEEDINGS, 2008, : 585 - 588
  • [48] Rough set-based feature selection method
    ZHAN Yanmei
    Progress in Natural Science, 2005, (03) : 88 - 92
  • [49] Rough set model based on axiomatic fuzzy set
    Xu, Siyu
    Qin, Keyun
    Pan, Xiaodong
    Fu, Chao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 1423 - 1436
  • [50] A Supervised Feature Selection Method For Mixed-Type Data using Density-based Feature Clustering
    Yan, Xuyang
    Sarkar, Mrinmoy
    Gebru, Biniam
    Nazmi, Shabnam
    Homaifar, Abdollah
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1900 - 1905