Fuzzy-Rough Set Bireducts for Data Reduction

被引:20
|
作者
Parthalain, Neil Mac [1 ]
Jensen, Richard [1 ]
Diao, Ren [2 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[2] Candela Shenzhen Technol Innovate Co Ltd, Shenzhen 815000, Peoples R China
关键词
Rough sets; Tools; Uncertainty; Feature extraction; Noise measurement; Training data; Dimensionality reduction; Bireducts; feature selection (FS); fuzzy-rough sets; instance selection;
D O I
10.1109/TFUZZ.2019.2921935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction is an important step that helps ease the computational intractability for learning techniques when data are large. This is particularly true for the huge datasets that have become commonplace in recent times. The main problem facing both data preprocessors and learning techniques is that data are expanding both in terms of dimensionality and also in terms of the number of data instances. Approaches based on fuzzy-rough sets offer many advantages for both feature selection and classification, particularly for real-valued and noisy data; however, the majority of recent approaches tend to address the task of data reduction in terms of either dimensionality or training data size in isolation. This paper demonstrates how the notion of fuzzy-rough bireducts can be used for the simultaneous reduction of data size and dimensionality. It also shows how bireducts and, therefore, reduced subtables of data can be used not only as a preprocessing tool but also for the learning of compact and robust classifiers. Furthermore, the ideas can also be extended to the unsupervised domain when dealing with unlabeled data. Experimental evaluation of various techniques demonstrate that high levels of simultaneous reduction of both dimensionality and data size can be achieved whilst maintaining robust performance.
引用
收藏
页码:1840 / 1850
页数:11
相关论文
共 50 条
  • [1] Fuzzy-rough set models and fuzzy-rough data reduction
    Ghroutkhar, Alireza Mansouri
    Nehi, Hassan Mishmast
    CROATIAN OPERATIONAL RESEARCH REVIEW, 2020, 11 (01) : 67 - 80
  • [2] Fuzzy-Rough Bireducts With Supervised Multiscale Granulation
    Wang, Zhihong
    Chen, Hongmei
    Liao, Huming
    Yin, Tengyu
    Xiang, Biao
    Horng, Shi-Jinn
    Li, Tianrui
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2025, 33 (04) : 1253 - 1264
  • [3] Simultaneous Feature And Instance Selection Using Fuzzy-Rough Bireducts
    Mac Parthalain, Neil
    Jensen, Richard
    2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [4] Fuzzy-Rough Bireducts Algorithm Based on Particle Swarm Optimization
    Liu Z.-F.
    Pan S.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (04): : 49 - 55
  • [5] Compatibility rough-fuzzy set and compatibility fuzzy-rough set
    Hu, SS
    He, YQ
    Zhang, M
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 1922 - 1928
  • [6] Unsupervised fuzzy-rough set-based dimensionality reduction
    Mac Parthalain, Neil
    Jensen, Richard
    INFORMATION SCIENCES, 2013, 229 : 106 - 121
  • [7] Heuristic Search for Fuzzy-Rough Bireducts and its Use in Classifier Ensembles
    Diao, Ren
    Mac Parthalain, Neil
    Jensen, Richard
    Shen, Qiang
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 1504 - 1511
  • [8] Fuzzy-rough data reduction based on information entropy
    Zhao, Jun-Yang
    Mang, Zhi-Li
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3708 - 3712
  • [9] Fuzzy-rough data reduction with ant colony optimization
    Jensen, R
    Shen, Q
    FUZZY SETS AND SYSTEMS, 2005, 149 (01) : 5 - 20
  • [10] Fuzzy-Rough Set Based Attribute Reduction with a Simple Fuzzification Method
    Wang, Xueen
    Han, Deqiang
    Han, Chongzhao
    PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 3793 - 3797