Fuzzy-Rough Set Bireducts for Data Reduction

被引:20
|
作者
Parthalain, Neil Mac [1 ]
Jensen, Richard [1 ]
Diao, Ren [2 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[2] Candela Shenzhen Technol Innovate Co Ltd, Shenzhen 815000, Peoples R China
关键词
Rough sets; Tools; Uncertainty; Feature extraction; Noise measurement; Training data; Dimensionality reduction; Bireducts; feature selection (FS); fuzzy-rough sets; instance selection;
D O I
10.1109/TFUZZ.2019.2921935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction is an important step that helps ease the computational intractability for learning techniques when data are large. This is particularly true for the huge datasets that have become commonplace in recent times. The main problem facing both data preprocessors and learning techniques is that data are expanding both in terms of dimensionality and also in terms of the number of data instances. Approaches based on fuzzy-rough sets offer many advantages for both feature selection and classification, particularly for real-valued and noisy data; however, the majority of recent approaches tend to address the task of data reduction in terms of either dimensionality or training data size in isolation. This paper demonstrates how the notion of fuzzy-rough bireducts can be used for the simultaneous reduction of data size and dimensionality. It also shows how bireducts and, therefore, reduced subtables of data can be used not only as a preprocessing tool but also for the learning of compact and robust classifiers. Furthermore, the ideas can also be extended to the unsupervised domain when dealing with unlabeled data. Experimental evaluation of various techniques demonstrate that high levels of simultaneous reduction of both dimensionality and data size can be achieved whilst maintaining robust performance.
引用
收藏
页码:1840 / 1850
页数:11
相关论文
共 50 条
  • [31] Electric load forecasting method based on fuzzy-rough set
    Lei, Shaolan
    Sun, Caixin
    Zhou, Quan
    Zhang, Xiaoxing
    Gaodianya Jishu/High Voltage Engineering, 2004, 30 (09):
  • [32] Fuzzy-Rough Set based Semi-Supervised Learning
    Mac Parthalain, Neil
    Jensen, Richard
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2465 - 2472
  • [33] On GPU–CUDA as preprocessing of fuzzy-rough data reduction by means of singular value decomposition
    Salvatore Cuomo
    Ardelio Galletti
    Livia Marcellino
    Guglielmo Navarra
    Gerardo Toraldo
    Soft Computing, 2018, 22 : 1525 - 1532
  • [34] Missing data imputation using fuzzy-rough methods
    Amiri, Mehran
    Jensen, Richard
    NEUROCOMPUTING, 2016, 205 : 152 - 164
  • [35] An integrated fuzzy-rough set model for identification of tea leaf diseases
    Krishnan, Jayapal Santhana
    Sivakumar, Poruran
    PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES, 2022, 59 (06): : 947 - 952
  • [36] An Attribute Reduction Method based on Fuzzy-Rough Sets Theories
    Hu Guohua
    Shi Yuemei
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL III, 2009, : 828 - 831
  • [37] Fuzzy-rough set based nearest neighbor clustering classification algorithm
    Wang, XY
    Yang, J
    Teng, XL
    Peng, NS
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 1, PROCEEDINGS, 2005, 3613 : 370 - 373
  • [38] Upper Nearness Degree and Lower Nearness Degree of Fuzzy-Rough Set
    Wang Biao
    Gao Guanglai
    KAM: 2008 INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING, PROCEEDINGS, 2008, : 54 - 58
  • [39] Uncertainty representation of ocean fronts based on fuzzy-rough set theory
    Xue C.
    Zhou C.
    Su F.
    Zhang D.
    Journal of Ocean University of China, 2008, 7 (2) : 131 - 136