Neighborhood attribute reduction for imbalanced data

Cited by: 5
Authors
Zhang, Wendong [1]
Wang, Xun [1]
Yang, Xibei [1]
Chen, Xiangjian [1]
Wang, Pingxin [1,2]
Affiliations
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
Keywords
Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE; ROUGH SET; FEATURE-SELECTION
DOI
10.1007/s41066-018-0105-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
From the viewpoint of rough granular computing, attribute reduction based on the neighborhood decision error rate aims to improve the classification performance of the neighborhood classifier. However, for imbalanced data, which are common in real-world applications, such reduction pays little attention to the classification results of samples in the minority class. Therefore, a new attribute reduction strategy is proposed that incorporates preprocessing of the imbalanced data. First, the widely used SMOTE and K-means algorithms are applied for oversampling and undersampling, respectively. Second, neighborhood decision error rate-based attribute reduction is performed on the resampled data. Finally, the neighborhood classifier is tested with the attributes in the resulting reducts. Experimental results on several UCI and PROMISE data sets show that, as measured by F-measure and G-mean, the proposed approach outperforms traditional attribute reduction. The contribution of this paper is thus an attribute reduction strategy for imbalanced data that selects attributes useful for improving classification performance on such data.
Pages: 301-311
Number of pages: 11
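The reduction and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. This record does not give the paper's exact definitions, so the code below rests on stated assumptions: the neighborhood classifier is taken to be a majority vote over samples within a Euclidean radius `delta`, the neighborhood decision error rate is estimated by leave-one-out, the search is a plain forward greedy selection, and attributes are assumed to be scaled to [0, 1]. The SMOTE and K-means resampling steps are omitted and assumed to have been applied beforehand. All function names and the `delta` default are hypothetical and chosen only for illustration.

```python
import numpy as np

def neighborhood_error_rate(X, y, attrs, delta=0.2):
    """Leave-one-out error of a radius-based majority-vote classifier
    that only looks at the attribute columns listed in `attrs`."""
    Z = X[:, attrs]
    # Pairwise Euclidean distances over the selected attributes.
    d = np.sqrt(((Z[:, None, :] - Z[None, :, :]) ** 2).sum(axis=2))
    np.fill_diagonal(d, np.inf)  # a sample never votes for itself
    classes = np.unique(y)
    errors = 0
    for i in range(len(y)):
        nbrs = np.flatnonzero(d[i] <= delta)
        if nbrs.size == 0:
            # Assumption: fall back to the single nearest sample.
            nbrs = np.array([np.argmin(d[i])])
        votes = [np.sum(y[nbrs] == c) for c in classes]
        pred = classes[int(np.argmax(votes))]
        errors += int(pred != y[i])
    return errors / len(y)

def greedy_reduct(X, y, delta=0.2):
    """Forward greedy search: add the attribute that lowers the error
    rate most, and stop when no attribute gives a strict improvement."""
    remaining = list(range(X.shape[1]))
    selected, best = [], np.inf
    while remaining:
        scored = [(neighborhood_error_rate(X, y, selected + [a], delta), a)
                  for a in remaining]
        err, attr = min(scored)
        if err >= best:
            break
        selected.append(attr)
        remaining.remove(attr)
        best = err
    return selected, best

def f_measure_and_g_mean(y_true, y_pred, minority=1):
    """F-measure and G-mean with the minority class as the positive class."""
    pos_t, pos_p = (y_true == minority), (y_pred == minority)
    tp = np.sum(pos_t & pos_p)
    fn = np.sum(pos_t & ~pos_p)
    fp = np.sum(~pos_t & pos_p)
    tn = np.sum(~pos_t & ~pos_p)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    f = (2 * precision * recall / (precision + recall)
         if precision + recall else 0.0)
    return float(f), float(np.sqrt(recall * specificity))

# Toy imbalanced data: column 0 separates the classes, column 1 does not.
X_demo = np.array([[0.00, 0.10], [0.02, 0.90], [0.04, 0.20], [0.06, 0.80],
                   [0.08, 0.30], [0.10, 0.70], [0.90, 0.15], [1.00, 0.85]])
y_demo = np.array([0, 0, 0, 0, 0, 0, 1, 1])
reduct, reduct_error = greedy_reduct(X_demo, y_demo)
```

Both metrics treat the minority class as positive, which is why they are preferred to plain accuracy on imbalanced data: G-mean is the geometric mean of minority recall and majority specificity, so it drops to zero if either class is entirely misclassified.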