Neighborhood attribute reduction for imbalanced data

被引:5
|
作者
Zhang, Wendong [1 ]
Wang, Xun [1 ]
Yang, Xibei [1 ]
Chen, Xiangjian [1 ]
Wang, Pingxin [1 ,2 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
关键词
Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE; ROUGH SET; FEATURE-SELECTION;
D O I
10.1007/s41066-018-0105-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From the viewpoint of rough granular computing, neighborhood decision error rate-based attribute reduction aims to improve the classification performance of the neighborhood classifier. Nevertheless, for imbalanced data which can be seen everywhere in real-world applications, such reduction does not pay much attention to the classification results of samples in minority class. Therefore, a new strategy to attribute reduction is proposed, which is embedded with preprocessing of the imbalanced data. First, the widely accepted SMOTE algorithm and K-means algorithm are used for oversampling and undersampling, respectively. Second, the neighborhood decision error rate-based attribute reduction is designed for those updated data. Finally, the neighborhood classifier can be tested with the attributes in reducts. The experimental results on some UCI and PROMISE data sets show that our approach is superior to the traditional attribute reduction based on the evaluations of F-measure and G-mean. Therefore, the contribution of this paper is to construct the attribute reduction strategy for imbalanced data, which can select useful attributes for improving the classification performance in such data.
引用
收藏
页码:301 / 311
页数:11
相关论文
共 50 条
  • [31] Spectral Clustering with Neighborhood Attribute Reduction Based on Information Entropy
    Jia, Hongjie
    Ding, Shifei
    Ma, Heng
    Xing, Wanqiu
    JOURNAL OF COMPUTERS, 2014, 9 (06) : 1316 - 1324
  • [32] Numerical attribute reduction based on neighborhood granulation and rough approximation
    College of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
    Ruan Jian Xue Bao, 2008, 3 (640-649):
  • [34] A Mixed Sampling Method for Imbalanced Data Based on Neighborhood Density
    Hu, Feng
    Yu, Chunlin
    Dai, Jin
    Liu, Ke
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 94 - 98
  • [35] Attribute reduction based on neighborhood constrained fuzzy rough sets
    Hu, Meng
    Guo, Yanting
    Chen, Degang
    Tsang, Eric C. C.
    Zhang, Qingshuo
    KNOWLEDGE-BASED SYSTEMS, 2023, 274
  • [36] The research of attribute reduction algorithm based on extension neighborhood relation
    Department of Information Engineering, Taiyuan University of Technology, Taiyuan 030024, China
    J. Comput. Inf. Syst., 16 (6613-6620):
  • [37] A neighborhood classifier based on adaptive radius selection and attribute reduction
    Tang, Dechang
    Zhang, Qinghua
    Liao, Wei
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2024,
  • [38] Neighborhood Attribute Reduction: A Multicriterion Strategy Based on Sample Selection
    Gao, Yuan
    Chen, Xiangjian
    Yang, Xibei
    Wang, Pingxin
    INFORMATION, 2018, 9 (11)
  • [39] Ensemble-Based Neighborhood Attribute Reduction: A Multigranularity View
    Gao, Yuan
    Chen, Xiangjian
    Yang, Xibei
    Wang, Pingxin
    Mi, Jusheng
    COMPLEXITY, 2019, 2019
  • [40] Feature selection for imbalanced data based on neighborhood rough sets
    Chen, Hongmei
    Li, Tianrui
    Fan, Xin
    Luo, Chuan
    INFORMATION SCIENCES, 2019, 483 : 1 - 20