Neighborhood attribute reduction for imbalanced data

被引:5
|
作者
Zhang, Wendong [1 ]
Wang, Xun [1 ]
Yang, Xibei [1 ]
Chen, Xiangjian [1 ]
Wang, Pingxin [1 ,2 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Sci, Zhenjiang 212003, Jiangsu, Peoples R China
关键词
Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE; ROUGH SET; FEATURE-SELECTION;
D O I
10.1007/s41066-018-0105-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From the viewpoint of rough granular computing, neighborhood decision error rate-based attribute reduction aims to improve the classification performance of the neighborhood classifier. Nevertheless, for imbalanced data which can be seen everywhere in real-world applications, such reduction does not pay much attention to the classification results of samples in minority class. Therefore, a new strategy to attribute reduction is proposed, which is embedded with preprocessing of the imbalanced data. First, the widely accepted SMOTE algorithm and K-means algorithm are used for oversampling and undersampling, respectively. Second, the neighborhood decision error rate-based attribute reduction is designed for those updated data. Finally, the neighborhood classifier can be tested with the attributes in reducts. The experimental results on some UCI and PROMISE data sets show that our approach is superior to the traditional attribute reduction based on the evaluations of F-measure and G-mean. Therefore, the contribution of this paper is to construct the attribute reduction strategy for imbalanced data, which can select useful attributes for improving the classification performance in such data.
引用
收藏
页码:301 / 311
页数:11
相关论文
共 50 条
  • [1] Neighborhood attribute reduction for imbalanced data
    Wendong Zhang
    Xun Wang
    Xibei Yang
    Xiangjian Chen
    Pingxin Wang
    Granular Computing, 2019, 4 : 301 - 311
  • [2] Neighborhood attribute reduction approach to partially labeled data
    Keyu Liu
    Eric C. C. Tsang
    Jingjing Song
    Hualong Yu
    Xiangjian Chen
    Xibei Yang
    Granular Computing, 2020, 5 : 239 - 250
  • [3] Neighborhood attribute reduction approach to partially labeled data
    Liu, Keyu
    Tsang, Eric C. C.
    Song, Jingjing
    Yu, Hualong
    Chen, Xiangjian
    Yang, Xibei
    GRANULAR COMPUTING, 2020, 5 (02) : 239 - 250
  • [4] Attribute reduction for incomplete mixed data based on neighborhood information system
    Li, Ran
    Chen, Hongchang
    Liu, Shuxin
    Jiang, Haocong
    Wang, Biao
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2024, 53 (02) : 127 - 153
  • [5] Neighborhood Discernibility Degree Incremental Attribute Reduction Algorithm for Mixed Data
    Sheng K.
    Wang W.
    Bian X.-F.
    Dong H.
    Ma J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (04): : 682 - 696
  • [6] Attribute reduction for heterogeneous data based on monotonic relative neighborhood granularity
    Dai, Jianhua
    Zhu, Zhilin
    Li, Min
    Zou, Xiongtao
    Zhang, Chucai
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 170
  • [7] Unsupervised attribute reduction based on neighborhood dependency
    Li, Yi
    Zhang, Benwen
    Yuan, Zhong
    Liu, Yuncheng
    Lei, Shenhong
    Tan, Xingqiang
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10653 - 10670
  • [8] DISTANCE BASED ON NEIGHBORHOOD CLASSIFIER AND ATTRIBUTE REDUCTION
    Gao, Yuan
    Liu, Ke-Yu
    Song, Jing-Jing
    Chen, Xiang-Jian
    Yang, Xi-Bei
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2018, : 257 - 262
  • [9] Accelerator for supervised neighborhood based attribute reduction
    Jiang, Zehua
    Liu, Keyu
    Yang, Xibei
    Yu, Hualong
    Fujitac, Hamido
    Qian, Yuhua
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 119 : 122 - 150
  • [10] Attribute Reduction Based on Rough Neighborhood Approximation
    He, Ming
    Du, Yong-ping
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL I, 2009, : 343 - 345