DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning

被引:0
|
作者
Wei, Zhen [1 ]
Zhang, Li [1 ]
Zhao, Lei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
关键词
SAMPLING METHOD; SMOTE; CLASSIFICATION;
D O I
10.1109/ICPR56361.2022.9956583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In imbalanced learning, oversampling is incredibly prevalent. However, it is disappointing that existing oversampling methods have their own limitations, such as new synthetic samples may be uninformative or noisy. To better address imbalanced learning tasks, this paper proposes a novel oversampling method, named Density-induced Selection Probability-based Over-sampling TEchnique (DSPOTE). To increase the number of samples in the minority class, DSPOTE designs a novel scheme for filtering noisy samples based on the Chebychev distance and a new way of calculating selection probability based on relative density. DSPOTE first filters noisy samples and then gets borderline ones from the minority class. Next, DSPOTE calculates the selection probabilities for all borderline samples and applies these probabilities to pick up borderline samples. Finally, DSPOTE generates synthetic samples for the minority class based on the selected borderline ones. Experimental results indicate that our method has good performance in terms of metrics, recall and AUC (Area Under the Curve), when compared with other eight methods.
引用
收藏
页码:2165 / 2171
页数:7
相关论文
共 50 条
  • [1] Density-induced oversampling for highly imbalanced datasets
    Fecker, Daniel
    Maergner, Volker
    Fingscheidt, Tim
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS VI, 2013, 8661
  • [2] Probability-Based Synthetic Minority Oversampling Technique
    Altwaijry, Najwa
    IEEE ACCESS, 2023, 11 : 28831 - 28839
  • [3] Minority-prediction-probability-based oversampling technique for imbalanced learning
    Wei, Zhen
    Zhang, Li
    Zhao, Lei
    INFORMATION SCIENCES, 2023, 622 : 1273 - 1295
  • [4] WOTBoost: Weighted Oversampling Technique in Boosting for imbalanced learning
    Zhang, Wenhao
    Ramezani, Ramin
    Naeim, Arash
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2523 - 2531
  • [5] SVDD-based weighted oversampling technique for imbalanced and overlapped dataset learning
    Tao, Xinmin
    Zheng, Yujia
    Chen, Wei
    Zhang, Xiaohan
    Qi, Lin
    Fan, Zhiting
    Huang, Shan
    INFORMATION SCIENCES, 2022, 588 : 13 - 51
  • [6] ODBOT: Outlier detection-based oversampling technique for imbalanced datasets learning
    Mohammed H. IBRAHIM
    Neural Computing and Applications, 2021, 33 : 15781 - 15806
  • [7] Entropy difference and kernel-based oversampling technique for imbalanced data learning
    Wu, Xu
    Yang, Youlong
    Ren, Lingyu
    INTELLIGENT DATA ANALYSIS, 2020, 24 (06) : 1239 - 1255
  • [8] ODBOT: Outlier detection-based oversampling technique for imbalanced datasets learning
    Ibrahim, Mohammed H.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (22): : 15781 - 15806
  • [9] Fuzzy rule-based oversampling technique for imbalanced and incomplete data learning
    Liu, Gencheng
    Yang, Youlong
    Li, Benchong
    KNOWLEDGE-BASED SYSTEMS, 2018, 158 : 154 - 174
  • [10] A Membership Probability-Based Undersampling Algorithm for Imbalanced Data
    Ahn, Gilseung
    Park, You-Jin
    Hur, Sun
    JOURNAL OF CLASSIFICATION, 2021, 38 (01) : 2 - 15