DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning

被引:0
|
作者
Wei, Zhen [1 ]
Zhang, Li [1 ]
Zhao, Lei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
关键词
SAMPLING METHOD; SMOTE; CLASSIFICATION;
D O I
10.1109/ICPR56361.2022.9956583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In imbalanced learning, oversampling is incredibly prevalent. However, it is disappointing that existing oversampling methods have their own limitations, such as new synthetic samples may be uninformative or noisy. To better address imbalanced learning tasks, this paper proposes a novel oversampling method, named Density-induced Selection Probability-based Over-sampling TEchnique (DSPOTE). To increase the number of samples in the minority class, DSPOTE designs a novel scheme for filtering noisy samples based on the Chebychev distance and a new way of calculating selection probability based on relative density. DSPOTE first filters noisy samples and then gets borderline ones from the minority class. Next, DSPOTE calculates the selection probabilities for all borderline samples and applies these probabilities to pick up borderline samples. Finally, DSPOTE generates synthetic samples for the minority class based on the selected borderline ones. Experimental results indicate that our method has good performance in terms of metrics, recall and AUC (Area Under the Curve), when compared with other eight methods.
引用
收藏
页码:2165 / 2171
页数:7
相关论文
共 50 条
  • [31] An Improved Oversampling Algorithm Based on the Samples' Selection Strategy for Classifying Imbalanced Data
    Xie, Wenhao
    Liang, Gongqian
    Dong, Zhonghui
    Tan, Baoyu
    Zhang, Baosheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [32] MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning
    Barua, Sukarna
    Islam, Md. Monirul
    Yao, Xin
    Murase, Kazuyuki
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) : 405 - 425
  • [33] An extension of Synthetic Minority Oversampling Technique based on Kalman filter for imbalanced datasets
    Thejas, G. S.
    Hariprasad, Yashas
    Iyengar, S. S.
    Sunitha, N. R.
    Badrinath, Prajwal
    Chennupati, Shasank
    MACHINE LEARNING WITH APPLICATIONS, 2022, 8
  • [34] OALDPC: oversampling approach based on local density peaks clustering for imbalanced classification
    Li, Junnan
    Zhu, Qingsheng
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30987 - 31017
  • [35] OALDPC: oversampling approach based on local density peaks clustering for imbalanced classification
    Junnan Li
    Qingsheng Zhu
    Applied Intelligence, 2023, 53 : 30987 - 31017
  • [36] Natural local density-based adaptive oversampling algorithm for imbalanced classification
    Wang, Wentong
    Yang, Lijun
    Zhang, Jinghui
    Yang, Juntao
    Tang, Dongming
    Liu, Tao
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [37] Genetic Algorithm-Based Oversampling Technique to Learn from Imbalanced Data
    Saladi, Puneeth Srinivas Mohan
    Dash, Tirtharaj
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2017, VOL 1, 2019, 816 : 387 - 397
  • [38] Minimum Bayesian error probability-based gene subset selection
    Li, Jian
    Yu, Tian
    Wei, Jin-Mao
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 434 - 450
  • [39] Affinity and class probability-based fuzzy support vector machine for imbalanced data sets
    Tao, Xinmin
    Li, Qing
    Ren, Chao
    Guo, Wenjie
    He, Qing
    Liu, Rui
    Zou, Junrong
    NEURAL NETWORKS, 2020, 122 (122) : 289 - 307
  • [40] Centric Probability-Based Sample Selection for Oriented Object Detection
    Yao, Yanqing
    Cheng, Gong
    Lang, Chunbo
    Xie, Xingxing
    Han, Junwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (04) : 3290 - 3302