DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning

Authors
Wei, Zhen [1]
Zhang, Li [1]
Zhao, Lei [1]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Keywords
SAMPLING METHOD; SMOTE; CLASSIFICATION
DOI
10.1109/ICPR56361.2022.9956583
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Oversampling is widely used in imbalanced learning. However, existing oversampling methods have limitations; for example, the synthetic samples they generate may be uninformative or noisy. To better address imbalanced learning tasks, this paper proposes a novel oversampling method named Density-induced Selection Probability-based Oversampling TEchnique (DSPOTE). To increase the number of samples in the minority class, DSPOTE introduces a novel scheme for filtering noisy samples based on the Chebyshev distance and a new way of calculating selection probabilities based on relative density. DSPOTE first filters out noisy samples and then identifies the borderline samples in the minority class. Next, it calculates a selection probability for every borderline sample and uses these probabilities to select borderline samples. Finally, it generates synthetic minority-class samples from the selected borderline samples. Experimental results indicate that our method performs well in terms of recall and AUC (area under the curve) when compared with eight other methods.
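The four stages named in the abstract (filter noise, find borderline samples, compute selection probabilities, generate synthetic samples) can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so every concrete rule below is an assumption and not the authors' method: the noise and borderline tests are borrowed from Borderline-SMOTE-style neighbour counting, the "relative density" is a stand-in ratio of mean Chebyshev distances, and generation uses SMOTE-style interpolation. The function name `dspote_sketch` and its parameters `k`, `n_new`, and `seed` are hypothetical.

```python
import numpy as np


def chebyshev(A, B):
    """Pairwise Chebyshev (L-infinity) distances between rows of A and rows of B."""
    return np.abs(A[:, None, :] - B[None, :, :]).max(axis=2)


def dspote_sketch(X, y, minority=1, k=5, n_new=10, seed=0):
    """Illustrative pipeline only; all thresholds and formulas are assumptions."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    min_idx = np.flatnonzero(y == minority)
    is_maj = y != minority
    X_min, X_maj = X[min_idx], X[is_maj]

    # Stage 1 (assumed rule): a minority sample is noise when all of its
    # k nearest neighbours under the Chebyshev distance are majority samples.
    D_all = chebyshev(X_min, X)
    D_all[np.arange(len(min_idx)), min_idx] = np.inf  # exclude the sample itself
    nn = np.argsort(D_all, axis=1)[:, :k]
    maj_frac = is_maj[nn].mean(axis=1)
    keep = maj_frac < 1.0
    if keep.sum() < 2:
        raise ValueError("need at least two non-noisy minority samples")

    # Stage 2 (assumed rule): borderline = at least half of the neighbours
    # are majority. Fall back to all kept samples if none qualify.
    border = keep & (maj_frac >= 0.5)
    if not border.any():
        border = keep
    X_clean, B = X_min[keep], X_min[border]

    # Stage 3 (stand-in for the paper's relative density): ratio of the mean
    # distance to the k nearest majority samples over the mean distance to
    # the k nearest kept minority samples, normalised to sum to one.
    D_bm = chebyshev(B, X_clean)
    d_min = np.sort(D_bm, axis=1)[:, 1:k + 1].mean(axis=1)  # column 0 is self
    d_maj = np.sort(chebyshev(B, X_maj), axis=1)[:, :k].mean(axis=1)
    eps = 1e-12
    rel = (d_maj + eps) / (d_min + eps)
    prob = rel / rel.sum()

    # Stage 4 (SMOTE-style interpolation, an assumption): draw borderline
    # samples by probability and interpolate toward a random minority neighbour.
    picks = rng.choice(len(B), size=n_new, p=prob)
    nbrs = np.argsort(D_bm, axis=1)[:, 1:k + 1]
    partner = nbrs[picks, rng.integers(nbrs.shape[1], size=n_new)]
    gap = rng.random((n_new, 1))
    return B[picks] + gap * (X_clean[partner] - B[picks])
```

Calling `dspote_sketch(X, y, minority=1, k=5, n_new=20)` returns an array of 20 synthetic points that can be appended to the minority class before training. Because each synthetic point is a convex combination of two non-noisy minority samples, isolated minority points surrounded by the majority class do not contribute to the output.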
Pages: 2165-2171
Number of pages: 7