A New Safe-Level Enabled Borderline-SMOTE for Condition Recognition of Imbalanced Dataset

Cited by: 3
Authors
Chen, Chao [1]
Shen, Wei [1, 2]
Yang, Chenhao [1]
Fan, Wei [1]
Liu, Xin [3]
Li, Ying [4]
Affiliations
[1] Jiangsu Univ, Sch Mech Engn, Zhenjiang 212013, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai 200030, Peoples R China
[3] Jilin Univ, Sch Mech & Aerosp Engn, Changchun 130000, Peoples R China
[4] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210095, Peoples R China
Keywords
Boundary data; condition recognition; imbalanced classification; light gradient boosting machine (LightGBM); safe-level synthetic minority oversampling technique (SMOTE); synthetic minority oversampling technique; PREDICTION
DOI
10.1109/TIM.2023.3289545
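Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract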
Machine learning (ML)-based classification has been applied successfully in industrial monitoring, but its performance often degrades when the dataset is imbalanced. In particular, misclassification, a serious loss of generalization ability, occurs most often in the minority class. To address this problem, the borderline synthetic minority oversampling technique (B-SMOTE), which enriches the minority samples around decision boundaries, has received considerable attention. However, most imbalanced classification techniques built on B-SMOTE generate instances using a random weight between 0 and 1, which may reduce the authenticity of the newly generated samples. Herein, a novel oversampling strategy is proposed that provides new safety criteria and reassigns the threshold of the weight coefficient, in order to improve both the authenticity of the generated samples and the classification accuracy. In addition, the light gradient boosting machine (LightGBM) is adopted to build the classification model. Experiments show the effectiveness and superiority of the proposed method in handling imbalanced classification tasks.
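
The role of the weight coefficient mentioned in the abstract can be illustrated with a minimal sketch of generic SMOTE-style interpolation. This is not the authors' method: the abstract does not state the new safety criteria or the reassigned threshold, so the `weight_range` parameter, the function names, and the toy data below are all hypothetical, and borderline detection is omitted entirely.

```python
import math
import random


def k_nearest(point, candidates, k):
    """Return the k candidates closest to `point` (Euclidean), excluding `point` itself."""
    others = [c for c in candidates if c is not point]
    return sorted(others, key=lambda c: math.dist(point, c))[:k]


def smote_like(minority, n_new, k=3, weight_range=(0.0, 1.0), seed=0):
    """Generate `n_new` synthetic points by interpolating between minority samples.

    `weight_range` is a hypothetical knob: (0.0, 1.0) mimics the usual random
    weight, while a narrower interval keeps synthetic points closer to the
    base sample. The interval the paper actually uses is not given here.
    """
    rng = random.Random(seed)
    low, high = weight_range
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbor = rng.choice(k_nearest(base, minority, k))
        w = rng.uniform(low, high)
        # New sample lies on the segment from `base` toward `neighbor`.
        synthetic.append(tuple(b + w * (n - b) for b, n in zip(base, neighbor)))
    return synthetic


# Toy minority class: four corners of the unit square (illustrative data only).
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
samples = smote_like(minority, n_new=20, weight_range=(0.0, 0.5))
```

With the weight limited to at most 0.5, every synthetic point stays on the half of the segment nearest its base sample, which is one simple way a restricted weight interval changes where new samples land.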
Pages: 10
Related papers
14 in total
  • [1] Safe-level SMOTE method for handling the class imbalanced problem in electroencephalography dataset of adult anxious state
    Daud, Syarifah Noor Syakiylla Sayed
    Sudirman, Rubita
    Shing, Tee Wee
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [2] Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning
    Han, H
    Wang, WY
    Mao, BH
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 878 - 887
  • [3] A Novel Method for Identification of Glutarylation Sites Combining Borderline-SMOTE With Tomek Links Technique in Imbalanced Data
    Ning, Qiao
    Zhao, Xiaowei
    Ma, Zhiqiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (05) : 2632 - 2641
  • [4] Hybrid Oversampling and Undersampling Method (HOUM) via Safe-Level SMOTE and Support Vector Machine
    Eroglu, Duygu Yilmaz
    Pir, Mestan Sahin
    APPLIED SCIENCES-BASEL, 2024, 14 (22)
  • [5] SVM Classification of Microaneurysms with Imbalanced Dataset Based on Borderline-SMOTE and Data Cleaning Techniques
    Wang, Qingjie
    Xin, Jingmin
    Wu, Jiayi
    Zheng, Nanning
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [6] BS-SC Model: A Novel Method for Predicting Child Abuse Using Borderline-SMOTE Enabled Stacking Classifier
    Parthasarathy S.
    Lakshminarayanan A.R.
    Computer Systems Science and Engineering, 2023, 46 (02) : 1311 - 1336
  • [7] Effects of Data Augmentation Method Borderline-SMOTE on Emotion Recognition of EEG Signals Based on Convolutional Neural Network
    Chen, Yu
    Chang, Rui
    Guo, Jifeng
    IEEE ACCESS, 2021, 9 : 47491 - 47502
  • [8] Safe-Level-SMOTE: Safe-Level-Synthetic Minority Over-Sampling TEchnique for Handling the Class Imbalanced Problem
    Bunkhumpornpat, Chumphol
    Sinapiromsaran, Krung
    Lursinsap, Chidchanok
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 475 - 482
  • [9] Predicting Seminal Quality via Imbalanced Learning with Evolutionary Safe-Level Synthetic Minority Over-Sampling Technique
    Ma, Jieming
    Afolabi, David Olalekan
    Ren, Jie
    Zhen, Aiyan
    COGNITIVE COMPUTATION, 2021, 13 (04) : 833 - 844