SOM-US: A Novel Under-Sampling Technique for Handling Class Imbalance Problem

被引:1
|
作者
Kumar, Ajay [1 ]
机构
[1] KIET Grp Inst, Dept Informat Technol, Ghaziabad 201206, India
关键词
Class Imbalance; Under-Sampling; Software Defect Prediction; SOFTWARE DEFECT PREDICTION;
D O I
10.24138/jcomss-2023-0133
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
significant research challenge in data mining and machine learning is class imbalance classification since the majority of real -world datasets are imbalanced. When the dataset is highly unbalanced, the majority of available classification techniques frequently underperform on minority -class cases. This is due to the fact that they disregard the relative distribution of each class in favor of maximizing the overall accuracy. Various techniques based on sampling methods, cost -sensitive learning, and ensemble methods have recently been employed to handle the class imbalance problem. This paper proposes a new clusteringbased under -sampling (US) technique, called SOM-US, for handling the class imbalance problem using the self -organized map (SOM). To validate the proposed approach, an experimental study was conducted to improve the capability of a classifierlogistic regression for software defect prediction by applying SOM-US over a NASA software defect dataset. The proposed approach was compared with six existing under -sampling methods on two performance measures. The results demonstrate that the SOM-US significantly improves the prediction capability of logistic regression over other under -sampling techniques for software defect prediction.
引用
收藏
页码:69 / 75
页数:7
相关论文
共 50 条
  • [21] Novel RF Direct Under-Sampling Technique and its Application to OFDM Systems
    Okuizumi, Ryoichi
    Namiki, Tatsuya
    Muraguchi, Masahiro
    2009 EUROPEAN MICROWAVE CONFERENCE, VOLS 1-3, 2009, : 1227 - 1230
  • [22] Handling Class Imbalance Problem in Cultural Modeling
    Su, Peng
    Mao, Wenji
    Zeng, Daniel
    Li, Xiaochen
    Wang, Fei-Yue
    ISI: 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS, 2009, : 251 - 256
  • [23] A Novel Hybrid Sampling Method ESMOTE plus SSLM for Handling the Problem of Class Imbalance with Overlap in Financial Distress Detection
    Wang, Xiaomin
    Zhang, Rui
    Zhang, Zuoquan
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 3081 - 3105
  • [24] A Novel Hybrid Sampling Method ESMOTE+SSLM for Handling the Problem of Class Imbalance with Overlap in Financial Distress Detection
    Xiaomin Wang
    Rui Zhang
    Zuoquan Zhang
    Neural Processing Letters, 2023, 55 : 3081 - 3105
  • [25] RFCL: A new under-sampling method of reducing the degree of imbalance and overlap
    Rui Zhang
    Zuoquan Zhang
    Di Wang
    Pattern Analysis and Applications, 2021, 24 : 641 - 654
  • [26] Novel Technique for Wideband Digital Predistortion of Power Amplifiers With an Under-Sampling ADC
    Liu, Youjiang
    Yan, Jonmei J.
    Dabag, Hayg-Taniel
    Asbeck, Peter M.
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2014, 62 (11) : 2604 - 2617
  • [27] RFCL: A new under-sampling method of reducing the degree of imbalance and overlap
    Zhang, Rui
    Zhang, Zuoquan
    Wang, Di
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (02) : 641 - 654
  • [28] Applications of digital IF receivers and under-sampling technique in ladar
    Song Zhi-yuan
    Zhu Shao-lan
    Dong Li-jun
    Feng Li
    He Hao-dong
    INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2011: LASER SENSING AND IMAGING AND BIOLOGICAL AND MEDICAL APPLICATIONS OF PHOTONICS SENSING AND IMAGING, 2011, 8192
  • [29] An Ensemble Learning-Based Undersampling Technique for Handling Class-Imbalance Problem
    Sarkar, Sobhan
    Khatedi, Nikhil
    Pramanik, Anima
    Maiti, J.
    PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 586 - 595
  • [30] Under-sampling technique for sine wave frequency estimation
    Kubo, K
    SICE 2001: PROCEEDINGS OF THE 40TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 2001, : 380 - 385