SOM-US: A Novel Under-Sampling Technique for Handling Class Imbalance Problem

被引:1
|
作者
Kumar, Ajay [1 ]
机构
[1] KIET Grp Inst, Dept Informat Technol, Ghaziabad 201206, India
关键词
Class Imbalance; Under-Sampling; Software Defect Prediction; SOFTWARE DEFECT PREDICTION;
D O I
10.24138/jcomss-2023-0133
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
significant research challenge in data mining and machine learning is class imbalance classification since the majority of real -world datasets are imbalanced. When the dataset is highly unbalanced, the majority of available classification techniques frequently underperform on minority -class cases. This is due to the fact that they disregard the relative distribution of each class in favor of maximizing the overall accuracy. Various techniques based on sampling methods, cost -sensitive learning, and ensemble methods have recently been employed to handle the class imbalance problem. This paper proposes a new clusteringbased under -sampling (US) technique, called SOM-US, for handling the class imbalance problem using the self -organized map (SOM). To validate the proposed approach, an experimental study was conducted to improve the capability of a classifierlogistic regression for software defect prediction by applying SOM-US over a NASA software defect dataset. The proposed approach was compared with six existing under -sampling methods on two performance measures. The results demonstrate that the SOM-US significantly improves the prediction capability of logistic regression over other under -sampling techniques for software defect prediction.
引用
收藏
页码:69 / 75
页数:7
相关论文
共 50 条
  • [41] DBMUTE: density-based majority under-sampling technique
    Bunkhumpornpat, Chumphol
    Sinapiromsaran, Krung
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (03) : 827 - 850
  • [42] DBMUTE: density-based majority under-sampling technique
    Chumphol Bunkhumpornpat
    Krung Sinapiromsaran
    Knowledge and Information Systems, 2017, 50 : 827 - 850
  • [43] A Multiple Expert Approach to the Class Imbalance Problem Using Inverse Random under Sampling
    Tahir, Muhammad Atif
    Kittler, Josef
    Mikolajczyk, Krystian
    Yan, Fei
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2009, 5519 : 82 - 91
  • [44] An Approach to Class Imbalance Problem Based on Stacking and Inverse Random Under Sampling Methods
    Zhang, Yuwei
    Liu, Guanjun
    Luan, Wenjing
    Yan, Chungang
    Jiang, Changjun
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC), 2018,
  • [45] Boosting the performance of over-sampling algorithms through under-sampling the minority class
    de Morais, Romero F. A. B.
    Vasconcelos, Germano C.
    NEUROCOMPUTING, 2019, 343 : 3 - 18
  • [46] A Novel Expandable Borderline Smote Over-Sampling Method for Class Imbalance Problem
    Sun, Hao
    Li, Jianping
    Zhu, Xiaoqian
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (05) : 2183 - 2199
  • [47] Benchmarking framework for class imbalance problem using novel sampling approach for big data
    Ahlawat, Khyati
    Chug, Anuradha
    Singh, Amit Prakash
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2019, 10 (04) : 824 - 835
  • [48] Benchmarking framework for class imbalance problem using novel sampling approach for big data
    Khyati Ahlawat
    Anuradha Chug
    Amit Prakash Singh
    International Journal of System Assurance Engineering and Management, 2019, 10 : 824 - 835
  • [49] Alleviating Class Imbalance Issue in Software Fault Prediction Using DBSCAN-Based Induced Graph Under-Sampling Method
    Bhandari, Kirti
    Kumar, Kuldeep
    Sangal, Amrit Lal
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 12589 - 12627
  • [50] A Comparative Study on Sampling Techniques for Handling Class Imbalance in Streaming Data
    Nguyen, Hien M.
    Cooper, Eric W.
    Kamei, Katsuari
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1762 - 1767