SOM-US: A Novel Under-Sampling Technique for Handling Class Imbalance Problem

被引:1
|
作者
Kumar, Ajay [1 ]
机构
[1] KIET Grp Inst, Dept Informat Technol, Ghaziabad 201206, India
关键词
Class Imbalance; Under-Sampling; Software Defect Prediction; SOFTWARE DEFECT PREDICTION;
D O I
10.24138/jcomss-2023-0133
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
significant research challenge in data mining and machine learning is class imbalance classification since the majority of real -world datasets are imbalanced. When the dataset is highly unbalanced, the majority of available classification techniques frequently underperform on minority -class cases. This is due to the fact that they disregard the relative distribution of each class in favor of maximizing the overall accuracy. Various techniques based on sampling methods, cost -sensitive learning, and ensemble methods have recently been employed to handle the class imbalance problem. This paper proposes a new clusteringbased under -sampling (US) technique, called SOM-US, for handling the class imbalance problem using the self -organized map (SOM). To validate the proposed approach, an experimental study was conducted to improve the capability of a classifierlogistic regression for software defect prediction by applying SOM-US over a NASA software defect dataset. The proposed approach was compared with six existing under -sampling methods on two performance measures. The results demonstrate that the SOM-US significantly improves the prediction capability of logistic regression over other under -sampling techniques for software defect prediction.
引用
收藏
页码:69 / 75
页数:7
相关论文
共 50 条
  • [1] A Hybrid Evolutionary Under-sampling Method for Handling the Class Imbalance Problem with Overlap in Credit Classification
    Ping Gong
    Junguang Gao
    Li Wang
    Journal of Systems Science and Systems Engineering, 2022, 31 : 728 - 752
  • [2] A Hybrid Evolutionary Under-sampling Method for Handling the Class Imbalance Problem with Overlap in Credit Classification
    Gong, Ping
    Gao, Junguang
    Wang, Li
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2022, 31 (06) : 728 - 752
  • [3] A majority affiliation based under-sampling method for class imbalance problem
    Xie, Ying
    Huang, Xian
    Qin, Feng
    Li, Fagen
    Ding, Xuyang
    INFORMATION SCIENCES, 2024, 662
  • [4] A Novel Clustering-Based Three Level Under-Sampling Algorithm For Class Imbalance Problem
    Pratap, Vibha
    Singh, Amit Prakash
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2023, 27 (04): : 2319 - 2329
  • [5] DBIG-US: A two-stage under-sampling algorithm to face the class imbalance problem
    Guzman-Ponce, A.
    Sanchez, J. S.
    Valdovinos, R. M.
    Marcial-Romero, J. R.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [6] A novel framework for class imbalance learning using intelligent under-sampling
    Naganjaneyulu S.
    Kuppa M.R.
    Naganjaneyulu, S. (svna2198@gmail.com), 1600, Springer Verlag (02): : 73 - 84
  • [7] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Goyal, Somya
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 2023 - 2064
  • [8] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Somya Goyal
    Artificial Intelligence Review, 2022, 55 : 2023 - 2064
  • [9] An empirical study of dynamic selection and random under-sampling for the class imbalance problem
    Liu, Shuhua Monica
    Chen, Jiun-Hung
    Liu, Zhiheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [10] Controlled Under-Sampling with Majority Voting Ensemble Learning for Class Imbalance Problem
    Sikora, Riyaz
    Raina, Sahil
    INTELLIGENT COMPUTING, VOL 2, 2019, 857 : 33 - 39