IFCM: An improved Fuzzy C-means clustering method to handle Class Overlap on Aging-related Software Bug Prediction

Cited by: 2
Authors
Zhang, Chen [1, 2]
Feng, Shuo [3]
Xie, Wenzhi [1, 2]
Zhao, Dongdong [1, 2]
Xiang, Jianwen [1, 2]
Pietrantuono, Roberto [4]
Natella, Roberto [4]
Cotroneo, Domenico [4]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China
[2] Wuhan Univ Technol, Chongqing Res Inst, Chongqing, Peoples R China
[3] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Peoples R China
[4] Univ Naples Federico II, Naples, Italy
Keywords
Aging-related Bug Prediction; Class Overlap; Fuzzy C-means Cluster; Software Quality; Software Reliability;
DOI
10.1109/ISSRE59848.2023.00053
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Software aging refers to performance degradation in long-running software systems. This phenomenon is primarily attributed to the accumulation of run-time errors caused by aging-related bugs (ARBs). Detecting ARBs through Aging-related Bug Prediction (ARBP) is crucial for ensuring system reliability. The effectiveness of ARBP relies heavily on dataset quality. However, ARB datasets often suffer from class overlap, where instances from different classes exhibit similar feature values. Class overlap degrades the quality of the training data and, in turn, ARBP accuracy. To address this issue, we propose an improved Fuzzy C-means clustering method named IFCM, designed to mitigate class overlap in ARBP tasks. IFCM identifies whether an instance is overlapping and, through predefined parameters, determines the degree of overlap of that instance. We evaluate the proposed method on two public datasets (Linux and MySQL) and one self-collected dataset (NetBSD), using five different classifiers and five performance metrics (AUC, F1, Balance, PD, PF). Comparison with four existing methods (No clean, NCL, IKMCCA, ROCT) demonstrates that IFCM is effective in alleviating class overlap in ARBP. For instance, at the dataset level IFCM achieves AUC values of 0.762, 0.757, and 0.642 and Balance values of 0.709, 0.736, and 0.595.
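The abstract does not give IFCM's details, so the following is only a minimal sketch of the underlying idea using textbook Fuzzy C-means, not the authors' method. It assumes that an instance's overlap degree can be approximated from how close its two largest cluster memberships are. The function names, the `overlap_degree` definition, and the threshold `tau` are hypothetical. The `balance` helper uses the definition commonly found in defect-prediction work, which is also an assumption here since the abstract does not define the metric.

```python
import numpy as np

def fcm(X, n_clusters=2, m=2.0, n_iter=300, tol=1e-6, seed=0):
    """Textbook Fuzzy C-means (not IFCM). Returns (centers, U), where
    U[i, k] is the membership of sample i in cluster k."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)          # each row sums to 1
    for _ in range(n_iter):
        W = U ** m                              # fuzzified weights
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.maximum(dist, 1e-12)          # avoid division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U

def overlap_degree(U):
    """Hypothetical overlap score in [0, 1]: near 1 when the two largest
    memberships are almost equal, near 0 when one cluster dominates."""
    top2 = np.sort(U, axis=1)[:, -2:]
    return 1.0 - (top2[:, 1] - top2[:, 0])

def balance(pd, pf):
    """Balance as commonly defined in defect prediction (assumed here):
    one minus the normalized distance from the ideal point PD=1, PF=0."""
    return 1.0 - np.sqrt((0.0 - pf) ** 2 + (1.0 - pd) ** 2) / np.sqrt(2.0)

# Toy data: two tight groups plus one instance halfway between them.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [10, 10], [10, 11], [11, 10], [11, 11],
              [5.5, 5.5]], dtype=float)
centers, U = fcm(X)
degree = overlap_degree(U)
tau = 0.5                                       # hypothetical threshold
overlapping = degree > tau                      # flags the midpoint instance
```

In a cleaning pipeline of this kind, instances flagged as overlapping would be removed or down-weighted before training a classifier; how IFCM actually treats them is not stated in the abstract.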
Pages: 590-600
Number of pages: 11