IFCM: An improved Fuzzy C-means clustering method to handle Class Overlap on Aging-related Software Bug Prediction

Cited by: 2
Authors
Zhang, Chen [1 ,2 ]
Feng, Shuo [3 ]
Xie, Wenzhi [1 ,2 ]
Zhao, Dongdong [1 ,2 ]
Xiang, Jianwen [1 ,2 ]
Pietrantuono, Roberto [4 ]
Natella, Roberto [4 ]
Cotroneo, Domenico [4 ]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China
[2] Wuhan Univ Technol, Chongqing Res Inst, Chongqing, Peoples R China
[3] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Peoples R China
[4] Univ Naples Federico II, Naples, Italy
Keywords
Aging-related Bug Prediction; Class Overlap; Fuzzy C-means Cluster; Software Quality; Software Reliability;
DOI
10.1109/ISSRE59848.2023.00053
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software aging refers to the progressive performance decay of long-running software systems. This phenomenon is primarily attributed to the accumulation of run-time errors, commonly known as aging-related bugs (ARBs). Detecting ARBs through Aging-related Bug Prediction (ARBP) is crucial to ensuring system reliability. The effectiveness of ARBP relies heavily on the quality of the datasets. However, ARB datasets often suffer from class overlap, where instances from different classes exhibit similar feature values. Class overlap poses a significant challenge because it compromises the quality of the training data and, in turn, reduces ARBP accuracy. To address this issue, we propose an improved Fuzzy C-means clustering method, named IFCM, designed to mitigate class overlap in ARBP tasks. IFCM identifies whether an instance overlaps with another class and, through predefined parameters, determines the degree of that overlap. We evaluate the proposed method on two public datasets (Linux and MySQL) and one self-collected dataset (NetBSD), using five different classifiers and five performance metrics (AUC, F1, Balance, PD, PF). Comparison with four existing methods (No clean, NCL, IKMCCA, ROCT) demonstrates that IFCM is effective in alleviating class overlap in ARBP. For instance, at the dataset level IFCM achieves promising results in terms of AUC (0.762, 0.757, and 0.642) and Balance (0.709, 0.736, and 0.595).
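The abstract does not give the details of IFCM, so the following is only a minimal sketch of the standard Fuzzy C-means algorithm on which it builds, followed by one possible way to flag overlapping instances from the membership values. The function names, the `overlap_degree` score, the 0.5 threshold, the toy data, and the deterministic choice of initial centers are all assumptions made for illustration. None of them are the authors' definitions or parameters.

```python
import math


def memberships(points, centers, m):
    """Standard FCM membership update: u_ij = 1 / sum_k (d_ij / d_ik)^(2/(m-1))."""
    exp = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        # Clamp distances so a point lying exactly on a center does not divide by zero.
        d = [max(math.dist(p, ctr), 1e-12) for ctr in centers]
        rows.append([1.0 / sum((dj / dk) ** exp for dk in d) for dj in d])
    return rows


def fuzzy_c_means(points, c=2, m=2.0, iters=100, tol=1e-6):
    """Plain Fuzzy C-means (not IFCM). Returns (centers, membership rows)."""
    n, dim = len(points), len(points[0])
    # Assumption: simple deterministic initialisation from evenly spaced points.
    centers = [list(points[(j * n) // c]) for j in range(c)]
    for _ in range(iters):
        u = memberships(points, centers, m)
        new_centers = []
        for j in range(c):
            # Each center is the mean of all points weighted by u_ij^m.
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            new_centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total for d in range(dim)]
            )
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships(points, centers, m)


def overlap_degree(row):
    """Hypothetical score: 1 minus the gap between the two largest memberships."""
    top = sorted(row, reverse=True)
    return 1.0 - (top[0] - top[1])


# Toy data: two tight groups plus one instance lying between them.
points = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3),
          (5.0, 5.0), (5.2, 4.9), (4.9, 5.3),
          (2.6, 2.6)]
centers, u = fuzzy_c_means(points, c=2)
degrees = [overlap_degree(row) for row in u]
# Assumed threshold: flag instances whose memberships are nearly tied.
flagged = [i for i, deg in enumerate(degrees) if deg > 0.5]
print(flagged)
```

In this sketch an instance whose memberships in the two clusters are close to each other gets a score near 1, while an instance that clearly belongs to one cluster gets a score near 0. How IFCM actually defines and thresholds the degree of overlap is described in the paper itself, not here.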
Pages: 590-600
Number of pages: 11