Does class size matter? An in-depth assessment of the effect of class size in software defect prediction

被引:5
|
作者
Tahir, Amjed [1 ]
Bennin, Kwabena E. [2 ]
Xiao, Xun [1 ,3 ]
MacDonell, Stephen G. [4 ,5 ]
机构
[1] Massey Univ, Software Engn, Palmerston North, New Zealand
[2] Wageningen Univ & Res, Informat Technol Grp, Wageningen, Netherlands
[3] Massey Univ, Sch Fundamental Sci, Palmerston North, New Zealand
[4] Auckland Univ Technol, Software Engn, Auckland, New Zealand
[5] Univ Otago, Informat Sci, Dunedin, New Zealand
关键词
Defect prediction; Class size; Metrics; Software quality; ORIENTED DESIGN METRICS; EMPIRICAL VALIDATION; VALIDITY; CODE; MEDIATION; FAILURES; QUALITY;
D O I
10.1007/s10664-021-09991-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the past 20 years, defect prediction studies have generally acknowledged the effect of class size on software prediction performance. To quantify the relationship between object-oriented (OO) metrics and defects, modelling has to take into account the direct, and potentially indirect, effects of class size on defects. However, some studies have shown that size cannot be simply controlled or ignored, when building prediction models. As such, there remains a question whether, and when, to control for class size. This study provides a new in-depth examination of the impact of class size on the relationship between OO metrics and software defects or defect-proneness. We assess the impact of class size on the number of defects and defect-proneness in software systems by employing a regression-based mediation (with bootstrapping) and moderation analysis to investigate the direct and indirect effect of class size in count and binary defect prediction. Our results show that the size effect is not always significant for all metrics. Of the seven OO metrics we investigated, size consistently has significant mediation impact only on the relationship between Coupling Between Objects (CBO) and defects/defect-proneness, and a potential moderation impact on the relationship between Fan-out and defects/defect-proneness. Other metrics show mixed results, in that they are significant for some systems but not for others. Based on our results we make three recommendations. One, we encourage researchers and practitioners to examine the impact of class size for the specific data they have in hand and through the use of the proposed statistical mediation/moderation procedures. Two, we encourage empirical studies to investigate the indirect effect of possible additional variables in their models when relevant. Three, the statistical procedures adopted in this study could be used in other empirical software engineering research to investigate the influence of potential mediators/moderators.
引用
收藏
页数:38
相关论文
共 50 条
  • [21] The optimal class size for object-oriented software
    El Emam, K
    Benlarbi, S
    Goel, N
    Melo, W
    Lounis, H
    Rai, SN
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (05) : 494 - 509
  • [22] THE EFFECT OF CLASS SIZE ON ACHIEVEMENT AND ATTITUDES
    EASH, MJ
    BENNETT, CM
    AMERICAN EDUCATIONAL RESEARCH JOURNAL, 1964, 1 (04) : 229 - 239
  • [23] The impact of details in the class diagram on software size estimation
    Zivkovic, A
    Hericko, M
    Brumen, B
    Beloglavec, S
    Rozman, I
    INFORMATICA, 2005, 16 (02) : 295 - 312
  • [24] Analyzing the Role of Class Rebalancing Techniques in Software Defect Prediction
    Alqasrawi, Yousef
    Azzeh, Mohammad
    Elsheikh, Yousef
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2024, 23 (06) : 2167 - 2207
  • [25] A Comprehensive Investigation of the Impact of Class Overlap on Software Defect Prediction
    Gong, Lina
    Zhang, Haoxiang
    Zhang, Jingxuan
    Wei, Mingqiang
    Huang, Zhiqiu
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2440 - 2458
  • [26] Tackling class overlap and imbalance problems in software defect prediction
    Lin Chen
    Bin Fang
    Zhaowei Shang
    Yuanyan Tang
    Software Quality Journal, 2018, 26 : 97 - 125
  • [27] Class Imbalance Data-Generation for Software Defect Prediction
    Li, Zheng
    Zhang, Xingyao
    Guo, Junxia
    Shang, Ying
    2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 276 - 283
  • [28] Software Defect Prediction Based on Class-Association Rules
    Shao, Yuanxun
    Liu, Bin
    Li, Guoqi
    Wang, Shihai
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON RELIABILITY SYSTEMS ENGINEERING (ICRSE 2017), 2017,
  • [29] Tackling class overlap and imbalance problems in software defect prediction
    Chen, Lin
    Fang, Bin
    Shang, Zhaowei
    Tang, Yuanyan
    SOFTWARE QUALITY JOURNAL, 2018, 26 (01) : 97 - 125
  • [30] SOFTWARE DEFECT PREDICTION: ANALYSIS OF CLASS IMBALANCE AND PERFORMANCE STABILITY
    Balogun, Abdullateef O.
    Basri, Shuib
    Abdulkadir, Said J.
    Adeyemo, Victor E.
    Imam, Abdullahi A.
    Bajeh, Amos O.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2019, 14 (06): : 3294 - 3308