Does class size matter? An in-depth assessment of the effect of class size in software defect prediction

被引：5

作者：

Tahir, Amjed ^{[1
]}

Bennin, Kwabena E. ^{[2
]}

Xiao, Xun ^{[1
,3
]}

MacDonell, Stephen G. ^{[4
,5
]}

机构：

[1] Massey Univ, Software Engn, Palmerston North, New Zealand

[2] Wageningen Univ & Res, Informat Technol Grp, Wageningen, Netherlands

[3] Massey Univ, Sch Fundamental Sci, Palmerston North, New Zealand

[4] Auckland Univ Technol, Software Engn, Auckland, New Zealand

[5] Univ Otago, Informat Sci, Dunedin, New Zealand

来源：

EMPIRICAL SOFTWARE ENGINEERING | 2021年 / 26卷 / 05期

关键词：

Defect prediction; Class size; Metrics; Software quality; ORIENTED DESIGN METRICS; EMPIRICAL VALIDATION; VALIDITY; CODE; MEDIATION; FAILURES; QUALITY;

D O I：

10.1007/s10664-021-09991-3

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In the past 20 years, defect prediction studies have generally acknowledged the effect of class size on software prediction performance. To quantify the relationship between object-oriented (OO) metrics and defects, modelling has to take into account the direct, and potentially indirect, effects of class size on defects. However, some studies have shown that size cannot be simply controlled or ignored, when building prediction models. As such, there remains a question whether, and when, to control for class size. This study provides a new in-depth examination of the impact of class size on the relationship between OO metrics and software defects or defect-proneness. We assess the impact of class size on the number of defects and defect-proneness in software systems by employing a regression-based mediation (with bootstrapping) and moderation analysis to investigate the direct and indirect effect of class size in count and binary defect prediction. Our results show that the size effect is not always significant for all metrics. Of the seven OO metrics we investigated, size consistently has significant mediation impact only on the relationship between Coupling Between Objects (CBO) and defects/defect-proneness, and a potential moderation impact on the relationship between Fan-out and defects/defect-proneness. Other metrics show mixed results, in that they are significant for some systems but not for others. Based on our results we make three recommendations. One, we encourage researchers and practitioners to examine the impact of class size for the specific data they have in hand and through the use of the proposed statistical mediation/moderation procedures. Two, we encourage empirical studies to investigate the indirect effect of possible additional variables in their models when relevant. Three, the statistical procedures adopted in this study could be used in other empirical software engineering research to investigate the influence of potential mediators/moderators.

引用

页数：38

共 50 条

[41] ON THE EFFECT OF CLASS SIZE ON THE EVALUATION OF LECTURERS PERFORMANCE
COHEN, A
AMERICAN STATISTICIAN, 1983, 37 (04): : 331 - 333
[42] Undergraduate Research Methods: Does Size Matter? A Look at the Attitudes and Outcomes of Students in a Hybrid Class Format versus a Traditional Class Format
Gordon, Jill
Barnes, Christina
Martin, Kasey
JOURNAL OF CRIMINAL JUSTICE EDUCATION, 2009, 20 (03) : 227 - 248
[43] Size does matter: An assessment of reproductive potential in seahorses
Faleiro, Filipa
Almeida, Armando J.
Re, Pedro
Narciso, Luis
ANIMAL REPRODUCTION SCIENCE, 2016, 170 : 61 - 67
[44] Combat with Class Overlapping in Software Defect Prediction Using Neighbourhood Metric
Gupta S.
Richa
Kumar R.
Jain K.L.
SN Computer Science, 4 (5)
[45] DOES CLASS MATTER? THE EFFECT OF SOCIAL CLASS ON JOURNALISTS' ETHICAL DECISION MAKING
Correa, Teresa
JOURNALISM & MASS COMMUNICATION QUARTERLY, 2009, 86 (03) : 654 - 672
[46] Adapting God Class thresholds for software defect prediction: A case study
Gradisnik, Mitja
Beranic, Tina
Karakatic, Saso
Mausa, Goran
2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1537 - 1542
[47] The Impact Study of Class Imbalance on the Performance of Software Defect Prediction Models
Yu Q.
Jiang S.-J.
Zhang Y.-M.
Wang X.-Y.
Gao P.-F.
Qian J.-Y.
Qian, Jun-Yan (qjy2000@gmail.com), 2018, Science Press (41): : 809 - 824
[48] An Ensemble Oversampling Model for Class Imbalance Problem in Software Defect Prediction
Huda, Shamsul
Liu, Kevin
Abdelrazek, Mohamed
Ibrahim, Amani
Alyahya, Sultan
Al-Dossari, Hmood
Ahmad, Shafiq
IEEE ACCESS, 2018, 6 : 24184 - 24195
[49] A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction
Dar, Abdul Waheed
Farooq, Sheikh Umar
INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
[50] IH:mpirical Evaluation of the Impact of Class Overlap on Software Defect Prediction
Gong, Lina
Jiang, Shujuan
Wang, Rongcun
Jiang, Li
34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2019), 2019, : 710 - 721

← 1 2 3 4 5 →