Statistical significance testing - a panacea for software technology experiments?

被引:24
|
作者
Miller, J [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, STEAM Res Ctr, Edmonton, AB T6H 5M3, Canada
关键词
empirical; hypothesis; replication;
D O I
10.1016/j.jss.2003.12.019
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Empirical software engineering has a long history of utilizing statistical significance testing, and in many ways, it has become the backbone of the topic. What is less obvious is how much consideration has been given to its adoption. Statistical significance testing was initially designed for testing hypotheses in a very different area, and hence the question must be asked: does it transfer into empirical software engineering research? This paper attempts to address this question. The paper finds that this transference is far from straightforward, resulting in several problems in its deployment within the area. Principally problems exist in: formulating hypotheses, the calculation of the probability values and its associated cut-off value, and the construction of the sample and its distribution. Hence, the paper concludes that the topic should explore other avenues of analysis, in an attempt to establish which analysis approaches are preferable under which conditions, when conducting empirical software engineering studies. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 50 条
  • [31] A Machine Learning Approach for Statistical Software Testing
    Baskiotis, Nicolas
    Sebag, Michele
    Gaudel, Marie-Claude
    Gouraud, Sandrine
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2274 - 2279
  • [32] On Some Statistical Aspects of Software Testing and Reliability
    Coolen, Frank P. A.
    COMPLEX SYSTEMS AND DEPENDABILITY, 2012, 170 : 103 - 113
  • [33] A case for new statistical software testing models
    May, John
    Ponomarev, Maxim
    Kuball, Silke
    Gallardo, Julio
    2006 PROCEEDINGS - ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, VOLS 1 AND 2, 2006, : 349 - +
  • [34] Statistical testing of software based on a usage model
    Walton, Gwendolyn H.
    Poore, J.H.
    Trammell, Carmen J.
    Software - Practice and Experience, 1995, 25 (01): : 97 - 108
  • [35] Tool support for statistical testing of software components
    Shukla, R
    Strooper, P
    Carrington, D
    12TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2005, : 719 - 726
  • [36] Application of Delphi technology in the testing software
    Lu, Changhua
    Xu, Wen
    Liu, Chun
    Wang, Guangchun
    Huagong Zidonghua Ji Yibiao/Control and Instruments in Chemical Industry, 2000, 27 (03): : 37 - 40
  • [37] Technology alone is not a panacea
    Tannous, AI
    FUTURIST, 1997, 31 (04) : 2 - 2
  • [38] Statistical Significance Testing for Natural Language Processing
    Levy, Francois
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2020, 61 (03): : 95 - 98
  • [39] Testing the statistical significance of linear programming estimators
    Horsky, D
    Nelson, P
    MANAGEMENT SCIENCE, 2006, 52 (01) : 128 - 135
  • [40] Advances in testing the statistical significance of mediation effects
    Mallinckrodt, Brent
    Abraham, W. Todd
    Wei, Meifen
    Russell, Daniel W.
    JOURNAL OF COUNSELING PSYCHOLOGY, 2006, 53 (03) : 372 - 378