Effect size, confidence interval and statistical significance: a practical guide for biologists

Cited by: 2784
Authors
Nakagawa, Shinichi [1]
Cuthill, Innes C. [2]
Affiliations
[1] Univ Sheffield, Dept Anim & Plant Sci, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Bristol, Sch Biol Sci, Bristol BS8 1UG, Avon, England
Keywords
Bonferroni correction; confidence interval; effect size; effect statistic; meta-analysis; null hypothesis significance testing; p value; power analysis; statistical significance
DOI
10.1111/j.1469-185X.2007.00027.x
Chinese Library Classification (CLC) number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Null hypothesis significance testing (NHST) is the dominant statistical approach in biology, although it has many, frequently unappreciated, problems. Most importantly, NHST does not provide us with two crucial pieces of information: (1) the magnitude of an effect of interest, and (2) the precision of the estimate of the magnitude of that effect. All biologists should be ultimately interested in biological importance, which may be assessed using the magnitude of an effect, but not its statistical significance. Therefore, we advocate presentation of measures of the magnitude of effects (i.e. effect size statistics) and their confidence intervals (CIs) in all biological journals. Combined use of an effect size and its CIs enables one to assess the relationships within data more effectively than the use of p values, regardless of statistical significance. In addition, routine presentation of effect sizes will encourage researchers to view their results in the context of previous research and facilitate the incorporation of results into future meta-analysis, which has been increasingly used as the standard method of quantitative review in biology. In this article, we extensively discuss two dimensionless (and thus standardised) classes of effect size statistics: d statistics (standardised mean difference) and r statistics (correlation coefficient), because these can be calculated from almost all study designs and also because their calculations are essential for meta-analysis. However, our focus on these standardised effect size statistics does not mean unstandardised effect size statistics (e.g. mean difference and regression coefficient) are less important. We provide potential solutions for four main technical problems researchers may encounter when calculating effect size and CIs: (1) when covariates exist, (2) when bias in estimating effect size is possible, (3) when data have non-normal error structure and/or variances, and (4) when data are non-independent. 
Although interpretations of effect sizes are often difficult, we provide some pointers to help researchers. This paper serves both as a beginner's instruction manual and a stimulus for changing statistical practice for the better in the biological sciences.
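The two standardised effect size classes named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names are hypothetical, and the interval formulas are common large-sample approximations (a normal approximation for d and Fisher's z transformation for r) assumed here for illustration, not necessarily the methods the paper recommends.

```python
import math
from statistics import NormalDist, mean, stdev


def cohens_d(x, y):
    """Standardised mean difference (d) using the pooled standard deviation."""
    n1, n2 = len(x), len(y)
    pooled_var = ((n1 - 1) * stdev(x) ** 2 + (n2 - 1) * stdev(y) ** 2) / (n1 + n2 - 2)
    return (mean(x) - mean(y)) / math.sqrt(pooled_var)


def d_confidence_interval(d, n1, n2, level=0.95):
    """Approximate CI for d (assumption: large-sample normal approximation)."""
    se = math.sqrt((n1 + n2) / (n1 * n2) + d ** 2 / (2 * (n1 + n2)))
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return d - z * se, d + z * se


def r_confidence_interval(r, n, level=0.95):
    """Approximate CI for a correlation r via Fisher's z transformation."""
    z_r = math.atanh(r)            # transform r to the z scale
    se = 1 / math.sqrt(n - 3)      # standard error on the z scale
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return math.tanh(z_r - z * se), math.tanh(z_r + z * se)


# Illustrative data only: two groups whose means differ by one pooled SD.
group_a = [1, 2, 3]
group_b = [0, 1, 2]
d = cohens_d(group_a, group_b)     # d == 1.0
print(d, d_confidence_interval(d, len(group_a), len(group_b)))
print(r_confidence_interval(0.5, 30))
```

With such small samples the interval for d is wide and includes zero. This is the abstract's point about precision: an effect size reported without its CI says little about how well the magnitude is estimated.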
Pages: 591-605
Page count: 15
Related papers
50 in total
  • [41] Misconceptions about sample size, statistical significance, and treatment effect
    Wilkerson, M
    Olson, MR
    JOURNAL OF PSYCHOLOGY, 1997, 131 (06) : 627 - 631
  • [42] A practical guide for understanding confidence intervals and P values
    Wang, Eric W.
    Ghogomu, Nsangou
    Voelker, Courtney C. J.
    Rich, Jason T.
    Paniello, Randal C.
    Nussenbaum, Brian
    Karni, Ron J.
    Neely, J. Gail
    OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2009, 140 (06) : 794 - +
  • [43] PRACTICAL SIGNIFICANCE AND STATISTICAL-MODELS
    MCNAMARA, JF
    EDUCATIONAL ADMINISTRATION QUARTERLY, 1978, 14 (01) : 48 - 63
  • [44] Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists
    Kershenbaum, Arik
    Akcay, Caglar
    Babu-Saheer, Lakshmi
    Barnhill, Alex
    Best, Paul
    Cauzinille, Jules
    Clink, Dena
    Dassow, Angela
    Dufourq, Emmanuel
    Growcott, Jonathan
    Markham, Andrew
    Marti-Domken, Barbara
    Marxer, Ricard
    Muir, Jen
    Reynolds, Sam
    Root-Gutteridge, Holly
    Sadhukhan, Sougata
    Schindler, Loretta
    Smith, Bethany R.
    Stowell, Dan
    Wascher, Claudia A. F.
    Dunn, Jacob C.
    BIOLOGICAL REVIEWS, 2025, 100 (02) : 620 - 646
  • [45] Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists
    Kershenbaum, Arik
    Akcay, Caglar
    Babu-Saheer, Lakshmi
    Barnhill, Alex
    Best, Paul
    Cauzinille, Jules
    Clink, Dena
    Dassow, Angela
    Dufourq, Emmanuel
    Growcott, Jonathan
    Markham, Andrew
    Marti-Domken, Barbara
    Marxer, Ricard
    Muir, Jen
    Reynolds, Sam
    Root-Gutteridge, Holly
    Sadhukhan, Sougata
    Schindler, Loretta
    Smith, Bethany R.
    Stowell, Dan
    Wascher, Claudia A. F.
    Dunn, Jacob C.
    BIOLOGICAL REVIEWS, 2024,
  • [47] Multivariate Effect Size Estimation: Confidence Interval Construction via Latent Variable Modeling
    Raykov, Tenko
    Marcoulides, George A.
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2010, 35 (04) : 407 - 421
  • [48] Sample size and the width of the confidence interval for mean difference
    Liu, Xiaofeng Steven
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2009, 62 : 201 - 215
  • [49] Sample size for the Z test and its confidence interval
    Liu, Xiaofeng Steven
    INTERNATIONAL JOURNAL OF MATHEMATICAL EDUCATION IN SCIENCE AND TECHNOLOGY, 2012, 43 (02) : 266 - 270
  • [50] Efficient Adjusted Joint Significance Test and Sobel-Type Confidence Interval for Mediation Effect
    Zhang, Haixiang
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2025, 32 (01) : 93 - 104