The distribution of P-values in medical research articles suggested selective reporting associated with statistical significance

Cited by: 32
Authors
Perneger, Thomas V. [1]
Combescure, Christophe
Affiliations
[1] Univ Geneva, Fac Med, Div Clin Epidemiol, 6 Rue Gabrielle Perret Gentil, CH-1211 Geneva, Switzerland
Keywords
Statistical tests; P-values; Publication bias; Practice of research; SCIENCE-WISE FALSE DISCOVERY RATE; PUBLICATION; INFERENCES; ABSTRACTS
DOI
10.1016/j.jclinepi.2017.04.003
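Chinese Library Classification number
R19 [Health care organizations and services (health services administration)];
Discipline classification code
Abstract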
Objectives: Published P-values provide a window into the global enterprise of medical research. The aim of this study was to use the distribution of published P-values to estimate the relative frequencies of null and alternative hypotheses and to seek irregularities suggestive of publication bias.

Study Design and Setting: This cross-sectional study included P-values published in 120 medical research articles in 2016 (30 each from the BMJ, JAMA, Lancet, and New England Journal of Medicine). The observed distribution of P-values was compared with expected distributions under the null hypothesis (i.e., uniform between 0 and 1) and the alternative hypothesis (strictly decreasing from 0 to 1). P-values were categorized according to conventional levels of statistical significance and in one-percent intervals.

Results: Among 4,158 recorded P-values, 26.1% were highly significant (P < 0.001), 9.1% were moderately significant (P >= 0.001 to < 0.01), 11.7% were weakly significant (P >= 0.01 to < 0.05), and 53.2% were nonsignificant (P >= 0.05). We noted three irregularities: (1) a high proportion of P-values < 0.001, especially in observational studies, (2) an excess of P-values equal to 1, and (3) about twice as many P-values less than 0.05 compared with those more than 0.05. The last finding was seen in both randomized trials and observational studies, and in most types of analyses, except heterogeneity tests and interaction tests. Under plausible assumptions, we estimate that about half of the tested hypotheses were null and the other half were alternative.

Conclusion: This analysis suggests that statistical tests published in medical journals are not a random sample of null and alternative hypotheses but that selective reporting is prevalent. In particular, significant results are about twice as likely to be reported as nonsignificant results. © 2017 Elsevier Inc. All rights reserved.
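The comparison described in the abstract can be illustrated with a minimal Python sketch. It uses only the four category shares reported above and contrasts them with the shares expected if every tested hypothesis were null (P uniform on [0, 1]). It also computes a simple threshold-based upper bound on the null proportion in the style of Storey's estimator. This is an illustrative assumption, not the authors' estimation procedure, and it ignores the selective reporting the study describes. The names `observed`, `bounds`, `null_expected`, and `pi0_upper_bound` are hypothetical.

```python
# Category shares reported in the abstract (fractions of 4,158 P-values).
observed = {
    "P < 0.001": 0.261,
    "0.001 <= P < 0.01": 0.091,
    "0.01 <= P < 0.05": 0.117,
    "P >= 0.05": 0.532,
}

# Lower and upper bound of each category on the [0, 1] scale.
bounds = {
    "P < 0.001": (0.0, 0.001),
    "0.001 <= P < 0.01": (0.001, 0.01),
    "0.01 <= P < 0.05": (0.01, 0.05),
    "P >= 0.05": (0.05, 1.0),
}


def null_expected(category_bounds):
    """Share of P-values per category if all hypotheses were null.

    Under the null, P is uniform on [0, 1], so the expected share of a
    category equals the width of its interval.
    """
    return {name: hi - lo for name, (lo, hi) in category_bounds.items()}


def pi0_upper_bound(share_above, threshold):
    """Threshold-based upper bound on the proportion of null hypotheses.

    Assumption (not taken from the paper): P-values above `threshold`
    come almost entirely from nulls, and nothing is selectively reported.
    """
    return min(1.0, share_above / (1.0 - threshold))


expected = null_expected(bounds)
for name in observed:
    print(f"{name:>18}: observed {observed[name]:.3f}, "
          f"expected under null {expected[name]:.3f}")

pi0 = pi0_upper_bound(observed["P >= 0.05"], 0.05)
print(f"Upper bound on null proportion: {pi0:.2f}")
```

With the reported shares, the bound works out to roughly 0.56, which is in the same range as the abstract's statement that about half of the tested hypotheses were null. The agreement is only indicative, since the sketch assumes away the reporting bias that the study is about.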
Pages: 70-77
Number of pages: 8
Related papers
24 in total
  • [1] Statistical Significance, p-Values, and the Reporting of Uncertainty
    Imbens, Guido W.
    JOURNAL OF ECONOMIC PERSPECTIVES, 2021, 35 (03) : 157 - 173
  • [2] Statistical significance and p-values: guidelines for use and reporting
    Parsons, N.
    Carey-Smith, R.
    Dritsaki, M.
    Griffin, X.
    Metcalfe, D.
    Perry, D.
    Stengel, D.
    Costa, M.
    BONE & JOINT JOURNAL, 2019, 101B (10) : 1179 - 1183
  • [3] On reporting and interpreting statistical significance and p values in medical research
    Aguinis, Herman
    Vassar, Matt
    Wayant, Cole
    BMJ EVIDENCE-BASED MEDICINE, 2021, 26 (02) : 39 - 42
  • [4] On p-Values and Statistical Significance
    Bonovas, Stefanos
    Piovani, Daniele
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (03)
  • [5] Publishing research with P-values: Prescribe more stringent statistical significance or proscribe statistical significance?
    Ioannidis, John P. A.
    EUROPEAN HEART JOURNAL, 2019, 40 (31) : 2553 - 2554
  • [6] ASA Statement on Statistical Significance and P-Values
    Wasserstein, Ronald L.
    AMERICAN STATISTICIAN, 2016, 70 (02) : 131 - 133
  • [7] EDITORIAL: STATISTICAL SIGNIFICANCE, P-VALUES, AND REPLICABILITY
    Kafadar, Karen
    ANNALS OF APPLIED STATISTICS, 2021, 15 (03) : 1081 - 1083
  • [8] The JMIG Issues New Guidelines on Statistical Reporting and p-values
    Wilson, Jeffrey R.
    Falcone, Tommaso
    JOURNAL OF MINIMALLY INVASIVE GYNECOLOGY, 2020, 27 (01) : 1 - 3
  • [9] Statistical Significance and the Dichotomization of Evidence: The Relevance of the ASA Statement on Statistical Significance and p-Values for Statisticians
    Laber, Eric B.
    Shedden, Kerby
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (519) : 902 - 904
  • [10] Inconsistent conclusions of statistical significance based on p-values and confidence intervals
    Nicolas Bamat
    Matthew Bryan
    Erik A Jensen
    Journal of Perinatology, 2018, 38 : 295 - 296