The distribution of P-values in medical research articles suggested selective reporting associated with statistical significance

被引:32
|
作者
Perneger, Thomas V. [1 ]
Combescure, Christophe
机构
[1] Univ Geneva, Fac Med, Div Clin Epidemiol, 6 Rue Gabrielle Perret Gentil, CH-1211 Geneva, Switzerland
关键词
Statistical tests; P-values; Publication bias; Practice of research; SCIENCE-WISE FALSE; DISCOVERY RATE; PUBLICATION; INFERENCES; ABSTRACTS;
D O I
10.1016/j.jclinepi.2017.04.003
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: Published P-values provide a window into the global enterprise of medical research. The aim of this study was to use the distribution of published P-values to estimate the relative frequencies of null and alternative hypotheses and to seek irregularities suggestive of publication bias. Study Design and Setting: This cross-sectional study included P-values published in 120 medical research articles in 2016 (30 each from the BMJ, JAMA, Lancet, and New England Journal of Medicine). The observed distribution of P-values was compared with expected distributions under the null hypothesis (i.e., uniform between 0 and 1) and the alternative hypothesis (strictly decreasing from 0 to 1). P-values were categorized according to conventional levels of statistical significance and in one-percent intervals. Results: Among 4,158 recorded P-values, 26.1% were highly significant (P < 0.001), 9.1% were moderately significant (P > 0.001 to < 0.01), 11.7% were weakly significant (P >= 0.01 to < 0.05), and 53.2% were nonsignificant (P >= 0.05). We noted three irregularities: (1) high proportion of P-values <0.001, especially in observational studies, (2) excess of P-values equal to 1, and (3) about twice as many P-values less than 0.05 compared with those more than 0.05. The latter finding was seen in both randomized trials and observational studies, and in most types of analyses, excepting heterogeneity tests and interaction tests. Under plausible assumptions, we estimate that about half of the tested hypotheses were null and the other half were alternative. Conclusion: This analysis suggests that statistical tests published in medical journals are not a random sample of null and alternative hypotheses but that selective reporting is prevalent. In particular, significant results are about twice as likely to be reported as nonsignificant results. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:70 / 77
页数:8
相关论文
共 24 条
  • [11] Inconsistent conclusions of statistical significance based on p-values and confidence intervals
    Bamat, Nicolas
    Bryan, Matthew
    Jensen, Erik A.
    JOURNAL OF PERINATOLOGY, 2018, 38 (03) : 295 - 296
  • [12] Reviewing research reporting in randomised controlled trials: Confidence and P-values
    Ganesh, Venkata
    Sahni, Neeru
    INDIAN JOURNAL OF ANAESTHESIA, 2024, 68 (05) : 492 - 495
  • [13] Statcheck in Canada: What Proportion of CPA Journal Articles Contain Errors in the Reporting of p-Values?
    Green, Christopher D.
    Abbas, Sahir
    Belliveau, Arlie
    Beribisky, Nataly
    Davidson, Ian J.
    DiGiovanni, Julian
    Heidari, Crystal
    Martin, Shane M.
    Oosenbrug, Eric
    Wainwright, Linda M.
    CANADIAN PSYCHOLOGY-PSYCHOLOGIE CANADIENNE, 2018, 59 (03): : 203 - 210
  • [14] An unexpected influence of widely used significance thresholds on the distribution of reported P-values
    Ridley, J.
    Kolm, N.
    Freckelton, R. P.
    Gage, M. J. G.
    JOURNAL OF EVOLUTIONARY BIOLOGY, 2007, 20 (03) : 1082 - 1089
  • [15] Fiducialize statistical significance: transforming p-values into conservative posterior probabilities and Bayes factors
    Bickel, David R. R.
    STATISTICS, 2023, 57 (04) : 941 - 959
  • [16] Statistical significance testing and p-values: Defending the indefensible? A discussion paper and position statement
    Griffiths, Peter
    Needleman, Jack
    INTERNATIONAL JOURNAL OF NURSING STUDIES, 2019, 99
  • [17] Beyond p-values: A phase II dual-criterion design with statistical significance and clinical relevance
    Roychoudhury, Satrajit
    Scheuer, Nicolas
    Neuenschwander, Beat
    CLINICAL TRIALS, 2018, 15 (05) : 452 - 461
  • [18] p-Values and confidence intervals as compatibility measures: guidelines for interpreting statistical studies in clinical research
    Rovetta, Alessandro
    Piretta, Luca
    Mansournia, Mohammad Ali
    LANCET REGIONAL HEALTH - SOUTHEAST ASIA, 2025, 33
  • [19] Best uses of p-values and complementary measures in medical research: Recent developments in the frequentist and Bayesian frameworks
    Quatto, Piero
    Ripamonti, Enrico
    Marasini, Donata
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2020, 30 (01) : 121 - 142
  • [20] The reporting of p values, confidence intervals and statistical significance in Preventive Veterinary Medicine (1997-2017)
    Messam, Locksley L. Mc, V
    Weng, Hsin-Yi
    Rosenberger, Nicole W. Y.
    Tan, Zhi Hao
    Payet, Stephanie D. M.
    Santbakshsing, Mahishi
    PEERJ, 2021, 9