A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results

被引:15
|
作者
van Ravenzwaaij, Don [1 ]
Ioannidis, John P. A. [2 ,3 ,4 ,5 ]
机构
[1] Univ Groningen, Dept Psychol, Groningen, Netherlands
[2] Stanford Univ, Dept Med, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[5] Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Stanford, CA 94305 USA
来源
PLOS ONE | 2017年 / 12卷 / 03期
关键词
P-VALUES; CONFIDENCE-INTERVALS; RANDOMIZED-TRIALS; CLINICAL-TRIALS; HYPOTHESIS; TESTS;
D O I
10.1371/journal.pone.0173184
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A typical rule that has been used for the endorsement of new medications by the Food and Drug Administration is to have two trials, each convincing on its own, demonstrating effectiveness. "Convincing" may be subjectively interpreted, but the use of p-values and the focus on statistical significance (in particular with p < .05 being coined significant) is pervasive in clinical research. Therefore, in this paper, we calculate with simulations what it means to have exactly two trials, each with p < .05, in terms of the actual strength of evidence quantified by Bayes factors. Our results show that different cases where two trials have a p-value below .05 have wildly differing Bayes factors. Bayes factors of at least 20 in favor of the alternative hypothesis are not necessarily achieved and they fail to be reached in a large proportion of cases, in particular when the true effect size is small (0.2 standard deviations) or zero. In a non-trivial number of cases, evidence actually points to the null hypothesis, in particular when the true effect size is zero, when the number of trials is large, and when the number of participants in both groups is low. We recommend use of Bayes factors as a routine tool to assess endorsement of new medications, because Bayes factors consistently quantify strength of evidence. Use of p-values may lead to paradoxical and spurious decision-making regarding the use of new medications.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] The fragility of statistically significant results in otolaryngology randomized trials
    Skinner, Mason
    Tritz, Daniel
    Farahani, Clayton
    Ross, Andrew
    Hamilton, Tom
    Vassar, Matt
    AMERICAN JOURNAL OF OTOLARYNGOLOGY, 2019, 40 (01) : 61 - 66
  • [2] THE FRAGILITY OF STATISTICALLY SIGNIFICANT RESULTS FROM RANDOMIZED TRIALS IN UROLOGY
    Narayan, Vikram
    Gandhi, Shreyas
    Chrouser, Kristin
    Evaniew, Nathan
    Dahm, Philipp
    JOURNAL OF UROLOGY, 2017, 197 (04): : E1131 - E1132
  • [3] Evaluating Non-Statistically Significant Results From Trials in Practice
    Hofmann, Stefan G.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 324 (16): : 1679 - +
  • [4] The Fragility of Statistically Significant Results in Randomized Clinical Trials for COVID-19
    Itaya, Takahiro
    Isobe, Yotsuha
    Suzuki, Sayoko
    Koike, Kanako
    Nishigaki, Masakazu
    Yamamoto, Yosuke
    JAMA NETWORK OPEN, 2022, 5 (03)
  • [5] The fragility of statistically significant results from clinical nutrition randomized controlled trials
    Pedziwiatr, Michal
    Mizera, Magdalena
    Wysocki, Michal
    Malczak, Piotr
    Stefura, Tomasz
    Gajewska, Natalia
    Torbicz, Grzegorz
    Dros, Jakub
    Kacprzyk, Artur
    Major, Piotr
    Klek, Stanislaw
    Bala, Malgorzata
    CLINICAL NUTRITION, 2020, 39 (04) : 1284 - 1291
  • [6] Evaluating Non-Statistically Significant Results From Trials in Practice Reply
    Young, Paul J.
    Nickson, Christopher P.
    Perner, Anders
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 324 (16): : 1680 - 1680
  • [7] Clinical trials in urology: how many patients are required to achieve statistically significant results?
    Parazzini, F
    Colli, E
    Chatenoud, L
    Malvezzi, M
    Olivieri, L
    Montorsi, F
    La Vecchia, C
    BJU INTERNATIONAL, 2005, 95 (05) : 717 - 722
  • [8] Reporting of statistically significant results at ClinicalTrials.gov for completed superiority randomized controlled trials
    Dechartres, Agnes
    Bond, Elizabeth G.
    Scheer, Jordan
    Riveros, Carolina
    Atal, Ignacio
    Ravaud, Philippe
    BMC MEDICINE, 2016, 14
  • [9] Reporting of statistically significant results at ClinicalTrials.gov for completed superiority randomized controlled trials
    Agnes Dechartres
    Elizabeth G. Bond
    Jordan Scheer
    Carolina Riveros
    Ignacio Atal
    Philippe Ravaud
    BMC Medicine, 14
  • [10] When Should Clinicians Act on Non-Statistically Significant Results From Clinical Trials?
    Young, Paul J.
    Nickson, Christopher P.
    Perner, Anders
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 323 (22): : 2256 - 2257