A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results

被引:15
|
作者
van Ravenzwaaij, Don [1 ]
Ioannidis, John P. A. [2 ,3 ,4 ,5 ]
机构
[1] Univ Groningen, Dept Psychol, Groningen, Netherlands
[2] Stanford Univ, Dept Med, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[5] Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Stanford, CA 94305 USA
来源
PLOS ONE | 2017年 / 12卷 / 03期
关键词
P-VALUES; CONFIDENCE-INTERVALS; RANDOMIZED-TRIALS; CLINICAL-TRIALS; HYPOTHESIS; TESTS;
D O I
10.1371/journal.pone.0173184
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A typical rule that has been used for the endorsement of new medications by the Food and Drug Administration is to have two trials, each convincing on its own, demonstrating effectiveness. "Convincing" may be subjectively interpreted, but the use of p-values and the focus on statistical significance (in particular with p < .05 being coined significant) is pervasive in clinical research. Therefore, in this paper, we calculate with simulations what it means to have exactly two trials, each with p < .05, in terms of the actual strength of evidence quantified by Bayes factors. Our results show that different cases where two trials have a p-value below .05 have wildly differing Bayes factors. Bayes factors of at least 20 in favor of the alternative hypothesis are not necessarily achieved and they fail to be reached in a large proportion of cases, in particular when the true effect size is small (0.2 standard deviations) or zero. In a non-trivial number of cases, evidence actually points to the null hypothesis, in particular when the true effect size is zero, when the number of trials is large, and when the number of participants in both groups is low. We recommend use of Bayes factors as a routine tool to assess endorsement of new medications, because Bayes factors consistently quantify strength of evidence. Use of p-values may lead to paradoxical and spurious decision-making regarding the use of new medications.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Real world effects of COPD medications: a cohort study with validation against results from randomised controlled trials
    Wing, Kevin
    Williamson, Elizabeth
    Carpenter, James R.
    Wise, Lesley
    Schneeweiss, Sebastian
    Smeeth, Liam
    Quint, Jennifer K.
    Douglas, Ian
    EUROPEAN RESPIRATORY JOURNAL, 2021, 57 (03)
  • [42] Evidence-based study design in ocular allergy trials
    Katelaris, Constance H.
    Bielory, Leonard
    CURRENT OPINION IN ALLERGY AND CLINICAL IMMUNOLOGY, 2008, 8 (05) : 484 - 488
  • [43] A generalised model for individualising a treatment recommendation based on group-level evidence from randomised clinical trials
    Marcucci, Maura
    Sinclair, John C.
    BMJ OPEN, 2013, 3 (08):
  • [44] A flexible modelling of the time trend for the analysis of stepped wedge trials: results of a simulation study
    Di Tanna, Gian Luca
    Gasparrini, Antonio
    TRIALS, 2016, 17
  • [45] Evidence-based medicine in oncology:: Do the results of trials reflect clinical reality
    Hoelzel, D.
    Schubert-Fritschle, G.
    ZENTRALBLATT FUR CHIRURGIE, 2008, 133 (01): : 15 - 19
  • [46] Evidence-based data for newly approved medications' use in older adults: An analysis from clinical trials to monographs
    Gosselin, Maude
    Baroud, Marie-Laure
    Denis, Odelie-Rose
    Gagnon, Marie-Eve
    Sirois, Caroline
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2024, 72 (04) : 1252 - 1262
  • [47] Optimal design of clinical trials with computer simulation based on results of earlier trials, illustrated with a lipodystrophy trial in HIV patients
    Abbas, Ismail
    Rovira, Joan
    Casanovas, Josep
    Greenfield, Tony
    JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (06) : 1053 - 1061
  • [48] Evidence based sample size calculations for future trials based on results of current meta-analyses
    Sutton, A
    Cooper, N
    Abrams, K
    CONTROLLED CLINICAL TRIALS, 2003, 24 : 88S - 88S
  • [49] Recommendation of New Medical Alarms Based on Audibility, Identifiability, and Detectability in a Randomized, Simulation-Based Study
    Bennett, Christopher
    Dudaryk, Roman
    Crenshaw, Nichole
    Edworthy, Judy
    McNeer, Richard
    CRITICAL CARE MEDICINE, 2019, 47 (08) : 1050 - 1057
  • [50] Study on the Soft Handoff Simulation Based on Received Signal Strength in CDMA
    Mao, Wandui
    He, Zhaofeng
    Liu, Ningning
    Yang, Ming
    Liu, Dongmei
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL II, PROCEEDINGS, 2009, : 145 - +