Inter-Rater Agreement in Assessing Risk of Bias in Melanoma Prediction Studies Using the Prediction Model Risk of Bias Assessment Tool (PROBAST): Results from a Controlled Experiment on the Effect of Specific Rater Training

被引:6
|
作者
Kaiser, Isabelle [1 ]
Pfahlberg, Annette B. [1 ]
Mathes, Sonja [2 ]
Uter, Wolfgang [1 ]
Diehl, Katharina [1 ]
Steeb, Theresa [3 ]
Heppt, Markus V. [3 ,4 ]
Gefeller, Olaf [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nuremberg, Dept Med Informat Biometry & Epidemiol, D-91054 Erlangen, Germany
[2] Tech Univ Munich, Fac Med, Dept Dermatol & Allergy Biederstein, D-80802 Munich, Germany
[3] Univ Hosp Erlangen, Dept Dermatol, D-91054 Erlangen, Germany
[4] Comprehens Canc Ctr Erlangen European Metropolitan, D-91054 Erlangen, Germany
关键词
inter-rater agreement; inter-rater reliability; melanoma; risk of bias; prediction; PROBAST; CUTANEOUS MELANOMA; IDENTIFYING INDIVIDUALS; MALIGNANT-MELANOMA; SYSTEMATIC REVIEWS; SKIN-CANCER; ROB; RELIABILITY; SCORE; POPULATION; VALIDATION;
D O I
10.3390/jcm12051976
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Assessing the risk of bias (ROB) of studies is an important part of the conduct of systematic reviews and meta-analyses in clinical medicine. Among the many existing ROB tools, the Prediction Model Risk of Bias Assessment Tool (PROBAST) is a rather new instrument specifically designed to assess the ROB of prediction studies. In our study we analyzed the inter-rater reliability (IRR) of PROBAST and the effect of specialized training on the IRR. Six raters independently assessed the risk of bias (ROB) of all melanoma risk prediction studies published until 2021 (n = 42) using the PROBAST instrument. The raters evaluated the ROB of the first 20 studies without any guidance other than the published PROBAST literature. The remaining 22 studies were assessed after receiving customized training and guidance. Gwet's AC(1) was used as the primary measure to quantify the pairwise and multi-rater IRR. Depending on the PROBAST domain, results before training showed a slight to moderate IRR (multi-rater AC(1) ranging from 0.071 to 0.535). After training, the multi-rater AC(1) ranged from 0.294 to 0.780 with a significant improvement for the overall ROB rating and two of the four domains. The largest net gain was achieved in the overall ROB rating (difference in multi-rater AC(1): 0.405, 95%-CI 0.149-0.630). In conclusion, without targeted guidance, the IRR of PROBAST is low, questioning its use as an appropriate ROB instrument for prediction studies. Intensive training and guidance manuals with context-specific decision rules are needed to correctly apply and interpret the PROBAST instrument and to ensure consistency of ROB ratings.
引用
收藏
页数:13
相关论文
共 9 条
  • [1] Systematic metareview of prediction studies demonstrates stable trends in bias and low PROBAST inter-rater agreement
    Langenhuijsen, Liselotte F. S.
    Janse, Roemer J.
    Venema, Esmee
    Kent, David M.
    van Diepen, Merel
    Dekker, Friedo W.
    Steyerberg, Ewout W.
    de Jong, Ype
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2023, 159 : 159 - 173
  • [2] Using the Prediction Model Risk of Bias Assessment Tool (PROBAST) to Evaluate Melanoma Prediction Studies
    Kaiser, Isabelle
    Mathes, Sonja
    Pfahlberg, Annette B.
    Uter, Wolfgang
    Berking, Carola
    Heppt, Markus, V
    Steeb, Theresa
    Diehl, Katharina
    Gefeller, Olaf
    CANCERS, 2022, 14 (12)
  • [3] PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies
    Wolff, Robert F.
    Moons, Karel G. M.
    Riley, Richard D.
    Whiting, Penny F.
    Westwood, Marie
    Collins, Gary S.
    Reitsma, Johannes B.
    Kleijnen, Jos
    Mallett, Sue
    Altman, Doug
    Bossuyt, Patrick
    Cook, Nancy R.
    D'Amico, Gennaro
    Debray, Thomas P. A.
    Deeks, Jon
    de Groot, Joris
    di Angelantonio, Emanuele
    Fahey, Tom
    Harrell, Frank
    Hayden, Jill A.
    Heymans, Martijn W.
    Hooft, Lotty
    Hyde, Chris
    Ioannidis, John
    Iorio, Alfonso
    Kaptoge, Stephen
    Knottnerus, Andre
    Leeflang, Mariska
    Nixon, Frances
    Perel, Pablo
    Phillips, Bob
    Raatz, Heike
    Riemsma, Rob
    Rovers, Maroeska
    Rutjes, Anne W. S.
    Sauerbrei, Willi
    Sauerland, Stefan
    Scheibler, Fueloep
    Scholten, Rob
    Schuit, Ewoud
    Steyerberg, Ewout
    Tan, Toni
    ter Riet, Gerben
    van der Windt, Danielle
    Vergouwe, Yvonne
    Vickers, Andrew
    Wood, Angela M.
    ANNALS OF INTERNAL MEDICINE, 2019, 170 (01) : 51 - +
  • [4] Appraising prediction research: a guide and meta-review on bias and applicability assessment using the Prediction model Risk Of Bias ASsessment Tool (PROBAST)
    de Jong, Ype
    Ramspek, Chava L.
    Zoccali, Carmine
    Jager, Kitty J.
    Dekker, Friedo W.
    van Diepen, Merel
    NEPHROLOGY, 2021, 26 (12) : 939 - 947
  • [5] PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration
    Moons, Karel G. M.
    Wolff, Robert F.
    Riley, Richard D.
    Whiting, Penny F.
    Westwood, Marie
    Collins, Gary S.
    Reitsma, Johannes B.
    Kleijnen, Jos
    Mallett, Sue
    ANNALS OF INTERNAL MEDICINE, 2019, 170 (01) : W1 - W33
  • [6] Assessing the quality of prediction models in health care using the Prediction model Risk Of Bias ASsessment Tool (PROBAST): an evaluation of its use and practical application
    Kaul, Tabea
    Damen, Johanna A. A.
    Wynants, Laure
    Van Calster, Ben
    van Smeden, Maarten
    Hooft, Lotty
    Moons, Karel G. M.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2025, 181
  • [7] Large-scale validation of the prediction model risk of bias assessment Tool (PROBAST) using a short form: high risk of bias models show poorer discrimination
    Venema, Esmee
    Wessler, Benjamin S.
    Paulus, Jessica K.
    Salah, Rehab
    Raman, Gowri
    Leung, Lester Y.
    Koethe, Benjamin C.
    Nelson, Jason
    Park, Jinny G.
    van Klaveren, David
    Steyerberg, Ewout W.
    Kent, David M.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2021, 138 : 32 - 39
  • [8] Assessor burden, inter-rater agreement and user experience of the RoB-SPEO tool for assessing risk of bias in studies estimating prevalence of exposure to occupational risk factors: An analysis from the WHO/ILO Joint Estimates of the Work-related Burden of Disease and Injury
    Momen, Natalie C.
    Streicher, Kai N.
    da Silva, Denise T. C.
    Descatha, Alexis
    Frings-Dresen, Monique H. W.
    Gagliardi, Diana
    Godderis, Lode
    Loney, Tom
    Mandrioli, Daniele
    Modenese, Alberto
    Morgan, Rebecca L.
    Pachito, Daniela
    Scheepers, Paul T. J.
    Sgargi, Daria
    Paulo, Marilia Silva
    Schlunssen, Vivi
    Sembajwe, Grace
    Sorensen, Kathrine
    Teixeira, Liliane R.
    Tenkate, Thomas
    Pega, Frank
    ENVIRONMENT INTERNATIONAL, 2022, 158
  • [9] Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence
    Collins, Gary S.
    Dhiman, Paula
    Andaur Navarro, Constanza L.
    Ma, Ji
    Hooft, Lotty
    Reitsma, Johannes B.
    Logullo, Patricia
    Beam, Andrew L.
    Peng, Lily
    Van Calster, Ben
    van Smeden, Maarten
    Riley, Richard D.
    Moons, Karel G. M.
    BMJ OPEN, 2021, 11 (07):