Strategies for validating biomarkers using data from a reference set

被引:0
|
作者
Wang, Lu [1 ]
Huang, Ying [1 ,2 ]
Feng, Ziding [1 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
Biomarker validation; Group sequential testing; Reference set; Rotation; Two-stage; CONDITIONAL ESTIMATION; EARLY TERMINATION;
D O I
10.1093/biostatistics/kxz031
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Candidate biomarkers discovered in the laboratory need to be rigorously validated before advancing to clinical application. However, it is often expensive and time-consuming to collect the high quality specimens needed for validation; moreover, such specimens are often limited in volume. The Early Detection Research Network has developed valuable specimen reference sets that can be used by multiple labs for biomarker validation. To optimize the chance of successful validation, it is critical to efficiently utilize the limited specimens in these reference sets on promising candidate biomarkers. Towards this end, we propose a novel two-stage validation strategy that partitions the samples in the reference set into two groups for sequential validation. The proposed strategy adopts the group sequential testing method to control for the type I error rate and rotates group membership to maximize the usage of available samples. We develop analytical formulas for performance parameters of this strategy in terms of the expected numbers of biomarkers that can be evaluated and the truly useful biomarkers that can be successfully validated, which can provide valuable guidance for future study design. The performance of our proposed strategy for validating biomarkers with respect to the points on the receiver operating characteristic curve are evaluated via extensive simulation studies and compared with the default strategy of validating each biomarker using all samples in the reference set. Different types of early stopping rules and boundary shapes in the group sequential testing method are considered. Compared with the default strategy, our proposed strategy makes more efficient use of the limited resources in the reference set by allowing more candidate biomarkers to be evaluated, giving a better chance of having truly useful biomarkers successfully validated.
引用
收藏
页码:298 / 314
页数:17
相关论文
共 50 条
  • [1] A reference data set for validating vapor pressure measurement techniques: homologous series of polyethylene glycols
    Krieger, Ulrich K.
    Siegrist, Franziska
    Marcolli, Claudia
    Emanuelsson, Eva U.
    Gobel, Freya M.
    Bilde, Merete
    Marsh, Aleksandra
    Reid, Jonathan P.
    Huisman, Andrew J.
    Riipinen, Ilona
    Hyttinen, Noora
    Myllys, Nanna
    Kurten, Theo
    Bannan, Thomas
    Percival, Carl J.
    Topping, David
    ATMOSPHERIC MEASUREMENT TECHNIQUES, 2018, 11 (01) : 49 - 63
  • [2] Accuracy of Classified Imagery without using Reference Data Set
    Sharma, Ranjana
    Garg, P. K.
    Dwivedi, R. K.
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART), 2018, : 195 - 199
  • [3] Synthetic phosphopeptide reference data set
    不详
    NATURE BIOTECHNOLOGY, 2017, 35 (08)
  • [4] LocaRDS: A Localization Reference Data Set
    Schaefer, Matthias
    Strohmeier, Martin
    Leonardi, Mauro
    Lenders, Vincent
    SENSORS, 2021, 21 (16)
  • [5] New Data Set for Validating PV Module Performance Models
    Marion, Bill
    Anderberg, Allan
    Deline, Chris
    del Cueto, Joe
    Muller, Matt
    Perrin, Greg
    Rodriguez, Jose
    Rummel, Steve
    Silverman, Timothy J.
    Vignola, Frank
    Kessler, Rich
    Peterson, Josh
    Barkaszi, Stephen
    Jacobs, Mark
    Riedel, Nick
    Pratt, Larry
    King, Bruce
    2014 IEEE 40TH PHOTOVOLTAIC SPECIALIST CONFERENCE (PVSC), 2014, : 1362 - 1366
  • [6] Comparison of MTF measurements using edge method: towards reference data set
    Viallefont-Robinet, Francoise
    Helder, Dennis
    Fraisse, Renaud
    Newbury, Amy
    van den Bergh, Frans
    Lee, DongHan
    Saunier, Sebastien
    OPTICS EXPRESS, 2018, 26 (26): : 33625 - 33648
  • [7] Validating semistructured data using OWL
    Li, Yuan Fang
    Sun, Jing
    Dobbie, Gillian
    Sun, Jun
    Wang, Hai H.
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 520 - 531
  • [8] A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database
    Katz, Simon
    Irizarry, Rafael A.
    Lin, Xue
    Tripputi, Mark
    Porter, Mark W.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [9] A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database
    Simon Katz
    Rafael A Irizarry
    Xue Lin
    Mark Tripputi
    Mark W Porter
    BMC Bioinformatics, 7
  • [10] Validating reference genes using minimally transformed qpcr data: findings in human cortex and outcomes in schizophrenia
    Dean, Brian
    Udawela, Madhara
    Scarr, Elizabeth
    BMC PSYCHIATRY, 2016, 16