Receiver operating characteristic estimation and threshold selection criteria in three-class classification problems for clustered data

被引:4
|
作者
To, Duc-Khanh [1 ]
Adimari, Gianfranco [1 ]
Chiogna, Monica [2 ]
Risso, Davide [1 ]
机构
[1] Univ Padua, Dept Stat Sci, Via C Battisti 241, I-35121 Padua, Italy
[2] Univ Bologna, Dept Stat Sci Paolo Fortunati, Bologna, Italy
关键词
Receiver operating characteristic analysis; clustered data; covariate adjustment; linear-mixed models; Box-Cox transformation; BOX-COX TRANSFORMATION; LONGITUDINAL DATA; ACCURACY; SURFACE;
D O I
10.1177/09622802221089029
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Statistical evaluation of diagnostic tests, and, more generally, of biomarkers, is a constantly developing field, in which complexity of the assessment increases with the complexity of the design under which data are collected. One particularly prevalent type of data is clustered data, where individual units are naturally nested into clusters. In these cases, Bias can arise from omission, in the evaluation process, of cluster-level effects and/or individual covariates. Focusing on the three-class case and for continuous-valued diagnostic tests, we investigate how to exploit the clustered structure of data within a linear-mixed model approach, both when the assumption of normality holds and when it does not. We provide a method for the estimation of covariate-specific receiver operating characteristic surfaces and discuss methods for the choice of optimal thresholds, proposing three possible estimators. A proof of consistency and asymptotic normality of the proposed threshold estimators is given. All considered methods are evaluated by extensive simulation experiments. As an application, we study the use of the Lysosomal Associated Membrane Protein Family Member 5 gene expression as a biomarker to distinguish among three types of glutamatergic neurons.
引用
收藏
页码:1325 / 1341
页数:17
相关论文
共 45 条