Receiver operating characteristic estimation and threshold selection criteria in three-class classification problems for clustered data

Cited by: 4
Authors
To, Duc-Khanh [1]
Adimari, Gianfranco [1]
Chiogna, Monica [2]
Risso, Davide [1]
Affiliations
[1] Univ Padua, Dept Stat Sci, Via C Battisti 241, I-35121 Padua, Italy
[2] Univ Bologna, Dept Stat Sci Paolo Fortunati, Bologna, Italy
Keywords
Receiver operating characteristic analysis; clustered data; covariate adjustment; linear-mixed models; Box-Cox transformation; longitudinal data; accuracy; surface
DOI
10.1177/09622802221089029
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Statistical evaluation of diagnostic tests, and, more generally, of biomarkers, is a constantly developing field, in which complexity of the assessment increases with the complexity of the design under which data are collected. One particularly prevalent type of data is clustered data, where individual units are naturally nested into clusters. In these cases, bias can arise from omission, in the evaluation process, of cluster-level effects and/or individual covariates. Focusing on the three-class case and for continuous-valued diagnostic tests, we investigate how to exploit the clustered structure of data within a linear-mixed model approach, both when the assumption of normality holds and when it does not. We provide a method for the estimation of covariate-specific receiver operating characteristic surfaces and discuss methods for the choice of optimal thresholds, proposing three possible estimators. A proof of consistency and asymptotic normality of the proposed threshold estimators is given. All considered methods are evaluated by extensive simulation experiments. As an application, we study the use of the Lysosomal Associated Membrane Protein Family Member 5 gene expression as a biomarker to distinguish among three types of glutamatergic neurons.
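As a rough illustration of threshold selection in a three-class problem, the sketch below computes the three true class fractions at a threshold pair (t1, t2) and picks the pair that maximizes their sum, a criterion known in the three-class ROC literature as the generalized Youden index. The abstract does not say which criteria the authors use, so this choice is an assumption, as are the normal distributions, their parameters, the grid search, and the function names. In a covariate-specific analysis the class means would come from a fitted linear mixed model rather than being fixed by hand.

```python
from statistics import NormalDist


def true_class_fractions(t1, t2, dists):
    """True class fractions for ordered classes 1 < 2 < 3.

    A subject is assigned to class 1 if the test value is <= t1,
    to class 2 if it lies in (t1, t2], and to class 3 otherwise.
    `dists` holds one NormalDist per class (hypothetical values).
    """
    d1, d2, d3 = dists
    tcf1 = d1.cdf(t1)
    tcf2 = d2.cdf(t2) - d2.cdf(t1)
    tcf3 = 1.0 - d3.cdf(t2)
    return tcf1, tcf2, tcf3


def youden_thresholds(dists, lo, hi, steps=200):
    """Grid search for the pair t1 <= t2 maximizing TCF1 + TCF2 + TCF3."""
    grid = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    best = None
    for i, t1 in enumerate(grid):
        for t2 in grid[i:]:  # enforce t1 <= t2
            j = sum(true_class_fractions(t1, t2, dists))
            if best is None or j > best[0]:
                best = (j, t1, t2)
    return best


# Hypothetical class distributions: equal variances, well-separated means.
dists = (NormalDist(0.0, 1.0), NormalDist(2.0, 1.0), NormalDist(4.0, 1.0))
index, t1, t2 = youden_thresholds(dists, -2.0, 6.0)
print(f"generalized Youden index = {index:.4f} at t1 = {t1:.2f}, t2 = {t2:.2f}")
```

With equal variances the selected thresholds fall near the midpoints between adjacent class means, which gives a quick sanity check on the search.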
Pages: 1325-1341
Page count: 17