Receiver operating characteristic estimation and threshold selection criteria in three-class classification problems for clustered data

被引:4
|
作者
To, Duc-Khanh [1 ]
Adimari, Gianfranco [1 ]
Chiogna, Monica [2 ]
Risso, Davide [1 ]
机构
[1] Univ Padua, Dept Stat Sci, Via C Battisti 241, I-35121 Padua, Italy
[2] Univ Bologna, Dept Stat Sci Paolo Fortunati, Bologna, Italy
关键词
Receiver operating characteristic analysis; clustered data; covariate adjustment; linear-mixed models; Box-Cox transformation; BOX-COX TRANSFORMATION; LONGITUDINAL DATA; ACCURACY; SURFACE;
D O I
10.1177/09622802221089029
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Statistical evaluation of diagnostic tests, and, more generally, of biomarkers, is a constantly developing field, in which complexity of the assessment increases with the complexity of the design under which data are collected. One particularly prevalent type of data is clustered data, where individual units are naturally nested into clusters. In these cases, Bias can arise from omission, in the evaluation process, of cluster-level effects and/or individual covariates. Focusing on the three-class case and for continuous-valued diagnostic tests, we investigate how to exploit the clustered structure of data within a linear-mixed model approach, both when the assumption of normality holds and when it does not. We provide a method for the estimation of covariate-specific receiver operating characteristic surfaces and discuss methods for the choice of optimal thresholds, proposing three possible estimators. A proof of consistency and asymptotic normality of the proposed threshold estimators is given. All considered methods are evaluated by extensive simulation experiments. As an application, we study the use of the Lysosomal Associated Membrane Protein Family Member 5 gene expression as a biomarker to distinguish among three types of glutamatergic neurons.
引用
收藏
页码:1325 / 1341
页数:17
相关论文
共 45 条
  • [31] Receiver operating characteristic analysis: a general tool for DNA array data filtration and performance estimation
    Khodarev, NN
    Park, J
    Kataoka, Y
    Nodzenski, E
    Hellman, S
    Roizman, B
    Weichselbaum, RR
    Pelizzari, CA
    GENOMICS, 2003, 81 (02) : 202 - 209
  • [32] Interval estimation for the area under the receiver operating characteristic curve when data are subject to error
    Li, Yanhong
    Koval, John J.
    Donner, Allan
    Zou, G. Y.
    STATISTICS IN MEDICINE, 2010, 29 (24) : 2521 - 2531
  • [33] Detection and Classification of Three-Class Initial Dips Using Vector Phase Analysis with Dual Threshold Circles: An fNIRS Study
    Khan, M. N. Afzal
    Zafar, Amad
    Hong, Keum-Shik
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 786 - 791
  • [34] Comparison of classification methods with "n-class" receiver operating characteristic curves: A case study of energy drinks
    Racz, Anita
    Bajusz, David
    Fodor, Marietta
    Heberger, Karoly
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 151 : 34 - 43
  • [35] A parsimonious threshold-independent protein feature selection method through the area under receiver operating characteristic curve
    Wang, Zhanfeng
    Chang, Yuan-Chin I.
    Ying, Zhiliang
    Zhu, Liang
    Yang, Yaning
    BIOINFORMATICS, 2007, 23 (20) : 2788 - 2794
  • [36] Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data
    Metz, CE
    Herman, BA
    Shen, JH
    STATISTICS IN MEDICINE, 1998, 17 (09) : 1033 - 1053
  • [37] Classification of three-class motor imagery EEG data by combining wavelet packet decomposition and common spatial pattern
    Tu, Wei
    Wei, Qingguo
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 1, PROCEEDINGS, 2009, : 188 - 191
  • [38] Gene Features Selection for Three-Class Disease Classification via Multiple Orthogonal Partial Least Square Discriminant Analysis and S-Plot Using Microarray Data
    Yang, Mingxing
    Li, Xiumin
    Li, Zhibin
    Ou, Zhimin
    Liu, Ming
    Liu, Suhuan
    Li, Xuejun
    Yang, Shuyu
    PLOS ONE, 2013, 8 (12):
  • [39] Test Data Reuse for the Evaluation of Continuously Evolving Classification Algorithms Using the Area under the Receiver Operating Characteristic Curve
    Gossmann, Alexej
    Pezeshk, Aria
    Wang, Yu-Ping
    Sahiner, Berkman
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (02): : 692 - 714
  • [40] Three-class classification of Motor Imagery EEG data including "rest state" using Filter-Bank multi-class Common Spatial Pattern
    Shiratori, T.
    Tsubakida, H.
    Ishiyama, A.
    Ono, Y.
    3RD INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, 2015, : 80 - 83