Dependence of weighted kappa coefficients on the number of categories

被引:193
作者
Brenner, H
Kliebsch, U
机构
[1] Department of Epidemiology, University of Ulm, Ulm
[2] Department of Epidemiology, University of Ulm, D-89081 Ulm
关键词
kappa; reliability; statistics;
D O I
10.1097/00001648-199603000-00016
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Weighted kappa coefficients are commonly used to quantify inter- or intra-rater reliability or test-retest reliability of ordinal ratings in clinical and epidemiologic applications. In this paper, we assess the dependence of weighted kappa coefficients on the number of categories and the type of weighting scheme, which vary between applications. The most commonly used weights are weights that are proportional to the deviation of individual ratings (''linear weights'') or to the square of the deviation of individual ratings (''quadratic weights''). Quadratically weighted kappa coefficients are equivalent to the intraclass correlation coefficient and to the product-moment correlation coefficient under certain conditions. We illustrate that an increase of quadratically weighted kappa coefficients with the number of categories is expected under a broad variety of conditions, whereas linearly weighted kappa coefficients appear to be less sensitive to the number of categories. Number of categories and type of weighting scheme therefore require careful consideration in the interpretation of weighted kappa coefficients.
引用
收藏
页码:199 / 202
页数:4
相关论文
共 18 条
[1]  
Altman DG, 1991, PRACTICAL STATISTICS, P440
[2]   BIAS, PREVALENCE AND KAPPA [J].
BYRT, T ;
BISHOP, J ;
CARLIN, JB .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1993, 46 (05) :423-429
[4]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[5]   INTERPRETATION OF LOW KAPPA-VALUES [J].
DONKER, DK ;
HASMAN, A ;
VANGEIJN, HP .
INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1993, 33 (01) :55-64
[6]   HIGH AGREEMENT BUT LOW KAPPA .1. THE PROBLEMS OF 2 PARADOXES [J].
FEINSTEIN, AR ;
CICCHETTI, DV .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1990, 43 (06) :543-549
[7]   EQUIVALENCE OF WEIGHTED KAPPA AND INTRACLASS CORRELATION COEFFICIENT AS MEASURES OF RELIABILITY [J].
FLEISS, JL ;
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1973, 33 (03) :613-619
[8]  
FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
[9]  
Fleiss JL., 1981, MEASUREMENT INTERRAT
[10]  
GJORUP T, 1988, METHOD INFORM MED, V27, P184