Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

被引：0

作者：

Gautam Puhan

Leanne Gall

机构：

[1] Educational Testing Service,

来源：

Psychological Studies | 2012年 / 57卷 / 3期

关键词：

Reliability of classification; Certification tests; Cut scores; Classification accuracy; Classification consistency;

D O I：

10.1007/s12646-012-0147-9

中图分类号：

学科分类号：

摘要：

The study evaluated the reliability of pass and fail classifications for several teacher certification tests. Since these tests are used in the context of a cut score to classify examinees as pass and fail, evaluating the accuracy and consistency of these classifications is important. The classification accuracy and consistency statistics were estimated using the RELCLASS software. Results indicated the following. (1) The 29 teacher certification tests that were examined had a relatively high classification accuracy (0.827 to 0.999) and consistency (0.760 to 0.999). (2) Both classification accuracy and consistency increased as the difference between the mean and cut score increased. (3) Classification accuracy and consistency was higher for multiple-choice (MC) as compared to tests consisting of only constructed-response (CR) items or a combination of CR and MC items.

引用

页码：273 / 282

页数：9

共 50 条

[41] Evidence for Validity and Reliability, and Development of Performance Standards and Cut-Scores for Job-Related Tests of Physical Aptitude for Structural Firefighters
Scarlett, Michael P.
Rogers, W. Todd
Adams, Eric M.
Dreger, Randy W.
Petersen, Stewart R.
JOURNAL OF OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2021, 63 (11) : 992 - 1002
[42] Description and impact of using a standard-setting method for determining pass/fail scores in a surgery clerkship
Schindler, Nancy
Corcoran, Julia
DaRosa, Debra
AMERICAN JOURNAL OF SURGERY, 2007, 193 (02): : 252 - 257
[43] Repeatability and reliability of scores from ridden temperament tests conducted during performance tests
von Borstel, Uta Koenig
Pirsich, Wiebke
Gauly, Matthias
Bruns, Erich
APPLIED ANIMAL BEHAVIOUR SCIENCE, 2012, 139 (3-4) : 251 - 263
[44] Reliability, Validity, and Cut Scores of the South Oaks Gambling Screen (SOGS) for Chinese
Tang, Catherine So-kum
Wu, Anise M. S.
Tang, Joe Y. C.
Yan, Elsie C. W.
JOURNAL OF GAMBLING STUDIES, 2010, 26 (01) : 145 - 158
[45] Attention deficit hyperactive disorder diagnosis continues to fail the reliability and validity tests
Whitely, Martin
AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2015, 49 (06): : 497 - 498
[46] Reliability, Validity, and Cut Scores of the South Oaks Gambling Screen (SOGS) for Chinese
Catherine So-kum Tang
Anise M. S. Wu
Joe Y. C. Tang
Elsie C. W. Yan
Journal of Gambling Studies, 2010, 26 : 145 - 158
[47] Commentary on "ADHD diagnosis continues to fail the reliability and validity tests' by Martin Whitely
Vance, Alasdair L.
AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2015, 49 (06): : 574 - 575
[48] Comparing methods for assessing reliability uncertainty based on pass/fail data collected over time
Abes, Jeff I.
Hamada, Michael S.
Hills, Charles R.
QUALITY ENGINEERING, 2018, 30 (04) : 694 - 700
[49] SHOULD PERSONNEL-SELECTION TESTS BE USED ON A PASS-FAIL, GROUPING, OR RANKING BASIS
SPROULE, CF
PUBLIC PERSONNEL MANAGEMENT, 1984, 13 (04) : 375 - 394
[50] Potential Impact of Pass/Fail Scores on USMLE Step 1: Predictors of Excellence in Obstetrics and Gynecology Residency Training
Tamakuwala, Sejal
Dean, Joshua
Kramer, Katherine J.
Shafi, Adib
Ottum, Sarah
George, Joshua
Kaur, Satinder
Chao, Conrad R.
Recanati, Maurice-Andre
JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2021, 8

← 1 2 3 4 5 →