Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

被引:0
|
作者
Gautam Puhan
Leanne Gall
机构
[1] Educational Testing Service,
关键词
Reliability of classification; Certification tests; Cut scores; Classification accuracy; Classification consistency;
D O I
10.1007/s12646-012-0147-9
中图分类号
学科分类号
摘要
The study evaluated the reliability of pass and fail classifications for several teacher certification tests. Since these tests are used in the context of a cut score to classify examinees as pass and fail, evaluating the accuracy and consistency of these classifications is important. The classification accuracy and consistency statistics were estimated using the RELCLASS software. Results indicated the following. (1) The 29 teacher certification tests that were examined had a relatively high classification accuracy (0.827 to 0.999) and consistency (0.760 to 0.999). (2) Both classification accuracy and consistency increased as the difference between the mean and cut score increased. (3) Classification accuracy and consistency was higher for multiple-choice (MC) as compared to tests consisting of only constructed-response (CR) items or a combination of CR and MC items.
引用
收藏
页码:273 / 282
页数:9
相关论文
共 50 条
  • [41] Evidence for Validity and Reliability, and Development of Performance Standards and Cut-Scores for Job-Related Tests of Physical Aptitude for Structural Firefighters
    Scarlett, Michael P.
    Rogers, W. Todd
    Adams, Eric M.
    Dreger, Randy W.
    Petersen, Stewart R.
    JOURNAL OF OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2021, 63 (11) : 992 - 1002
  • [42] Description and impact of using a standard-setting method for determining pass/fail scores in a surgery clerkship
    Schindler, Nancy
    Corcoran, Julia
    DaRosa, Debra
    AMERICAN JOURNAL OF SURGERY, 2007, 193 (02): : 252 - 257
  • [43] Repeatability and reliability of scores from ridden temperament tests conducted during performance tests
    von Borstel, Uta Koenig
    Pirsich, Wiebke
    Gauly, Matthias
    Bruns, Erich
    APPLIED ANIMAL BEHAVIOUR SCIENCE, 2012, 139 (3-4) : 251 - 263
  • [44] Reliability, Validity, and Cut Scores of the South Oaks Gambling Screen (SOGS) for Chinese
    Tang, Catherine So-kum
    Wu, Anise M. S.
    Tang, Joe Y. C.
    Yan, Elsie C. W.
    JOURNAL OF GAMBLING STUDIES, 2010, 26 (01) : 145 - 158
  • [45] Attention deficit hyperactive disorder diagnosis continues to fail the reliability and validity tests
    Whitely, Martin
    AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2015, 49 (06): : 497 - 498
  • [46] Reliability, Validity, and Cut Scores of the South Oaks Gambling Screen (SOGS) for Chinese
    Catherine So-kum Tang
    Anise M. S. Wu
    Joe Y. C. Tang
    Elsie C. W. Yan
    Journal of Gambling Studies, 2010, 26 : 145 - 158
  • [47] Commentary on "ADHD diagnosis continues to fail the reliability and validity tests' by Martin Whitely
    Vance, Alasdair L.
    AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2015, 49 (06): : 574 - 575
  • [48] Comparing methods for assessing reliability uncertainty based on pass/fail data collected over time
    Abes, Jeff I.
    Hamada, Michael S.
    Hills, Charles R.
    QUALITY ENGINEERING, 2018, 30 (04) : 694 - 700
  • [49] SHOULD PERSONNEL-SELECTION TESTS BE USED ON A PASS-FAIL, GROUPING, OR RANKING BASIS
    SPROULE, CF
    PUBLIC PERSONNEL MANAGEMENT, 1984, 13 (04) : 375 - 394
  • [50] Potential Impact of Pass/Fail Scores on USMLE Step 1: Predictors of Excellence in Obstetrics and Gynecology Residency Training
    Tamakuwala, Sejal
    Dean, Joshua
    Kramer, Katherine J.
    Shafi, Adib
    Ottum, Sarah
    George, Joshua
    Kaur, Satinder
    Chao, Conrad R.
    Recanati, Maurice-Andre
    JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2021, 8