Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests

被引:1
|
作者
Demir, Seda [1 ]
Atar, Burcu [2 ]
机构
[1] Tokat Gaziosmanpasa Univ, Fac Educ, Tokat, Turkey
[2] Hacettepe Univ, Fac Educ, Ankara, Turkey
关键词
Computerized adaptive classification testing; content balancing; item exposure control; classification criteria; item selection methods;
D O I
10.21031/epod.787865
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
This study aims to compare Sequential Probability Ratio Test (SPRT) and Confidence Interval (CI) classification criteria, Maximum Fisher Information method on the basis of estimated-ability (MFI-EB) and Cut-Point (MFI-CB) item selection methods while ability estimation method is Weighted Likelihood Estimation (WLE) in Computerized Adaptive Classification Testing (CACT), according to the Average Classification Accuracy (ACA), Average Test Length (ATL), and measurement precision under content balancing (Constrained Computerized Adaptive Testing: CCAT and Modified Multinomial Model: MMM) and item exposure control (Sympson-Hetter Method: SH and Item Eligibility Method: IE) when the classification is done based on two, three, or four categories for a unidimensional pool of dichotomous items. Forty-eight conditions are created in Monte Carlo (MC) simulation for the data, generated in R software, including 500 items and 5000 examinees, and the results are calculated over 30 replications. As a result of the study, it was observed that CI performs better in terms of ATL, and SPRT performs better in ACA and correlation, bias, Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) values, sequentially; MFI-EB is more useful than MFI-CB. It was also seen that MMM is more successful in content balancing, whereas CCAT is better in terms of test efficiency (ATL and ACA), and IE is superior in terms of item exposure control though SH is more beneficial in test efficiency. Besides, increasing the number of classification categories increases ATL but decreases ACA, and it gives better results in terms of the correlation, bias, RMSE, and MAE values.
引用
收藏
页码:15 / 27
页数:13
相关论文
共 50 条
  • [1] Investigation of Measurement Precision and Test Length in Computerized Adaptive Tests under Different Conditions
    Yildiz, Huseyin
    Demir, Ceren Tunaboylu
    Ulku, Suleyman
    Giray, Gamze
    Kelecioglu, Hulya
    JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, 2024, 15 (01): : 5 - 17
  • [2] The Effects of Item Pool Characteristics on Test Length and Classification Accuracy in Computerized Adaptive Classification Testings
    Gundeger, Ceylan
    Dogan, Nuri
    HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2018, 33 (04): : 888 - 896
  • [3] A Comparison of Computerized Adaptive Classification Test Criteria in Terms of Test Efficiency and Measurement Precision
    Gundeger, Ceylan
    Dogan, Nuri
    JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, 2018, 9 (02): : 161 - 177
  • [4] Comparison of computerized adaptive test and computerized classification test in making classification decisions
    Jiao, H
    Wang, SD
    Lau, A
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2004, 39 (5-6) : 313 - 313
  • [5] Classification accuracy and consistency of computerized adaptive testing
    Ying Cheng
    Deanna L. Morgan
    Behavior Research Methods, 2013, 45 : 132 - 142
  • [6] Classification accuracy and consistency of computerized adaptive testing
    Cheng, Ying
    Morgan, Deanna L.
    BEHAVIOR RESEARCH METHODS, 2013, 45 (01) : 132 - 142
  • [7] Mixing linear and adaptive algorithms in computerized classification test
    Jiao, H
    Lau, A
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2004, 39 (5-6) : 313 - 313
  • [8] The Accuracy of Computerized Adaptive Tests in Heterogeneous Populations
    Sawatzky, Richard
    Ratner, Pamela A.
    Kopec, Jacek A.
    Zumbo, Bruno D.
    QUALITY OF LIFE RESEARCH, 2012, 20 : 17 - 18
  • [9] Classification accuracy for tests that allow retakes
    Clauser, BE
    Nungester, RJ
    ACADEMIC MEDICINE, 2001, 76 (10) : S108 - S110
  • [10] HETEROGENEITY OF ATOPIC POPULATIONS - A COMPUTERIZED CLASSIFICATION TEST
    BORGNON, A
    BOURGOUIN, D
    HENOCQ, E
    REVUE FRANCAISE D ALLERGOLOGIE ET D IMMUNOLOGIE CLINIQUE, 1984, 24 (01): : 70 - 70