Sequential Bayesian Ability Estimation Applied to Mixed-Format Item Tests

被引:0
|
作者
Xiong, Jiawei [1 ,4 ]
Cohen, Allan S. [2 ]
Xiong, Xinhui [3 ]
机构
[1] Pearson, Athens, GA USA
[2] Univ Georgia, Athens, GA USA
[3] Educ Testing Serv, Princeton, NJ USA
[4] 110 Carlton St, Athens, GA 30602 USA
关键词
mixed-format data; Bayesian; EAP; prior; ability estimation; RESPONSE THEORY; MULTIPLE-CHOICE; EMPIRICAL BAYES; FIT; MODELS;
D O I
10.1177/01466216231201986
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Large-scale tests often contain mixed-format items, such as when multiple-choice (MC) items and constructed-response (CR) items are both contained in the same test. Although previous research has analyzed both types of items simultaneously, this may not always provide the best estimate of ability. In this paper, a two-step sequential Bayesian (SB) analytic method under the concept of empirical Bayes is explored for mixed item response models. This method integrates ability estimates from different item formats. Unlike the empirical Bayes method, the SB method estimates examinees' posterior ability parameters with individual-level sample-dependent prior distributions estimated from the MC items. Simulations were used to evaluate the accuracy of recovery of ability and item parameters over four factors: the type of the ability distribution, sample size, test length (number of items for each item type), and person/item parameter estimation method. The SB method was compared with a traditional concurrent Bayesian (CB) calibration method, EAPsum, that uses scaled scores for summed scores to estimate parameters from the MC and CR items simultaneously in one estimation step. From the simulation results, the SB method showed more accurate and reliable ability estimation than the CB method, especially when the sample size was small (150 and 500). Both methods presented similar recovery results for MC item parameters, but the CB method yielded a bit better recovery of the CR item parameters. The empirical example suggested that posterior ability estimated by the proposed SB method had higher reliability than the CB method.
引用
收藏
页码:402 / 419
页数:18
相关论文
共 50 条
  • [1] Item Selection and Ability Estimation Procedures for a Mixed-Format Adaptive Test
    Ho, Tsung-Han
    Dodd, Barbara G.
    APPLIED MEASUREMENT IN EDUCATION, 2012, 25 (04) : 305 - 326
  • [2] Customizing Bayesian multivariate generalizability theory to mixed-format tests
    Jiang, Zhehan
    Ouyang, Jinying
    Shi, Dingjing
    Shi, Dexin
    Zhang, Jihong
    Xu, Lingling
    Cai, Fen
    BEHAVIOR RESEARCH METHODS, 2024, 56 (07) : 8080 - 8090
  • [3] The Effect Of Proportion Of Mixed-Format Scoring : Mixed-Format Achievement Tests
    Saen-amnuaiphon, R.
    Tuksino, P.
    Nichanong, C.
    INTERNATIONAL CONFERENCE ON EDUCATION & EDUCATIONAL PSYCHOLOGY (ICEEPSY 2012), 2012, 69 : 1522 - 1528
  • [4] Efficiency Analysis of Item Response Theory Kernel Equating for Mixed-Format Tests
    Wallmark, Joakim
    Josefsson, Maria
    Wiberg, Marie
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2023, 47 (7-8) : 496 - 512
  • [5] A Mixed Sequential IRT Model for Mixed-Format Items
    Wei, Junhuan
    Cai, Yan
    Tu, Dongbo
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2023, 47 (04) : 259 - 274
  • [6] Extension of caution indices to mixed-format tests
    Sinharay, Sandip
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2018, 71 (02): : 363 - 386
  • [7] Classification Consistency and Accuracy for Mixed-Format Tests
    Kim, Stella Y.
    Lee, Won-Chan
    APPLIED MEASUREMENT IN EDUCATION, 2019, 32 (02) : 97 - 115
  • [8] Assessment of Person Fit for Mixed-Format Tests
    Sinharay, Sandip
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2015, 40 (04) : 343 - 365
  • [9] A multidimensional partial credit model with associated item and test statistics: An application to mixed-format tests
    Yao, Lihua
    Schwarz, Richard D.
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2006, 30 (06) : 469 - 492
  • [10] The Impact of Item Feature and Response Preference in a Mixed-Format Design
    Chen, Hui-Fang
    Jin, Kuan-Yu
    MULTIVARIATE BEHAVIORAL RESEARCH, 2022, 57 (2-3) : 208 - 222