Metrics for Estimating Validity, Reliability and Bias in Peer Assessment

Authors
Molina-Carmona, Rafael [1]
Satorre-Cuerda, Rosana [1]
Compan-Rosique, Patricia [1]
Llorens-Largo, Faraon [1]
Affiliations
[1] Univ Alicante, Catedra Santander UA Transformac Digital, Ctra San Vicente del Raspeig S-N, Alicante 03690, Spain
Keywords
peer assessment; success rate; agreement degree; reliability; validity; bias; confusion matrix; automatic classification
DOI
Not available
Chinese Library Classification
G40 [Education]
Discipline classification codes
040101; 120403
Abstract
Peer assessment is a widespread way of evaluating and rating the quality of work in education. Although it has proven to be a very effective learning instrument, it is subject to potential problems of reliability and validity and to several possible biases. Most studies that examine and try to solve these problems focus on specific cases, and the statistics they use to measure reliability, validity, or bias are global: they give a single value for the whole process and do not allow reviewers to be studied individually. This work takes a different approach. It proposes metrics for the reliability and validity of each reviewer, together with an approximation of the biases that may appear in the assessment process, so that the review process can itself be assessed. We propose an analogy between the work of a reviewer in peer assessment and the operation of an automatic classifier, which allows the usual measures of classifier quality to be used to establish the quality of peer assessment. Each reviewer is characterized by a confusion matrix and six new indicators: success rate (which estimates validity); agreement degree (a measure of reliability); the assessment median and its interquartile range (to estimate central tendency and restriction-of-range biases); and the average distance to the diagonal and its standard deviation (to detect possible leniency and harshness biases). The method provides indicators of each reviewer's performance and supports the detection of different reviewer profiles, so that the teacher can assess the students' work as reviewers and introduce correction mechanisms in the final assessment of the works. A practical application to an engineering degree is provided to illustrate the potential of the method.
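The classifier analogy in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formulas, so every definition below is an assumption. The sketch assumes grades are integer levels from 0 to n_levels - 1, that a reference grade (for example the teacher's) exists for each work, that success rate is the share of reviews on the diagonal of the confusion matrix, and that distance to the diagonal is the signed difference between the assigned and the reference grade. Agreement degree is left out because the abstract does not say what the reviewer's grades are compared against. The names `confusion_matrix` and `reviewer_indicators` are hypothetical.

```python
from statistics import mean, median, pstdev, quantiles


def confusion_matrix(reference, assigned, n_levels):
    """Rows: reference grade level; columns: level given by the reviewer."""
    matrix = [[0] * n_levels for _ in range(n_levels)]
    for ref, got in zip(reference, assigned):
        matrix[ref][got] += 1
    return matrix


def reviewer_indicators(reference, assigned, n_levels):
    """Per-reviewer indicators under the assumptions stated in the lead-in."""
    if len(reference) != len(assigned) or len(assigned) < 2:
        raise ValueError("need at least two paired grades")
    matrix = confusion_matrix(reference, assigned, n_levels)
    # Assumed success rate: fraction of reviews on the diagonal,
    # i.e. the accuracy of the reviewer seen as a classifier.
    success_rate = sum(matrix[i][i] for i in range(n_levels)) / len(assigned)
    # Assumed signed distance to the diagonal: positive values mean the
    # reviewer graded higher than the reference (leniency), negative lower.
    distances = [got - ref for ref, got in zip(reference, assigned)]
    q1, _, q3 = quantiles(assigned, n=4, method="inclusive")
    return {
        "confusion_matrix": matrix,
        "success_rate": success_rate,
        "median": median(assigned),
        "iqr": q3 - q1,
        "mean_distance": mean(distances),
        "std_distance": pstdev(distances),
    }


# Made-up grades for one reviewer on a four-level scale (0..3).
reference = [0, 1, 2, 3, 2, 1]
assigned = [1, 1, 3, 3, 3, 2]
result = reviewer_indicators(reference, assigned, n_levels=4)
```

With these made-up grades the reviewer matches the reference on two of six works and the mean signed distance is positive, which under the stated assumptions would point towards leniency. A narrow interquartile range around a mid-scale median would instead suggest central tendency or restriction of range.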
Pages: 968-980
Number of pages: 13