Predicting Difficulty and Discrimination of Natural Language Questions

Authors
Byrd, Matthew A. [1]
Srivastava, Shashank [1]
Affiliations
[1] Univ North Carolina Chapel Hill, Chapel Hill, NC 27599 USA
Keywords
Item Response Theory; Readability
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Item Response Theory (IRT) has been extensively used to numerically characterize question difficulty and discrimination for human subjects in domains including cognitive psychology and education (Primi et al., 2014; Downing, 2003). More recently, IRT has been used to similarly characterize item difficulty and discrimination for natural language models across various datasets (Lalor et al., 2019; Vania et al., 2021; Rodriguez et al., 2021). In this work, we explore predictive models for directly estimating and explaining these traits for natural language questions in a question-answering context. We use HotpotQA for illustration. Our experiments show that it is possible to predict both difficulty and discrimination parameters for new questions, and these traits are correlated with features of questions, answers, and associated contexts. Our findings can have significant implications for the creation of new datasets and tests on the one hand and strategies such as active learning and curriculum learning on the other.
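For readers unfamiliar with the two traits named in the abstract, the following is a minimal sketch of how difficulty and discrimination enter a two-parameter logistic (2PL) IRT model. The abstract does not state which IRT variant the authors fit, so the 2PL form is an assumption here, and the function name `p_correct` and all parameter values are hypothetical illustrations rather than anything taken from the paper.

```python
import math


def p_correct(theta: float, difficulty: float, discrimination: float) -> float:
    """Probability of a correct response under a 2PL IRT model (assumed form).

    theta          -- latent ability of the subject (human or model)
    difficulty     -- item difficulty b: the ability at which P(correct) = 0.5
    discrimination -- item discrimination a: slope of the curve around b
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


# Two hypothetical items: an easy, highly discriminating one
# and a hard, weakly discriminating one.
easy_sharp = {"difficulty": -1.0, "discrimination": 2.0}
hard_flat = {"difficulty": 1.5, "discrimination": 0.5}

for theta in (-2.0, 0.0, 2.0):
    print(
        f"theta={theta:+.1f}",
        f"easy_sharp={p_correct(theta, **easy_sharp):.3f}",
        f"hard_flat={p_correct(theta, **hard_flat):.3f}",
    )
```

In this form, a larger discrimination makes the probability of a correct answer change more steeply as ability crosses the difficulty threshold, which is why such items separate strong and weak subjects more sharply.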
Pages: 119-130
Page count: 12