Predicting Difficulty and Discrimination of Natural Language Questions

Cited by: 0
Authors
Byrd, Matthew A. [1]
Srivastava, Shashank [1]
Affiliations
[1] Univ North Carolina Chapel Hill, Chapel Hill, NC 27599 USA
Keywords
ITEM RESPONSE THEORY; READABILITY
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Item Response Theory (IRT) has been extensively used to numerically characterize question difficulty and discrimination for human subjects in domains including cognitive psychology and education (Primi et al., 2014; Downing, 2003). More recently, IRT has been used to similarly characterize item difficulty and discrimination for natural language models across various datasets (Lalor et al., 2019; Vania et al., 2021; Rodriguez et al., 2021). In this work, we explore predictive models for directly estimating and explaining these traits for natural language questions in a question-answering context. We use HotpotQA for illustration. Our experiments show that it is possible to predict both difficulty and discrimination parameters for new questions, and these traits are correlated with features of questions, answers, and associated contexts. Our findings can have significant implications for the creation of new datasets and tests on the one hand and strategies such as active learning and curriculum learning on the other.
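As a point of reference for the two traits named in the abstract, the sketch below shows how difficulty and discrimination enter a two-parameter logistic (2PL) IRT model. The abstract does not say which IRT variant the authors fit, so the 2PL form is an assumption here, and the function name and the example parameter values are purely illustrative.

```python
import math


def p_correct(theta: float, discrimination: float, difficulty: float) -> float:
    """Probability of a correct response under a 2PL IRT model (assumed variant).

    theta          -- latent ability of the subject (human or model)
    discrimination -- how sharply the item separates low from high ability
    difficulty     -- ability level at which the probability is exactly 0.5
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


# Illustrative values only; these are not taken from the paper.
abilities = [-2.0, 0.0, 2.0]

# An easy item with low discrimination: probabilities change slowly with ability.
easy_flat = [p_correct(t, 0.5, -1.0) for t in abilities]

# A hard item with high discrimination: probabilities change quickly near theta = 1.
hard_sharp = [p_correct(t, 2.0, 1.0) for t in abilities]

for t, p_easy, p_hard in zip(abilities, easy_flat, hard_sharp):
    print(f"theta={t:+.1f}  easy/flat={p_easy:.3f}  hard/sharp={p_hard:.3f}")
```

Under this assumed form, a higher difficulty shifts the curve toward higher ability, while a higher discrimination makes the curve steeper around that point.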
Pages: 119-130
Page count: 12