Learning to Reuse Distractors to Support Multiple-Choice Question Generation in Education

被引：6

作者：

Bitew, Semere Kiros ^{[1
]}

Hadifar, Amir ^{[1
]}

Sterckx, Lucas ^{[2
]}

Deleu, Johannes ^{[1
]}

Develder, Chris ^{[1
]}

Demeester, Thomas ^{[1
]}

机构：

[1] Ghent Univ imec, Internet Technol & Data Sci Lab, Text to Knowledge Team, B-9052 Ghent, Belgium

[2] LynxCare, B-3000 Leuven, Belgium

来源：

IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES | 2024年 / 17卷

关键词：

Context modeling; Task analysis; Semantics; Agricultural machinery; Vocabulary; Guidelines; Benchmark testing; Distractor generation; multiple-choice question (MCQ); natural language processing (NLP); online learning; transformers;

D O I：

10.1109/TLT.2022.3226523

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Multiple-choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, owing to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an expensive and time-consuming task. A particularly sensitive aspect of MCQ creation is to devise relevant distractors, i.e., wrong answers that are not easily identifiable as being wrong. This article studies how a large existing set of manually created answers and distractors for questions over a variety of domains, subjects, and languages can be leveraged to help teachers in creating new MCQs, by the smart reuse of existing distractors. We built several data-driven models based on context-aware question and distractor representations and compared them with static feature-based models. The proposed models are evaluated with automated metrics and in a realistic user test with teachers. Both automatic and human evaluations indicate that context-aware models consistently outperform a static feature-based approach. For our best-performing context-aware model, on average, three distractors out of the ten shown to teachers were rated as high-quality distractors. We create a performance benchmark, and make it public, to enable comparison between different approaches and to introduce a more standardized evaluation of the task. The benchmark contains a test of 298 educational questions covering multiple subjects and languages and a 77k multilingual pool of distractor vocabulary for future research.

引用

页码：375 / 390

页数：16

共 50 条

[41] HARMONIZING MULTIPLE-CHOICE QUESTION MARKS WITH ESSAY MARKS
ASHBY, D
BARON, DN
MEDICAL EDUCATION, 1986, 20 (04) : 321 - 323
[42] STUDENTS EXPERIENCES IN STUDYING FOR MULTIPLE-CHOICE QUESTION EXAMINATIONS
SCOULLER, KM
PROSSER, M
STUDIES IN HIGHER EDUCATION, 1994, 19 (03) : 267 - 279
[43] OPTIMIZING MARKS OBTAINED IN MULTIPLE-CHOICE QUESTION EXAMINATIONS
MITCHELL, G
FORD, DM
PRINZ, W
MEDICAL TEACHER, 1986, 8 (01) : 49 - 53
[44] Positive Impact of Multiple-Choice Question Authoring and Regular Quiz Participation on Student Learning
Riggs, C. Daniel
Kang, Sohee
Rennie, Olivia
CBE-LIFE SCIENCES EDUCATION, 2020, 19 (02):
[45] Multiple-choice question tests: a convenient, flexible and effective learning tool? A case study
Douglas, Mercedes
Wilson, Juliette
Ennis, Sean
INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2012, 49 (02) : 111 - 121
[46] FURTHER SUPPORT FOR CHANGING MULTIPLE-CHOICE ANSWERS
FABREY, LJ
CASE, SM
JOURNAL OF MEDICAL EDUCATION, 1985, 60 (06): : 488 - 490
[47] Evaluating Human and Automated Generation of Distractors for Diagnostic Multiple-Choice Cloze Questions to Assess Children's Reading Comprehension
Huang, Yi-Ting
Mostow, Jack
ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2015, 2015, 9112 : 155 - 164
[48] QUALITY AND FEATURE OF MULTIPLE-CHOICE QUESTIONS IN EDUCATION
Jia, Bing
He, Dan
Zhu, Zhemin
PROBLEMS OF EDUCATION IN THE 21ST CENTURY, 2020, 78 (04) : 576 - 594
[49] An empirical analysis of online multiple-choice question-generation learning activity for the enhancement of students' cognitive strategy development while learning science
Yu, F. Y.
Hung, C. C.
RECENT PROGRESS IN COMPUTATIONAL SCIENCES AND ENGINEERING, VOLS 7A AND 7B, 2006, 7A-B : 585 - +
[50] Multiple-Choice Testing in Education: Are the Best Practices for Assessment Also Good for Learning?
Butler, Andrew C.
JOURNAL OF APPLIED RESEARCH IN MEMORY AND COGNITION, 2018, 7 (03) : 323 - 331

← 1 2 3 4 5 →