Classification of mathematical test questions using machine learning on datasets of learning management system questions

被引:4
|
作者
Kim, Gun Il [1 ]
Kim, Sungtae [2 ]
Jang, Beakcheol [1 ]
机构
[1] Yonsei Univ, Grad Sch Informat, Seoul, South Korea
[2] ABLE EduTech Inc, Seoul, South Korea
来源
PLOS ONE | 2023年 / 18卷 / 10期
关键词
D O I
10.1371/journal.pone.0286989
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Every student has a varied level of mathematical proficiency. Therefore, it is important to provide them with questions accordingly. Owing to advances in technology and artificial intelligence, the Learning Management System (LMS) has become a popular application to conduct online learning for students. The LMS can store multiple pieces of information on students through an online database, enabling it to recommend appropriate questions for each student based on an analysis of their previous responses to questions. Particularly, the LMS manages learners and provides an online platform that can evaluate their skills. Questions need to be classified according to their difficulty level so that the LMS can recommend them to learners appropriately and thereby increase their learning efficiency. In this study, we classified large-scale mathematical test items provided by ABLE Tech, which supports LMS-based online mathematical education platforms, using various machine learning techniques according to the difficulty levels of the questions. First, through t-test analysis, we identified the significant correlation variables according to the difficulty level. The t-test results showed that answer rate, type of question, and solution time were positively correlated with the difficulty of the question. Second, items were classified according to their difficulty level using various machine learning models, such as logistic regression (LR), random forest (RF), and extreme gradient boosting (xgboost). Accuracy, precision, recall, F1 score, the area under the curve of the receiver operating curve (AUC-ROC), Cohen's Kappa and Matthew's correlation coefficient (MCC) scores were used as the evaluation metrics. The correct answer rate, question type, and time for solving a question correlated significantly with the difficulty level. The machine learning-based xgboost model outperformed the statistical machine learning models, with a 85.7% accuracy, and 85.8% F1 score. These results can be used as an auxiliary tool in recommending suitable mathematical questions to various learners based on their difficulty level.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Learning to Generate Questions by Learning What not to Generate
    Liu, Bang
    Zhao, Mingjun
    Niu, Di
    Lai, Kunfeng
    He, Yancheng
    Wei, Haojie
    Xu, Yu
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1106 - 1118
  • [42] Guidelines to Select Machine Learning Scheme for Classification of Biomedical Datasets
    Tanwani, Ajay Kumar
    Afridi, Jamal
    Shafiq, M. Zubair
    Farooq, Muddassar
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2009, 5483 : 128 - 139
  • [43] Machine Learning-based Classification of Online Industrial Datasets
    Faber, Rastislav
    L'ubusky, Karol
    Paulen, Radoslav
    2023 24TH INTERNATIONAL CONFERENCE ON PROCESS CONTROL, PC, 2023, : 132 - 137
  • [44] Evaluation of Deep Learning and Machine Learning Algorithms for Building Occupancy Classification on Open Datasets
    Cretu, Georgiana
    Stamatescu, Iulia
    Stamatescu, Grigore
    2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 575 - 580
  • [45] FORMULATING QUESTIONS FOR PHYSIOLOGY LEARNING
    Cheng, Hwee Ming
    JOURNAL OF PHYSIOLOGICAL SCIENCES, 2009, 59 : 446 - 446
  • [46] Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets
    Abousaber, Inam
    Abdallah, Haitham F.
    El-Ghaish, Hany
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2025, 7
  • [47] The Questions of Ethics in Learning Analytics
    May, Madeth
    Iksal, Sebastien
    INTELLIGENT TUTORING SYSTEMS, ITS 2016, 2016, 9684 : 502 - 503
  • [48] A refined approach for evaluating small datasets via binary classification using machine learning
    Steinert, Steffen
    Ruf, Verena
    Dzsotjan, David
    Grossmann, Nicolas
    Schmidt, Albrecht
    Kuhn, Jochen
    Kuechemann, Stefan
    PLOS ONE, 2024, 19 (05):
  • [49] REPEATING QUESTIONS IN PROSE LEARNING
    BOYD, WM
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1973, 64 (01) : 31 - 38
  • [50] EFFECT OF QUESTIONS ON VISUAL LEARNING
    DWYER, FM
    PERCEPTUAL AND MOTOR SKILLS, 1970, 30 (01) : 51 - &