Classification of mathematical test questions using machine learning on datasets of learning management system questions

Cited by: 4
Authors
Kim, Gun Il [1]
Kim, Sungtae [2]
Jang, Beakcheol [1]
Affiliations
[1] Yonsei Univ, Grad Sch Informat, Seoul, South Korea
[2] ABLE EduTech Inc, Seoul, South Korea
Source
PLOS ONE | 2023 / Vol. 18 / Issue 10
DOI
10.1371/journal.pone.0286989
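Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code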
07 ; 0710 ; 09 ;
Abstract
Students differ in mathematical proficiency, so it is important to give each of them questions matched to their level. Owing to advances in technology and artificial intelligence, the Learning Management System (LMS) has become a popular application for conducting online learning. The LMS can store multiple pieces of information on students in an online database, enabling it to recommend appropriate questions for each student based on an analysis of their previous responses. In particular, the LMS manages learners and provides an online platform that can evaluate their skills. Questions need to be classified by difficulty level so that the LMS can recommend them to learners appropriately and thereby increase their learning efficiency. In this study, we classified large-scale mathematical test items provided by ABLE Tech, which supports LMS-based online mathematical education platforms, according to the difficulty levels of the questions using various machine learning techniques. First, through t-test analysis, we identified the variables significantly correlated with difficulty level. The t-test results showed that answer rate, type of question, and solution time were positively correlated with the difficulty of the question. Second, items were classified according to their difficulty level using various machine learning models, such as logistic regression (LR), random forest (RF), and extreme gradient boosting (xgboost). Accuracy, precision, recall, F1 score, the area under the receiver operating characteristic curve (AUC-ROC), Cohen's kappa, and the Matthews correlation coefficient (MCC) were used as the evaluation metrics. The correct answer rate, question type, and time for solving a question correlated significantly with the difficulty level. The machine learning-based xgboost model outperformed the statistical machine learning models, with an accuracy of 85.7% and an F1 score of 85.8%. These results can be used as an auxiliary tool in recommending suitable mathematical questions to various learners based on question difficulty.
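As an illustration of the evaluation metrics named in the abstract, the following minimal sketch computes accuracy, precision, recall, F1 score, Cohen's kappa, and MCC from predicted and true labels. It is not the authors' code: it assumes a binary difficulty label (1 = hard, 0 = easy), which the abstract does not specify, and the function name `binary_metrics` and the sample labels are hypothetical. AUC-ROC is omitted because it requires predicted scores rather than hard labels.

```python
from math import sqrt


def binary_metrics(y_true, y_pred):
    """Return common classification metrics for binary labels.

    Assumption (not from the source): labels are 1 (hard) or 0 (easy).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    n = tp + tn + fp + fn

    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_e = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - p_e) / (1 - p_e) if p_e != 1 else 0.0

    # Matthews correlation coefficient; defined as 0 when a margin is empty.
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Hypothetical labels for eight questions, for illustration only.
    true_labels = [1, 1, 1, 1, 0, 0, 0, 0]
    predicted = [1, 1, 1, 0, 0, 0, 0, 1]
    for name, value in binary_metrics(true_labels, predicted).items():
        print(f"{name}: {value:.3f}")
```

With these hypothetical labels, three of four questions are classified correctly in each class, so accuracy, precision, recall, and F1 all equal 0.75, while Cohen's kappa and MCC both equal 0.5. The lower kappa and MCC values reflect that half of the observed agreement would be expected by chance on a balanced set.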
Pages: 17