Bloom's Taxonomy-based exam question classification: The outcome of CNN and optimal pre-trained word embedding technique

Cited by: 5
Authors
Gani, Mohammed Osman [1]
Ayyasamy, Ramesh Kumar [2]
Sangodiah, Anbuselvan [3]
Fui, Yong Tien [2]
Affiliations
[1] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Kampar 31900, Perak, Malaysia
[2] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Dept Informat Syst, Kampar 31900, Perak, Malaysia
[3] Quest Int Univ, Fac Comp & Engn, Sch Comp, Ipoh 30250, Perak, Malaysia
Keywords
BERT; Bloom's Taxonomy (BT); CNN; Examination question classification; Word embedding
DOI
10.1007/s10639-023-11842-1
Chinese Library Classification number
G40 [Education]
Subject classification code
040101; 120403
Abstract
The automated classification of examination questions based on Bloom's Taxonomy (BT) aims to help question setters produce high-quality question papers. Most studies that automate this process adopted a machine learning approach, and only a few used deep learning. Pre-trained contextual and non-contextual word embedding techniques have proved effective across a range of natural language processing tasks. This study aims to identify the optimal pre-trained word embedding technique and to propose a Convolutional Neural Network (CNN) model built on it. To that end, three non-contextual techniques (Word2vec, GloVe, and FastText) and three contextualised techniques (BERT, RoBERTa, and ELECTRA) were analysed on two datasets. The experimental results showed that FastText performed best on the first dataset, whereas RoBERTa performed best on the second. The result on the first dataset departs from what is usually observed in text classification, where contextual embeddings generally outperform non-contextual ones. This could be due to the comparatively small size of the first dataset and the short length of the examination questions. Because RoBERTa was the best-performing technique on the second dataset, it was combined with a CNN to build the model. The study used a CNN rather than Recurrent Neural Networks (RNNs) because, for examination question classification, extracting relevant features matters more than learning the sequence of the data. The proposed CNN model achieved approximately 86% in both weighted F1-score and accuracy and outperformed all the models proposed in past studies, including RNNs. The robustness of the proposed model could be assessed in future work using a more comprehensive dataset.
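The abstract's argument for a CNN over an RNN rests on feature extraction: a convolution filter responds to a short run of tokens, and max-over-time pooling keeps the strongest response wherever it occurs in the question. The following is a minimal pure-Python sketch of that forward pass, not the authors' model. Everything in it is an assumption made for illustration: the random vectors stand in for RoBERTa token embeddings, the weights are untrained, the sizes are arbitrary, and the label set is taken to be the six levels of the original Bloom's Taxonomy, which this record does not confirm.

```python
import random

# Assumption: the six levels of the original Bloom's Taxonomy as class labels;
# this record does not list the label set used in the study.
BLOOM_LEVELS = ["Knowledge", "Comprehension", "Application",
                "Analysis", "Synthesis", "Evaluation"]


def conv_max_pool(tokens, filters, biases):
    """Apply each filter over the token sequence, then ReLU and max over time.

    tokens:  list of embedding vectors, one per token
    filters: list of kernels, each a list of `width` weight vectors
    Returns one non-negative feature per filter.
    """
    features = []
    for kernel, bias in zip(filters, biases):
        width = len(kernel)  # number of consecutive tokens the filter spans
        best = 0.0           # ReLU floor; also the result if no window fits
        for start in range(len(tokens) - width + 1):
            window = tokens[start:start + width]
            activation = bias + sum(
                w * x
                for k_row, t_row in zip(kernel, window)
                for w, x in zip(k_row, t_row)
            )
            best = max(best, activation)
        features.append(best)
    return features


def classify(tokens, filters, biases, out_weights, out_biases):
    """Pool convolutional features, apply a linear layer, return the top label."""
    features = conv_max_pool(tokens, filters, biases)
    logits = [b + sum(w * f for w, f in zip(row, features))
              for row, b in zip(out_weights, out_biases)]
    return BLOOM_LEVELS[logits.index(max(logits))], features


def random_matrix(rng, rows, cols):
    """Hypothetical helper: untrained weights or stand-in embeddings."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
EMBED_DIM, NUM_FILTERS, FILTER_WIDTH = 8, 4, 3  # illustrative sizes only

# Stand-in for the embeddings of a six-token examination question.
question = random_matrix(rng, 6, EMBED_DIM)
filters = [random_matrix(rng, FILTER_WIDTH, EMBED_DIM) for _ in range(NUM_FILTERS)]
biases = [0.0] * NUM_FILTERS
out_weights = random_matrix(rng, len(BLOOM_LEVELS), NUM_FILTERS)
out_biases = [0.0] * len(BLOOM_LEVELS)

label, features = classify(question, filters, biases, out_weights, out_biases)
print(label, features)
```

Because the weights are random, the predicted label carries no meaning here. The point of the sketch is the pooling step: a filter's feature depends on the best-matching window rather than on its position, which suits short questions where a few cue words carry most of the signal.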
Pages: 15893-15914
Number of pages: 22
Related papers
31 in total
  • [1] Bloom’s Taxonomy-based exam question classification: The outcome of CNN and optimal pre-trained word embedding technique
    Mohammed Osman Gani
    Ramesh Kumar Ayyasamy
    Anbuselvan Sangodiah
    Yong Tien Fui
    Education and Information Technologies, 2023, 28 : 15893 - 15914
  • [2] Transfer learning for Bloom’s taxonomy-based question classification
    Chindukuri, Mallikarjuna
    Sivanesan, Sangeetha
    Neural Computing and Applications, 2024, 36 (31) : 19915 - 19937
  • [3] Mining Exam Question based on Bloom's Taxonomy
    Tanalol, Siti Hasnah
    Fattah, Salmah
    Sulong, Rina Suryani
    Mamat, Mazlina
    KMICE 2008 - KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE, 2008 - TRANSFERRING, MANAGING AND MAINTAINING KNOWLEDGE FOR NATION CAPACITY DEVELOPMENT, 2008, : 424 - 427
  • [4] An Enhanced Sentiment Analysis Framework Based on Pre-Trained Word Embedding
    Mohamed, Ensaf Hussein
    Moussa, Mohammed ElSaid
    Haggag, Mohamed Hassan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2020, 19 (04)
  • [5] USTW Vs. STW: A Comparative Analysis for Exam Question Classification based on Bloom’s Taxonomy
    Gani M.O.
    Ayyasamy R.K.
    Fui T.
    Sangodiah A.
    Mendel, 2022, 28 (02) : 25 - 40
  • [6] Pattern Augmentation for Handwritten Digit Classification based on Combination of Pre-trained CNN and SVM
    Shima, Yoshihiro
    Nakashima, Yumi
    Yasuda, Michio
    2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT), 2017,
  • [7] Image Augmentation for Object Image Classification Based On Combination of Pre-Trained CNN and SVM
    Shima, Yoshihiro
    2ND INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2018), 2018, 1004
  • [8] Hybrid Feature Fusion Using RNN and Pre-trained CNN for Classification of Alzheimer's Disease
    Jabason, Emimal
    Ahmad, M. Omair
    Swamy, M. N. S.
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [9] An optimal deep learning approach for breast cancer detection and classification with pre-trained CNN-based feature learning mechanism
    Meena, L. C.
    Joe Prathap, P. M.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024,
  • [10] On The Optimal Classifier For Affective Vocal Bursts And Stuttering Predictions Based On Pre-Trained Acoustic Embedding
    Atmaja, Bagus Tris
    Zanjabila
    Sasou, Akira
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1690 - 1695