Bloom's Taxonomy-based exam question classification: The outcome of CNN and optimal pre-trained word embedding technique

被引：5

作者：

Gani, Mohammed Osman ^{[1
]}

Ayyasamy, Ramesh Kumar ^{[2
]}

Sangodiah, Anbuselvan ^{[3
]}

Fui, Yong Tien ^{[2
]}

机构：

[1] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Kampar 31900, Perak, Malaysia

[2] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Dept Informat Syst, Kampar 31900, Perak, Malaysia

[3] Quest Int Univ, Fac Comp & Engn, Sch Comp, Ipoh 30250, Perak, Malaysia

来源：

EDUCATION AND INFORMATION TECHNOLOGIES | 2023年 / 28卷 / 12期

关键词：

BERT; Bloom's Taxonomy (BT); CNN; Examination question classification; Word embedding;

D O I：

10.1007/s10639-023-11842-1

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

The automated classification of examination questions based on Bloom's Taxonomy (BT) aims to assist the question setters so that high-quality question papers are produced. Most studies to automate this process adopted the machine learning approach, and only a few utilised the deep learning approach. The pre-trained contextual and non-contextual word embedding techniques effectively solved various natural language processing tasks. This study aims to identify the optimal pre-trained word embedding technique and propose a Convolutional Neural Network (CNN) model with the optimal word embedding technique. Therefore, non-contextual word embedding techniques: Word2vec, GloVe, and FastText, whereas contextualised embedding techniques: BERT, RoBERTa, and ELECTRA, were analysed in this study with two datasets. The experiment results showed that FastText is the most optimal technique in the first dataset, whereas RoBERTa is in the second dataset. This outcome of the first dataset differs from the text classification since contextual embedding generally outperforms non-contextual embedding. It could be due to the comparatively smaller size of the first dataset and the shorter length of the examination questions. Since RoBERTa is the most optimal word embedding technique in the second dataset, hence used along with CNN to build the model. This study used CNN instead of Recurrent Neural Networks (RNNs) since extracting relevant features is more important than the learning sequence from data in the context of examination question classification. The proposed CNN model achieved approximately 86% in both weighted F1-score and accuracy and outperformed all the models proposed by past studies, including RNNs. The proposed model's robustness could be assessed in the future using a more comprehensive dataset.

引用

页码：15893 / 15914

页数：22

共 31 条

[1] Bloom’s Taxonomy-based exam question classification: The outcome of CNN and optimal pre-trained word embedding technique
Mohammed Osman Gani
Ramesh Kumar Ayyasamy
Anbuselvan Sangodiah
Yong Tien Fui
Education and Information Technologies, 2023, 28 : 15893 - 15914
[2] Transfer learning for Bloom’s taxonomy-based question classification
Chindukuri, Mallikarjuna
Sivanesan, Sangeetha
Neural Computing and Applications, 2024, 36 (31) : 19915 - 19937
[3] Mining Exam Question based on Bloom's Taxonomy
Tanalol, Siti Hasnah
Fattah, Salmah
Sulong, Rina Suryani
Mamat, Mazlina
KMICE 2008 - KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE, 2008 - TRANSFERRING, MANAGING AND MAINTAINING KNOWLEDGE FOR NATION CAPACITY DEVELOPMENT, 2008, : 424 - 427
[4] An Enhanced Sentiment Analysis Framework Based on Pre-Trained Word Embedding
Mohamed, Ensaf Hussein
Moussa, Mohammed ElSaid
Haggag, Mohamed Hassan
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2020, 19 (04)
[5] USTW Vs. STW: A Comparative Analysis for Exam Question Classification based on Bloom’s Taxonomy
Gani M.O.
Ayyasamy R.K.
Fui T.
Sangodiah A.
Mendel, 2022, 28 (02) : 25 - 40
[6] Pattern Augmentation for Handwritten Digit Classification based on Combination of Pre-trained CNN and SVM
Shima, Yoshihiro
Nakashima, Yumi
Yasuda, Michio
2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT), 2017,
[7] Image Augmentation for Object Image Classification Based On Combination of Pre-Trained CNN and SVM
Shima, Yoshihiro
2ND INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2018), 2018, 1004
[8] Hybrid Feature Fusion Using RNN and Pre-trained CNN for Classification of Alzheimer's Disease
Jabason, Emimal
Ahmad, M. Omair
Swamy, M. N. S.
2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
[9] An optimal deep learning approach for breast cancer detection and classification with pre-trained CNN-based feature learning mechanism
Meena, L. C.
Joe Prathap, P. M.
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024,
[10] On The Optimal Classifier For Affective Vocal Bursts And Stuttering Predictions Based On Pre-Trained Acoustic Embedding
Atmaja, Bagus Tris
Zanjabila
Sasou, Akira
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1690 - 1695

← 1 2 3 4 →