Big Data Application in Education: Dropout Prediction in Edx MOOCs

被引:63
|
作者
Liang, Jiajun [1 ]
Yang, Jian [2 ]
Wu, Yongji [3 ]
Li, Chao [3 ]
Zheng, Li [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing, Peoples R China
[3] Tsinghua Univ, Res Inst Informat Technol, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
MOOC; Big Data; Dropout prediction; Supervised learning;
D O I
10.1109/BigMM.2016.70
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Educational Data Mining and Learning Analytics are two growing fields of study, trying to make sense of education data and to improve teaching and learning experience. We study dropout prediction in Massively Open Online Courses (MOOCS), where the goal is given student's learning behavior log data in one month, to predict whether students would drop out in next ten days. We collect 39 courses data from XuetangX platform, which is based on the open source Edx platform. We describe our complete approach to cope with drop out prediction task, including data extraction from Edx platform, data preprocessing, feature engineering and performance test on several supervised classification model such as SVM, Logistics Regression, Random Forest and Gradient Boosting Decision Tree. We achieve 88% accuracy in dropout prediction task with GBDT model.
引用
收藏
页码:440 / 443
页数:4
相关论文
共 50 条