A three-stage machine learning and inference approach for educational data

被引:0
|
作者
Da, Ting [1 ]
机构
[1] Beijing Normal Univ, Natl Engn Res Ctr Cyberlearning & Intelligent Tech, Beijing, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Machine learning; Causal inference; OLS regression; Instrumental variable (IV); LASSO; STUDENT-ACHIEVEMENT; PERFORMANCE; SELECTION; ABSENCES; REGRESSION; MODEL;
D O I
10.1038/s41598-025-89394-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A central task in educational studies is to uncover factors that drive a student's academic performance. While existing studies have utilized meticulous regression designs, it is challenging to select appropriate controls. Machine learning, however, offers a solution whereby the entire variable set can be inspected and filtered by different optimization schemes. In that light, this paper adopts a three-stage framework to analyze and discover potentially latent causal relationships from an open dataset from UCI. In the first stage, machine learning methods are employed to select candidate variables that are closely associated with student grades, and then a "post-double-selection" process is implemented to select the set of control variables. In the final stage, three case studies are conducted to illustrate the effectiveness of the three-stage design. The model pipeline is suitable for situations where there is only minimal prior knowledge available to address a potentially causal research question.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A three-stage data mining model for reject inference
    Chen, Weimin
    Liu, Youjin
    Xiang, Guocheng
    Liu, Yongqing
    Wang, Kexi
    2012 FIFTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE), 2012, : 34 - 38
  • [2] A Three-Stage Approach to a Multirow Parallel Machine Layout Problem
    Zuo, Xingquan
    Gao, Shubing
    Zhou, MengChu
    Yang, Xin
    Zhao, Xinchao
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (01) : 433 - 447
  • [3] A three-stage machine learning network security solution for public entities
    Saganowski, Stanislaw
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1098 - 1105
  • [4] Longitudinal data with nonstationary errors:: a nonparametric three-stage approach
    Nuñez-Antón, V
    Rodríguez-Póo, JM
    Vieu, P
    TEST, 1999, 8 (01) : 201 - 231
  • [5] Longitudinal data with nonstationary errors: a nonparametric three-stage approach
    Vicente Núñez-Antón
    Juan M. Rodríguez-Póo
    Philippe Vieu
    Test, 1999, 8 : 201 - 231
  • [6] Automation of surgical skill assessment using a three-stage machine learning algorithm
    Lavanchy, Joeel L.
    Zindel, Joel
    Kirtac, Kadir
    Twick, Isabell
    Hosgor, Enes
    Candinas, Daniel
    Beldi, Guido
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [7] Vehicle recognition by silhouettes - a three-stage machine learning method in computer vision systems
    Duke, Vyacheslav A.
    Malygin, Igor G.
    Pritsker, Vladimir, I
    MARINE INTELLECTUAL TECHNOLOGIES, 2022, (02): : 162 - 167
  • [8] Sensorless Control System for Three-Stage Synchronous Machine
    Sidorov, Vadim E.
    Zharkov, Maksim A.
    Khoroshev, Maxim A.
    Dubkova, Regina Yu
    2019 20TH INTERNATIONAL CONFERENCE OF YOUNG SPECIALISTS ON MICRO/NANOTECHNOLOGIES AND ELECTRON DEVICES (EDM 2019), 2019, : 476 - 480
  • [9] A three-stage computational approach to network matching
    Xiong, DM
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2000, 8 (1-6) : 71 - 89
  • [10] Three-stage approach to ultrasound contrast detection
    Kirkhorn, J
    Frinking, PJA
    de Jong, N
    Torp, H
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2001, 48 (04) : 1013 - 1022