The Impact of Data Quantity and Source on the Quality of Data-Driven Hints for Programming

被引:5
|
作者
Price, Thomas W. [1 ]
Zhi, Rui [1 ]
Dong, Yihuan [1 ]
Lytle, Nicholas [1 ]
Barnes, Tiffany [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27606 USA
基金
美国国家科学基金会;
关键词
Data-driven hints; Programming; Hint quality; Cold start; GENERATION;
D O I
10.1007/978-3-319-93843-1_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the domain of programming, intelligent tutoring systems increasingly employ data-driven methods to automate hint generation. Evaluations of these systems have largely focused on whether they can reliably provide hints for most students, and how much data is needed to do so, rather than how useful the resulting hints are to students. We present a method for evaluating the quality of data-driven hints and how their quality is impacted by the data used to generate them. Using two datasets, we investigate how the quantity of data and the source of data (whether it comes from students or experts) impact one hint generation algorithm. We find that with student training data, hint quality stops improving after 15-20 training solutions and can decrease with additional data. We also find that student data outperforms a single expert solution but that a comprehensive set of expert solutions generally performs best.
引用
收藏
页码:476 / 490
页数:15
相关论文
共 50 条
  • [41] DQSOps: Data Quality Scoring Operations Framework for Data-Driven Applications
    Bayram, Firas
    Ahmed, Bestoun S.
    Hallin, Erik
    Engman, Anton
    27TH INTERNATIONAL CONFERENCE ON EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, EASE 2023, 2023, : 32 - 41
  • [42] Data-driven analysis of soil quality indicators using limited data
    Pulido Moncada, Mansonia
    Gabriels, Donald
    Cornelis, Willi M.
    GEODERMA, 2014, 235 : 271 - 278
  • [43] Data-driven voice source waveform analysis and synthesis
    Gudnason, Jon
    Thomas, Mark R. P.
    Ellis, Daniel P. W.
    Naylor, Patrick A.
    SPEECH COMMUNICATION, 2012, 54 (02) : 199 - 211
  • [44] Towards Data-Driven Programming Problem Generation for Mastery Learning
    Lytle, Nicholas
    ICER '19 - PROCEEDINGS OF THE 2019 ACM CONFERENCE ON INTERNATIONAL COMPUTING EDUCATION RESEARCH, 2019, : 339 - 340
  • [45] A Data-Driven Approach to Compare the Syntactic Difficulty of Programming Languages
    Lokkila, Erno
    Christopoulos, Athanasios
    Laakso, Mikko-Jussi
    Journal of Information Systems Education, 2023, 34 (01) : 84 - 93
  • [46] A Data-Driven Policy Iteration Scheme based on Linear Programming
    Banjac, Goran
    Lygeros, John
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 816 - 821
  • [47] SeeQ: A Programming Model for Portable Data-driven Building Applications
    Mavrokapnidis, Dimitris
    Fierro, Gabe
    Husmann, Maria
    Korolija, Ivan
    Rovas, Dimitrios
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 159 - 168
  • [48] Data-Driven Program Synthesis for Hint Generation in Programming Tutors
    Lazar, Timotej
    Bratko, Ivan
    INTELLIGENT TUTORING SYSTEMS, ITS 2014, 2014, 8474 : 306 - 311
  • [49] Data-Driven Differential Dynamic Programming Using Gaussian Processes
    Pan, Yunpeng
    Theodorou, Evangelos A.
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 4467 - 4472
  • [50] Data-Driven Control of Unknown Systems: A Linear Programming Approach
    Tanzanakis, Alexandros
    Lygeros, John
    IFAC PAPERSONLINE, 2020, 53 (02): : 7 - 13