The Impact of Data Quantity and Source on the Quality of Data-Driven Hints for Programming

被引:5
|
作者
Price, Thomas W. [1 ]
Zhi, Rui [1 ]
Dong, Yihuan [1 ]
Lytle, Nicholas [1 ]
Barnes, Tiffany [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27606 USA
基金
美国国家科学基金会;
关键词
Data-driven hints; Programming; Hint quality; Cold start; GENERATION;
D O I
10.1007/978-3-319-93843-1_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the domain of programming, intelligent tutoring systems increasingly employ data-driven methods to automate hint generation. Evaluations of these systems have largely focused on whether they can reliably provide hints for most students, and how much data is needed to do so, rather than how useful the resulting hints are to students. We present a method for evaluating the quality of data-driven hints and how their quality is impacted by the data used to generate them. Using two datasets, we investigate how the quantity of data and the source of data (whether it comes from students or experts) impact one hint generation algorithm. We find that with student training data, hint quality stops improving after 15-20 training solutions and can decrease with additional data. We also find that student data outperforms a single expert solution but that a comprehensive set of expert solutions generally performs best.
引用
收藏
页码:476 / 490
页数:15
相关论文
共 50 条
  • [1] An Evaluation of Data-Driven Programming Hints in a Classroom Setting
    Price, Thomas W.
    Marwan, Samiha
    Winters, Michael
    Williams, Joseph Jay
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT II, 2020, 12164 : 246 - 251
  • [2] Automated Data-Driven Hints for Computer Programming Students
    Chow, Sammi
    Yacef, Kalina
    Koprinska, Irena
    Curran, James
    ADJUNCT PUBLICATION OF THE 25TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION (UMAP'17), 2017, : 5 - 10
  • [3] AlphaCode and "data-driven" programming
    Kolter, J. Zico
    SCIENCE, 2022, 378 (6624) : 1056 - 1056
  • [4] A Comparison of the Quality of Data-Driven Programming Hint Generation Algorithms
    Pricer, Thomas W.
    Dong, Yihuan
    Zhi, Rui
    Paassen, Benjamin
    Lytle, Nicholas
    Catete, Veronica
    Barnes, Tiffany
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2019, 29 (03) : 368 - 395
  • [5] A Comparison of the Quality of Data-Driven Programming Hint Generation Algorithms
    Thomas W. Price
    Yihuan Dong
    Rui Zhi
    Benjamin Paaßen
    Nicholas Lytle
    Veronica Cateté
    Tiffany Barnes
    International Journal of Artificial Intelligence in Education, 2019, 29 : 368 - 395
  • [6] Data-Driven Source Localization of Impact on Aircraft Control Surfaces
    Ai, Li
    Soltangharaei, Vafa
    Anay, Rafal
    van Tooren, Michael J. L.
    Ziehl, Paul
    2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
  • [7] Data-Driven Computational Intelligence for Scientific Programming
    Rubio-Largo, Alvaro
    Carlos Preciado, Juan
    Iribarne, Luis
    SCIENTIFIC PROGRAMMING, 2019, 2019
  • [8] A DATA-DRIVEN MODEL FOR A SUBSET OF LOGIC PROGRAMMING
    BIC, L
    LEE, C
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1987, 9 (04): : 618 - 645
  • [9] Towards the Creation of a Data-Driven Programming Tutor
    Mostafavi, Behrooz
    Barnes, Tiffany
    INTELLIGENT TUTORING SYSTEMS, PART II, 2010, 6095 : 239 - 241
  • [10] Measurement uncertainty, data quality and data-driven modelling
    Sommer, Klaus-Dieter
    Schuetze, Andreas
    TM-TECHNISCHES MESSEN, 2024, 91 (09) : 417 - 418