Automated Analysis of Algorithm Descriptions Quality, Through Large Language Models

被引:0
|
作者
Sterbini, Andrea [1 ]
Temperini, Marco [2 ]
机构
[1] Sapienza Univ Rome, Dept Comp Sci, Rome, Italy
[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Rome, Italy
关键词
Large Language Models; LLM-based Text Similarity; Peer Assessment; Automated Assessment;
D O I
10.1007/978-3-031-63028-6_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a method to classify the students' textual descriptions of algorithms. This work is based on a wealth of data (programming tasks, related algorithm descriptions, and Peer Assessment data), coming from 6 years of use of the system Q2A, in a "Fundamentals of Computer Programming" course, given at first year in our university's Computer Science curriculum. The descriptions are submitted, as part of the answer to a computer programming task, through Q2A, and are subject to (formative) Peer Assessment. The proposed classification method aims to support the teacher on the analysis of the quite numerous students' descriptions, in ours as well as in other similar systems. We 1) process the students' submissions, by topic automated extraction (BERTopic) and by separate Large Language Models, 2) compute their degree of suitability as "algorithm description", in a scale from BAD to GOOD, and 3) compare the obtained classification with those coming from the teacher's direct assessment (expert: one of the authors), and from the Peer Assessment. The automated classification does correlate with both the expert classification and the grades given by the peers to the "clarity" of the descriptions. This result is encouraging in view of the production of a Q2A subsystem allowing the teacher to analyse the students' submissions guided by an automated classification, and ultimately support fully automated grading.
引用
收藏
页码:258 / 271
页数:14
相关论文
共 50 条
  • [1] Automated Topic Analysis with Large Language Models
    Kirilenko, Andrei
    Stepchenkova, Svetlana
    INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 2024, ENTER 2024, 2024, : 29 - 34
  • [2] Leveraging Large Language Models for Automated Dialogue Analysis
    Finch, Sarah E.
    Paek, Ellie S.
    Choi, Jinho D.
    24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 202 - 215
  • [3] Trend Analysis Through Large Language Models
    Alzapiedi, Lucas
    Bihl, Trevor
    IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, NAECON 2024, 2024, : 370 - 374
  • [4] Improving requirements completeness: automated assistance through large language models
    Dipeeka Luitel
    Shabnam Hassani
    Mehrdad Sabetzadeh
    Requirements Engineering, 2024, 29 : 73 - 95
  • [5] Improving requirements completeness: automated assistance through large language models
    Luitel, Dipeeka
    Hassani, Shabnam
    Sabetzadeh, Mehrdad
    REQUIREMENTS ENGINEERING, 2024, 29 (01) : 73 - 95
  • [6] Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis
    Li, Peiyao
    Castelo, Noah
    Katona, Zsolt
    Sarvary, Miklos
    MARKETING SCIENCE, 2024, 43 (02) : 254 - 266
  • [7] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
  • [8] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    SPLASH Companion 2023 - Companion Proceedings of the 2023 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, 2023, : 7 - 9
  • [9] Extracting phenotypes from clinical descriptions using large language models: a comparison between automated and manual approach.
    Berardelli, Silvia
    Gazzo, Andrea
    De Paoli, Federica
    Limongelli, Ivan
    Rizzo, Ettore
    Magni, Paolo
    Zucca, Susanna
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1630 - 1631
  • [10] Understanding Telecom Language Through Large Language Models
    Bariah, Lina
    Zou, Hang
    Zhao, Qiyang
    Mouhouche, Belkacem
    Bader, Faouzi
    Debbah, Merouane
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6542 - 6547