Automated Analysis of Algorithm Descriptions Quality, Through Large Language Models

被引：0

作者：

Sterbini, Andrea ^{[1
]}

Temperini, Marco ^{[2
]}

机构：

[1] Sapienza Univ Rome, Dept Comp Sci, Rome, Italy

[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Rome, Italy

来源：

GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024 | 2024年 / 14798卷

关键词：

Large Language Models; LLM-based Text Similarity; Peer Assessment; Automated Assessment;

D O I：

10.1007/978-3-031-63028-6_20

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we propose a method to classify the students' textual descriptions of algorithms. This work is based on a wealth of data (programming tasks, related algorithm descriptions, and Peer Assessment data), coming from 6 years of use of the system Q2A, in a "Fundamentals of Computer Programming" course, given at first year in our university's Computer Science curriculum. The descriptions are submitted, as part of the answer to a computer programming task, through Q2A, and are subject to (formative) Peer Assessment. The proposed classification method aims to support the teacher on the analysis of the quite numerous students' descriptions, in ours as well as in other similar systems. We 1) process the students' submissions, by topic automated extraction (BERTopic) and by separate Large Language Models, 2) compute their degree of suitability as "algorithm description", in a scale from BAD to GOOD, and 3) compare the obtained classification with those coming from the teacher's direct assessment (expert: one of the authors), and from the Peer Assessment. The automated classification does correlate with both the expert classification and the grades given by the peers to the "clarity" of the descriptions. This result is encouraging in view of the production of a Q2A subsystem allowing the teacher to analyse the students' submissions guided by an automated classification, and ultimately support fully automated grading.

引用

页码：258 / 271

页数：14

共 50 条

[1] Automated Topic Analysis with Large Language Models
Kirilenko, Andrei
Stepchenkova, Svetlana
INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 2024, ENTER 2024, 2024, : 29 - 34
[2] Leveraging Large Language Models for Automated Dialogue Analysis
Finch, Sarah E.
Paek, Ellie S.
Choi, Jinho D.
24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 202 - 215
[3] Trend Analysis Through Large Language Models
Alzapiedi, Lucas
Bihl, Trevor
IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, NAECON 2024, 2024, : 370 - 374
[4] Improving requirements completeness: automated assistance through large language models
Dipeeka Luitel
Shabnam Hassani
Mehrdad Sabetzadeh
Requirements Engineering, 2024, 29 : 73 - 95
[5] Improving requirements completeness: automated assistance through large language models
Luitel, Dipeeka
Hassani, Shabnam
Sabetzadeh, Mehrdad
REQUIREMENTS ENGINEERING, 2024, 29 (01) : 73 - 95
[6] Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis
Li, Peiyao
Castelo, Noah
Katona, Zsolt
Sarvary, Miklos
MARKETING SCIENCE, 2024, 43 (02) : 254 - 266
[7] Large Language Models for Automated Program Repair
Ribeiro, Francisco
COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
[8] Large Language Models for Automated Program Repair
Ribeiro, Francisco
SPLASH Companion 2023 - Companion Proceedings of the 2023 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, 2023, : 7 - 9
[9] Extracting phenotypes from clinical descriptions using large language models: a comparison between automated and manual approach.
Berardelli, Silvia
Gazzo, Andrea
De Paoli, Federica
Limongelli, Ivan
Rizzo, Ettore
Magni, Paolo
Zucca, Susanna
EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1630 - 1631
[10] Understanding Telecom Language Through Large Language Models
Bariah, Lina
Zou, Hang
Zhao, Qiyang
Mouhouche, Belkacem
Bader, Faouzi
Debbah, Merouane
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6542 - 6547

← 1 2 3 4 5 →