LLMs for science: Usage for code generation and data analysis

被引：3

作者：

Nejjar, Mohamed ^{[1
]}

Zacharias, Luca ^{[1
]}

Stiehle, Fabian ^{[1
]}

Weber, Ingo ^{[1
,2
]}

机构：

[1] Tech Univ Munich, Sch Computat Informat & Technol, Munich, Germany

[2] Fraunhofer Gesell, Munich, Germany

来源：

JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS | 2025年 / 37卷 / 01期

关键词：

artificial intelligence; code generation; data analysis; GenAI4Science; large language models; LLMs4Science; research methods;

D O I：

10.1002/smr.2723

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Large language models (LLMs) have been touted to enable increased productivity in many areas of today's work life. Scientific research as an area of work is no exception: The potential of LLM-based tools to assist in the daily work of scientists has become a highly discussed topic across disciplines. However, we are only at the very onset of this subject of study. It is still unclear how the potential of LLMs will materialize in research practice. With this study, we give first empirical evidence on the use of LLMs in the research process. We have investigated a set of use cases for LLM-based tools in scientific research and conducted a first study to assess to which degree current tools are helpful. In this position paper, we report specifically on use cases related to software engineering, specifically, on generating application code and developing scripts for data analytics and visualization. While we studied seemingly simple use cases, results across tools differ significantly. Our results highlight the promise of LLM-based tools in general, yet we also observe various issues, particularly regarding the integrity of the output these tools provide.

引用

页数：7

共 50 条

[41] Beyond open science: Data, code, and causality
Wolf, Levi John
ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2023, 50 (09) : 2333 - 2336
[42] Next Generation Vulnerability Detection with LLMs
Dalla Preda, Mila
Marastoni, Niccolo
Paci, Federica
ERCIM NEWS, 2024, (139):
[43] Frustrated with Code Quality Issues? LLMs can Help!
Wadhwa, Nalin
Pradhan, Jui
Sonwane, Atharv
Sahu, Surya Prakash
Natarajan, Nagarajan
Kanade, Aditya
Parthasarathy, Suresh
Rajamani, Sriram
arXiv, 2023,
[44] FlowMind: Automatic Workflow Generation with LLMs
Zeng, Zhen
Watson, William
Cho, Nicole
Rahimi, Saba
Reynolds, Shayleen
Balch, Tucker
Veloso, Manuela
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 73 - 81
[45] Automated Assessment of Students' Code Comprehension using LLMs
Oli, Priti
Banjade, Rabin
Chapagain, Jeevan
Rus, Vasile
AI FOR EDUCATION WORKSHOP, 2024, 257 : 118 - 128
[46] Evaluating LLMs for visualization generation and understanding
Saadiq Rauf Khan
Vinit Chandak
Sougata Mukherjea
Discover Data, 3 (1):
[47] The next generation of experimental research with LLMs
Charness, Gary
Jabarian, Brian
List, John A.
NATURE HUMAN BEHAVIOUR, 2025,
[48] Identifying Gaps in Students' Explanations of Code Using LLMs
Banjade, Rabin
Oli, Priti
Sajib, Mahmudul Islam
Rus, Vasile
ARTIFICIAL INTELLIGENCE IN EDUCATION, PT II, AIED 2024, 2024, 14830 : 268 - 275
[49] ANALYSIS OF VARIANCE OF USAGE DATA
PEDERSEN, OA
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1973, AES9 (05) : 805 - 805
[50] Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation
Kasner, Zdenek
Dusek, Ondrej
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12045 - 12072

← 1 2 3 4 5 →