LLMs for science: Usage for code generation and data analysis

Cited by: 3
Authors
Nejjar, Mohamed [1]
Zacharias, Luca [1]
Stiehle, Fabian [1]
Weber, Ingo [1, 2]
Affiliations
[1] Tech Univ Munich, Sch Computat Informat & Technol, Munich, Germany
[2] Fraunhofer Gesell, Munich, Germany
Keywords
artificial intelligence; code generation; data analysis; GenAI4Science; large language models; LLMs4Science; research methods
DOI
10.1002/smr.2723
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Large language models (LLMs) have been touted to enable increased productivity in many areas of today's work life. Scientific research as an area of work is no exception: the potential of LLM-based tools to assist in the daily work of scientists has become a highly discussed topic across disciplines. However, we are only at the very onset of this subject of study, and it is still unclear how the potential of LLMs will materialize in research practice. With this study, we give first empirical evidence on the use of LLMs in the research process. We investigated a set of use cases for LLM-based tools in scientific research and conducted a first study to assess to what degree current tools are helpful. In this position paper, we report on use cases related to software engineering, specifically on generating application code and developing scripts for data analytics and visualization. While we studied seemingly simple use cases, results across tools differ significantly. Our results highlight the promise of LLM-based tools in general, yet we also observe various issues, particularly regarding the integrity of the output these tools provide.
Pages: 7