Prompting GPT-4 to support automatic safety case generation

被引:0
|
作者
Sivakumar, Mithila [1 ]
Belle, Alvine B. [1 ]
Shan, Jinjun [1 ]
Shahandashti, Kimya Khakzad [1 ]
机构
[1] York Univ, Lassonde Sch Engn, 4700 Keele St, Toronto, ON M3J 1P3, Canada
关键词
Safety cases; Safety assurance; Machine learning; Large language models; Generative AI; Requirements engineering;
D O I
10.1016/j.eswa.2024.124653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the ever-evolving field of software engineering, the advent of large language models and conversational interfaces, exemplified by ChatGPT, represents a significant revolution. While their potential is evident in various domains, this paper expands upon our previous research, where we experimented with GPT -4, on its ability to create safety cases. A safety case is a structured argument supported by a body of evidence to demonstrate that a given system is safe to operate in a given environment. In this paper, we first determine GPT -4's comprehension of the Goal Structuring Notation (GSN), a well-established notation for visually representing safety cases. Additionally, we conduct four distinct experiments using GPT -4 to evaluate its ability to generate safety cases within a specified system and application domain. To assess GPT -4's performance in this context, we compare the results it produces with the ground-truth safety cases developed for an X-ray system, a machine learning-enabled component for tire noise recognition in a vehicle, and a lane management system from the automotive domain. This comparison enables us to gain valuable insights into the model's generative capabilities. Our findings indicate that GPT -4 is able to generate moderately accurate and reasonable safety cases.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Exploring the capabilities of large language models for the generation of safety cases: the case of GPT-4
    Sivakumar, Mithila
    Belle, Alvine Boaye
    Shan, Jinjun
    Shahandashti, Kimya Khakzad
    32ND INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW 2024, 2024, : 35 - 45
  • [2] Evaluating GPT-4 on Impressions Generation in Radiology Reports
    Sun, Zhaoyi
    Ong, Hanley
    Kennedy, Patrick
    Tang, Liyan
    Chen, Shirley
    Elias, Jonathan
    Lucas, Eugene
    Shih, George
    Peng, Yifan
    RADIOLOGY, 2023, 307 (05)
  • [3] Feedback-Generation for Programming Exercises With GPT-4
    Azaiz, Imen
    Kiesler, Natalie
    Strickroth, Sven
    PROCEEDINGS OF THE 2024 CONFERENCE INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, VOL 1, ITICSE 2024, 2024, : 31 - 37
  • [4] Leveraging GPT-4 for Automatic Translation Post-Editing
    Raunak, Vikas
    Sharaf, Amr
    Wang, Yiren
    Awadalla, Hany Hassan
    Menezes, Arul
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 12009 - 12024
  • [5] Is GPT-4 a reliable rater? Evaluating consistency in GPT-4's text ratings
    Hackl, Veronika
    Mueller, Alexandra Elena
    Granitzer, Michael
    Sailer, Maximilian
    FRONTIERS IN EDUCATION, 2023, 8
  • [6] GPT-4 as a biomedical simulator
    Schaefer M.
    Reichl S.
    ter Horst R.
    Nicolas A.M.
    Krausgruber T.
    Piras F.
    Stepper P.
    Bock C.
    Samwald M.
    Computers in Biology and Medicine, 2024, 178
  • [7] Analysis and prediction in SCR experiments using GPT-4 with an effective chain-of-thought prompting strategy
    Lu, Muyu
    Gao, Fengyu
    Tang, Xiaolong
    Chen, Linjiang
    ISCIENCE, 2024, 27 (04)
  • [8] Case study identification with GPT-4 and implications for mapping studies
    Petersen, Kai
    INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 171
  • [9] GPT-4 DRIVEN CINEMATIC MUSIC GENERATION THROUGH TEXT PROCESSING
    Haseeb, Muhammad Taimoor
    Hammoudeh, Ahmad
    Xia, Gus
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6995 - 6999
  • [10] Evaluating GPT-4 as an academic support tool for clinicians: A comparative analysis of case records from the literature
    Fonseca Magalhaes Filho, M. A.
    Aguiar Junior, P. N.
    Fabre, B. L.
    Marques, F.
    Gutierres, B.
    William Junior, W. Nassib
    Del Giglio, A.
    ANNALS OF ONCOLOGY, 2023, 34 : S729 - S729