Trusta: Reasoning about assurance cases with formal methods and large language models

被引:0
|
作者
Chen, Zezhong [1 ]
Deng, Yuxin [1 ]
Du, Wenjie [2 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai 200062, Peoples R China
[2] Shanghai Normal Univ, Shanghai 200233, Peoples R China
基金
中国国家自然科学基金;
关键词
Assurance cases; Trustworthiness derivation trees; Large language models; Formal methods; Constraint solving;
D O I
10.1016/j.scico.2025.103288
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Assurance cases can be used to argue for the safety of products in safety engineering. In safety-critical areas, the construction of assurance cases is indispensable. We introduce the Trustworthiness Derivation Tree Analyzer (Trusta), a tool designed to enhance the development and evaluation of assurance cases by integrating formal methods and large language models (LLMs). The tool incorporates a Prolog interpreter and solvers like Z3 and MONA to handle various constraint types, enhancing the precision and efficiency of assurance case assessment. Beyond traditional formal methods, Trusta harnesses the power of LLMs including ChatGPT-3.5, ChatGPT4, and PaLM 2, assisting humans in the development of assurance cases and the writing of formal constraints. Our evaluation, through qualitative and quantitative analyses, shows Trusta's impact on improving assurance case quality and efficiency. Trusta enables junior engineers to reach the skill level of experienced safety experts, narrowing the expertise gap and greatly benefiting those with limited experience. Case studies, including automated guided vehicles (AGVs), demonstrate Trusta's effectiveness in identifying subtle issues and improving the overall trustworthiness of complex systems.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Integrated Formal Methods for Constructing Assurance Cases
    Carlan, Carmen
    Beyene, Tewodros A.
    Ruess, Harald
    2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW), 2016, : 221 - 228
  • [2] Reasoning About Confidence and Uncertainty in Assurance Cases: A Survey
    Duan, Lian
    Rayadurgam, Sanjai
    Heimdahl, Mats P. E.
    Ayoub, Anaheed
    Sokolsky, Oleg
    Lee, Insup
    SOFTWARE ENGINEERING IN HEALTH CARE, SEHC 2014, 2017, 9062 : 64 - 80
  • [3] Automatic instantiation of assurance cases from patterns using large language models
    Odu, Oluwafemi
    Belle, Alvine B.
    Wang, Song
    Kpodjedo, Segla
    Lethbridge, Timothy C.
    Hemmati, Hadi
    JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 222
  • [4] Formal reasoning about hardware and software memory models
    Roychoudhury, A
    FORMAL METHODS AND SOFTWARE ENGINEERING, PROCEEDINGS, 2002, 2495 : 423 - 434
  • [5] Editorial: Beyond formal models of reasoning about explanations
    Koslowski, Barbara
    Douven, Igor
    Masnick, Amy
    Krzyzanowska, Karolina
    Chinn, Clark
    Morris, Bradley
    FRONTIERS IN PSYCHOLOGY, 2024, 15
  • [6] Large Language Models Are Reasoning Teachers
    Ho, Namgyu
    Schmid, Laura
    Yun, Se-Young
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14852 - 14882
  • [7] Dehallucinating Large Language Models Using Formal Methods Guided Iterative Prompting
    Jha, Susmit
    Jha, Sumit Kumar
    Lincoln, Patrick
    Bastian, Nathaniel D.
    Velasquez, Alvaro
    Neema, Sandeep
    2023 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY, ICAA, 2023, : 149 - 152
  • [8] PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change
    Valmeekam, Karthik
    Marquez, Matthew
    Olmo, Alberto
    Sreedharan, Sarath
    Kambhampati, Subbarao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Towards Reasoning in Large Language Models: A Survey
    Huang, Jie
    Chang, Kevin Chen-Chuan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1049 - 1065
  • [10] Conversations on reasoning: Large language models in diagnosis
    Restrepo, Daniel
    Rodman, Adam
    Abdulnour, Raja-Elie
    JOURNAL OF HOSPITAL MEDICINE, 2024, 19 (08) : 731 - 735