Trusta: Reasoning about assurance cases with formal methods and large language models

被引：0

作者：

Chen, Zezhong ^{[1
]}

Deng, Yuxin ^{[1
]}

Du, Wenjie ^{[2
]}

机构：

[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai 200062, Peoples R China

[2] Shanghai Normal Univ, Shanghai 200233, Peoples R China

来源：

SCIENCE OF COMPUTER PROGRAMMING | 2025年 / 244卷

基金：

中国国家自然科学基金;

关键词：

Assurance cases; Trustworthiness derivation trees; Large language models; Formal methods; Constraint solving;

D O I：

10.1016/j.scico.2025.103288

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Assurance cases can be used to argue for the safety of products in safety engineering. In safety-critical areas, the construction of assurance cases is indispensable. We introduce the Trustworthiness Derivation Tree Analyzer (Trusta), a tool designed to enhance the development and evaluation of assurance cases by integrating formal methods and large language models (LLMs). The tool incorporates a Prolog interpreter and solvers like Z3 and MONA to handle various constraint types, enhancing the precision and efficiency of assurance case assessment. Beyond traditional formal methods, Trusta harnesses the power of LLMs including ChatGPT-3.5, ChatGPT4, and PaLM 2, assisting humans in the development of assurance cases and the writing of formal constraints. Our evaluation, through qualitative and quantitative analyses, shows Trusta's impact on improving assurance case quality and efficiency. Trusta enables junior engineers to reach the skill level of experienced safety experts, narrowing the expertise gap and greatly benefiting those with limited experience. Case studies, including automated guided vehicles (AGVs), demonstrate Trusta's effectiveness in identifying subtle issues and improving the overall trustworthiness of complex systems.

引用

页数：32

共 50 条

[1] Integrated Formal Methods for Constructing Assurance Cases
Carlan, Carmen
Beyene, Tewodros A.
Ruess, Harald
2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW), 2016, : 221 - 228
[2] Reasoning About Confidence and Uncertainty in Assurance Cases: A Survey
Duan, Lian
Rayadurgam, Sanjai
Heimdahl, Mats P. E.
Ayoub, Anaheed
Sokolsky, Oleg
Lee, Insup
SOFTWARE ENGINEERING IN HEALTH CARE, SEHC 2014, 2017, 9062 : 64 - 80
[3] Automatic instantiation of assurance cases from patterns using large language models
Odu, Oluwafemi
Belle, Alvine B.
Wang, Song
Kpodjedo, Segla
Lethbridge, Timothy C.
Hemmati, Hadi
JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 222
[4] Formal reasoning about hardware and software memory models
Roychoudhury, A
FORMAL METHODS AND SOFTWARE ENGINEERING, PROCEEDINGS, 2002, 2495 : 423 - 434
[5] Editorial: Beyond formal models of reasoning about explanations
Koslowski, Barbara
Douven, Igor
Masnick, Amy
Krzyzanowska, Karolina
Chinn, Clark
Morris, Bradley
FRONTIERS IN PSYCHOLOGY, 2024, 15
[6] Large Language Models Are Reasoning Teachers
Ho, Namgyu
Schmid, Laura
Yun, Se-Young
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14852 - 14882
[7] Dehallucinating Large Language Models Using Formal Methods Guided Iterative Prompting
Jha, Susmit
Jha, Sumit Kumar
Lincoln, Patrick
Bastian, Nathaniel D.
Velasquez, Alvaro
Neema, Sandeep
2023 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY, ICAA, 2023, : 149 - 152
[8] PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change
Valmeekam, Karthik
Marquez, Matthew
Olmo, Alberto
Sreedharan, Sarath
Kambhampati, Subbarao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[9] Towards Reasoning in Large Language Models: A Survey
Huang, Jie
Chang, Kevin Chen-Chuan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1049 - 1065
[10] Conversations on reasoning: Large language models in diagnosis
Restrepo, Daniel
Rodman, Adam
Abdulnour, Raja-Elie
JOURNAL OF HOSPITAL MEDICINE, 2024, 19 (08) : 731 - 735

← 1 2 3 4 5 →