Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs

Authors
Ali, Syed Juned [1]
Naganathan, Varun [2]
Bork, Dominik [1]
Affiliations
[1] TU Wien, Business Informatics Group, Vienna, Austria
[2] Microsoft, Hyderabad, India
Source
CONCEPTUAL MODELING, ER 2024 | 2025 / Vol. 15238
Keywords
Large Language Models; LLM; Requirements Traceability; Retrieval Augmented Generation; Requirements Engineering
DOI
10.1007/978-3-031-75872-0_16
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Software Engineering aims to effectively translate stakeholders' requirements into executable code to fulfill their needs. Traceability from natural language use case requirements to classes in a UML class diagram, subsequently translated into code implementation, is essential in systems development and maintenance. Tasks such as assessing the impact of changes and enhancing software reusability require a clear link between these requirements and their software implementation. However, establishing such links manually across extensive codebases is prohibitively challenging. Requirements, typically articulated in natural language, embody semantics that clarify the purpose of the codebase. Conventional traceability methods, relying on textual similarities between requirements and code, often suffer from low precision due to the semantic gap between high-level natural language requirements and the syntactic nature of code. The advent of Large Language Models (LLMs) provides new methods to address this challenge through their advanced capability to interpret both natural language and code syntax. Furthermore, representing code as a knowledge graph facilitates the use of graph structural information to enhance traceability links. This paper introduces an LLM-supported retrieval augmented generation approach for enhancing requirements traceability to the class diagram of the code, incorporating keyword, vector, and graph indexing techniques, and their integrated application. We compare the approach against conventional methods and analyze how different indexing strategies and parameterizations affect performance. Our results demonstrate how this methodology significantly improves the efficiency and accuracy of establishing traceability links in software development processes.
Pages: 295-314
Page count: 20