Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs

被引:0
|
作者
Ali, Syed Juried [1 ]
Naganathan, Varun [2 ]
Bork, Dominik [1 ]
机构
[1] TU Wien, Business Informat Grp, Vienna, Austria
[2] Microsoft, Hyderabad, India
来源
CONCEPTUAL MODELING, ER 2024 | 2025年 / 15238卷
关键词
Large Language Models; LLM; Requirements Traceability; Retrieval Augmented Generation; Requirements Engineering;
D O I
10.1007/978-3-031-75872-0_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software Engineering aims to effectively translate stakeholders' requirements into executable code to fulfill their needs. Traceability from natural language use case requirements to classes in a UML class diagram, subsequently translated into code implementation, is essential in systems development and maintenance. Tasks such as assessing the impact of changes and enhancing software reusability require a clear link between these requirements and their software implementation. However, establishing such links manually across extensive codebases is prohibitively challenging. Requirements, typically articulated in natural language, embody semantics that clarify the purpose of the codebase. Conventional traceability methods, relying on textual similarities between requirements and code, often suffer from low precision due to the semantic gap between high-level natural language requirements and the syntactic nature of code. The advent of Large Language Models (LLMs) provides new methods to address this challenge through their advanced capability to interpret both natural language and code syntax. Furthermore, representing code as a knowledge graph facilitates the use of graph structural information to enhance traceability links. This paper introduces an LLM-supported retrieval augmented generation approach for enhancing requirements traceability to the class diagram of the code, incorporating keyword, vector, and graph indexing techniques, and their integrated application. We present a comparative analysis against conventional methods and among different indexing strategies and parameterizations on the performance. Our results demonstrate how this methodology significantly improves the efficiency and accuracy of establishing traceability links in software development processes.
引用
收藏
页码:295 / 314
页数:20
相关论文
共 50 条
  • [31] Software Requirements Elicitation - A Controlled Experiment to Measure the Impact of a Native Natural Language
    Mahmood, Sajjad
    Ajila, Samuel A.
    2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2013, : 437 - 442
  • [32] A Systematic Literature Review on Using Natural Language Processing in Software Requirements Engineering
    Necula, Sabina-Cristiana
    Dumitriu, Florin
    Greavu-Serban, Valerica
    ELECTRONICS, 2024, 13 (11)
  • [33] Extracting concepts from the software requirements specification using natural language processing
    Kuchta, Jaroslaw
    Padhiyar, Priti
    2018 11TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2018, : 443 - 448
  • [34] Generation of Test Cases from Software Requirements Using Natural Language Processing
    Verma, Ravi Prakash
    Beg, Md. Rizwan
    2013 SIXTH INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2013), 2013, : 140 - 147
  • [35] Software Requirements Translation from Natural Language to Object-Oriented Model
    Fatwanto, Agung
    2012 IEEE CONFERENCE ON CONTROL, SYSTEMS & INDUSTRIAL INFORMATICS (ICCSII), 2012, : 191 - +
  • [36] Building Traceability Between Functional Requirements and Component Architecture Elements in Embedded Software Using Structured Features
    Yoo, Insun
    Park, Hyoseung
    Lee, Seok-Won
    Ryu, Ki-Yeol
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [37] Model-Based Methodology Establishing Traceability Between Requirements, Design and Operation Information in Lifecycle-Oriented Architecture
    Ishibashi, Shota
    Hisazumi, Kenji
    Nakanishi, Tsuneo
    Fukuda, Akira
    NEW TRENDS IN E-SERVICE AND SMART COMPUTING, 2018, 742 : 47 - 63
  • [38] Analyzing Competences in Software Testing: Combining Thematic Analysis with Natural Language Processing (NLP)
    Rahman, Tajmilur
    Nwokeji, Joshua
    Matovu, Richard
    Frezza, Stephen
    Sugnanam, Harika
    Pisolkar, Aparna
    2021 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE 2021), 2021,
  • [39] Building decision support problem domain ontology from natural language requirements for software assurance
    Lee, Seok-Won
    Muthurajant, Divya
    Gandhi, Robin A.
    Yavagal, Deepak
    Ahn, Gail-Joon
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2006, 16 (06) : 851 - 884
  • [40] ONTOLOGY-BASED INTELLIGENT AGENT FOR SEMANTIC PARSING THE NATURAL LANGUAGE SPECIFICATIONS OF SOFTWARE REQUIREMENTS
    Hovorushchenko, Tetiana
    Boyarchuk, Artem
    Pavlova, Olga
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2019, 11 (02): : 59 - 70