Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs

Cited by: 0
Authors
Ali, Syed Juned [1]
Naganathan, Varun [2]
Bork, Dominik [1]
Affiliations
[1] TU Wien, Business Informatics Group, Vienna, Austria
[2] Microsoft, Hyderabad, India
Source
CONCEPTUAL MODELING, ER 2024 | 2025 / Volume 15238
Keywords
Large Language Models; LLM; Requirements Traceability; Retrieval Augmented Generation; Requirements Engineering
DOI
10.1007/978-3-031-75872-0_16
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Software Engineering aims to effectively translate stakeholders' requirements into executable code to fulfill their needs. Traceability from natural language use case requirements to classes in a UML class diagram, subsequently translated into code implementation, is essential in systems development and maintenance. Tasks such as assessing the impact of changes and enhancing software reusability require a clear link between these requirements and their software implementation. However, establishing such links manually across extensive codebases is prohibitively challenging. Requirements, typically articulated in natural language, embody semantics that clarify the purpose of the codebase. Conventional traceability methods, relying on textual similarities between requirements and code, often suffer from low precision due to the semantic gap between high-level natural language requirements and the syntactic nature of code. The advent of Large Language Models (LLMs) provides new methods to address this challenge through their advanced capability to interpret both natural language and code syntax. Furthermore, representing code as a knowledge graph facilitates the use of graph structural information to enhance traceability links. This paper introduces an LLM-supported retrieval augmented generation approach for enhancing requirements traceability to the class diagram of the code, incorporating keyword, vector, and graph indexing techniques, and their integrated application. We present a comparative performance analysis against conventional methods and across different indexing strategies and parameterizations. Our results demonstrate how this methodology significantly improves the efficiency and accuracy of establishing traceability links in software development processes.
Pages: 295 - 314
Page count: 20
Related papers
50 in total
  • [41] Feature Extraction from Japanese Natural Language Requirements Documents for Software Product Line Engineering
    Hisazumi, Kenji
    Xiao, Yuedong
    Fukuda, Akira
    2019 COMPANION OF THE 19TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS-C 2019), 2019, : 322 - 329
  • [42] Deep-transfer learning inspired natural language processing system for software requirements classification
    Saqib, Mohd
    Mustaqeem, Mohd
    Jawed, Md Saquib
    Abdulaziz, Alsolami
    Khan, Anish
    Khan, Jeeshan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, : 839 - 861
  • [43] Comparing the Use of Research Resource Identifiers and Natural Language Processing for Citation of Databases, Software, and Other Digital Artifacts
    Hsu, Chun-Nan
    Bandrowski, Anita E.
    Gillespie, Thomas H.
    Udell, Jon
    Lin, Ko-Wei
    Ozyurt, Ibrahim Burak
    Grethe, Jeffrey S.
    Martone, Maryann E.
    COMPUTING IN SCIENCE & ENGINEERING, 2020, 22 (02) : 22 - 32
  • [44] Recovering Traceability Links between a Simple Natural Language Sentence and Source Code Using Domain Ontologies
    Yoshikawa, Takashi
    Hayashi, Shinpei
    Saeki, Motoshi
    2009 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, CONFERENCE PROCEEDINGS, 2009, : 551 - 554
  • [45] Req2Spec: Transforming Software Requirements into Formal Specifications Using Natural Language Processing
    Nayak, Anmol
    Timmapathini, Hari Prasad
    Murali, Vidhya
    Ponnalagu, Karthikeyan
    Venkoparao, Vijendran Gopalan
    Post, Amalinda
    REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, REFSQ 2022, 2022, 13216 : 87 - 95
  • [46] Generating model-based test cases from natural language requirements for space application software
    Valdivino Alexandre de Santiago Júnior
    Nandamudi Lankalapalli Vijaykumar
    Software Quality Journal, 2012, 20 : 77 - 143
  • [47] Generating model-based test cases from natural language requirements for space application software
    de Santiago Junior, Valdivino Alexandre
    Vijaykumar, Nandamudi Lankalapalli
    SOFTWARE QUALITY JOURNAL, 2012, 20 (01) : 77 - 143
  • [48] Feature extraction approaches from natural language requirements for reuse in software product lines: A systematic literature review
    Bakar, Noor Hasrina
    Kasirun, Zarinah M.
    Salleh, Norsaremah
    JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 106 : 132 - 149
  • [49] Two Rule-Based Natural Language Strategies for Requirements Discovery and Classification in Open Source Software Development Projects
    Vlas, Radu E.
    Robinson, William N.
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2012, 28 (04) : 11 - 38
  • [50] A Novel Natural Language Processing (NLP) Approach to Automatically Generate Conceptual Class Model from Initial Software Requirements
    Ahmed, Mudassar Adeel
    Butt, Wasi Haider
    Ahsan, Imran
    Anwar, Muhammad Waseem
    Latif, Muhammad
    Azam, Farooque
    INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 476 - 484