Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs

被引:0
|
作者
Ali, Syed Juried [1 ]
Naganathan, Varun [2 ]
Bork, Dominik [1 ]
机构
[1] TU Wien, Business Informat Grp, Vienna, Austria
[2] Microsoft, Hyderabad, India
来源
CONCEPTUAL MODELING, ER 2024 | 2025年 / 15238卷
关键词
Large Language Models; LLM; Requirements Traceability; Retrieval Augmented Generation; Requirements Engineering;
D O I
10.1007/978-3-031-75872-0_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software Engineering aims to effectively translate stakeholders' requirements into executable code to fulfill their needs. Traceability from natural language use case requirements to classes in a UML class diagram, subsequently translated into code implementation, is essential in systems development and maintenance. Tasks such as assessing the impact of changes and enhancing software reusability require a clear link between these requirements and their software implementation. However, establishing such links manually across extensive codebases is prohibitively challenging. Requirements, typically articulated in natural language, embody semantics that clarify the purpose of the codebase. Conventional traceability methods, relying on textual similarities between requirements and code, often suffer from low precision due to the semantic gap between high-level natural language requirements and the syntactic nature of code. The advent of Large Language Models (LLMs) provides new methods to address this challenge through their advanced capability to interpret both natural language and code syntax. Furthermore, representing code as a knowledge graph facilitates the use of graph structural information to enhance traceability links. This paper introduces an LLM-supported retrieval augmented generation approach for enhancing requirements traceability to the class diagram of the code, incorporating keyword, vector, and graph indexing techniques, and their integrated application. We present a comparative analysis against conventional methods and among different indexing strategies and parameterizations on the performance. Our results demonstrate how this methodology significantly improves the efficiency and accuracy of establishing traceability links in software development processes.
引用
收藏
页码:295 / 314
页数:20
相关论文
共 50 条
  • [1] Review of Research on Requirements Traceability Approaches for Different Software Artifacts
    Tao C.-Q.
    Zhang M.
    Guo H.-J.
    Huang Z.-Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (11): : 2393 - 2419
  • [2] Combining Textual and Structural Analysis of Software Artifacts for Traceability Link Recovery
    McMillan, Collin
    Poshyvanyk, Denys
    Revelle, Meghan
    2009 ICSE WORKSHOP ON TRACEABILITY IN EMERGING FORMS OF SOFTWARE ENGINEERING (TEFSE), 2009, : 41 - 48
  • [3] An Approach to Generate the Traceability Between Restricted Natural Language Requirements and AADL Models
    Wang, Fei
    Yang, Zhi-Bin
    Huang, Zhi-Qiu
    Liu, Cheng-Wei
    Zhou, Yong
    Bodeveix, Jean-Paul
    Filali, Mamoun
    IEEE TRANSACTIONS ON RELIABILITY, 2020, 69 (01) : 154 - 173
  • [4] Exploring Semantics of Software Artifacts to Improve Requirements Traceability Recovery: A Hybrid Approach
    Wang, Shiheng
    Li, Tong
    Yang, Zhen
    2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 39 - 46
  • [5] Towards Gamification in Software Traceability: Between Test and Code Artifacts
    Parizi, Reza Meimandi
    Kasem, Asem
    Abdullah, Azween
    2015 10TH INTERNATIONAL JOINT CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), VOL 1, 2015, : 393 - 400
  • [6] Analyzing Natural-Language Artifacts of the Software Process
    Hasan, Maryam
    Stroulia, Eleni
    Barbosa, Denilson
    Alalfi, Manar
    2010 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, 2010,
  • [7] Support for Traceability Management of Software Artefacts using Natural Language Processing
    Arunthavanathan, A.
    Shanmugathasan, S.
    Ratnavel, S.
    Thiyagarajah, V.
    Perera, I.
    Meedeniya, D.
    Balasubramaniam, D.
    2ND INTERNATIONAL MERCON 2016 MORATUWA ENGINEERING RESEARCH CONFERENCE, 2016, : 18 - 23
  • [8] Applications of natural language processing in software traceability: A systematic mapping study?
    Pauzi, Zaki
    Capiluppi, Andrea
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 198
  • [9] Software requirements as an application domain for natural language processing
    Diamantopoulos, Themistoklis
    Roth, Michael
    Symeonidis, Andreas
    Klein, Ewan
    LANGUAGE RESOURCES AND EVALUATION, 2017, 51 (02) : 495 - 524
  • [10] Ambiguity in Natural Language Software Requirements: A Case Study
    de Bruijn, Fabian
    Dekkers, Hans L.
    REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, 2010, 6182 : 233 - 247