An empirical study on the importance of source code entities for requirements traceability

被引:24
|
作者
Ali, Nasir [1 ]
Sharafi, Zohreh [2 ]
Gueheneuc, Yann-Gael [2 ]
Antoniol, Giuliano [2 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Kingston, ON, Canada
[2] Ecole Polytech, DGIGL, Montreal, PQ H3C 3A7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
LINKS;
D O I
10.1007/s10664-014-9315-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Requirements Traceability (RT) links help developers during program comprehension and maintenance tasks. However, creating RT links is a laborious and resource-consuming task. Information Retrieval (IR) techniques are useful to automatically create traceability links. However, IR-based techniques typically have low accuracy (precision, recall, or both) and thus, creating RT links remains a human intensive process. We conjecture that understanding how developers verify RT links could help improve the accuracy of IR-based RT techniques to create RT links. Consequently, we perform an empirical study consisting of four case studies. First, we use an eye-tracking system to capture developers' eye movements while they verify RT links. We analyse the obtained data to identify and rank developers' preferred types of Source Code Entities (SCEs), e.g., domain vs. implementation-level source code terms and class names vs. method names. Second, we perform another eye-tracking case study to confirm that it is the semantic content of the developers' preferred types of SCEs and not their locations that attract developers' attention and help them in their task to verify RT links. Third, we propose an improved term weighting scheme, i.e., Developers Preferred Term Frequency/Inverse Document Frequency (D P T F / I D F), that uses the knowledge of the developers' preferred types of SCEs to give more importance to these SCEs into the term weighting scheme. We integrate thisweighting scheme with an IR technique, i.e., Latent Semantic Indexing (LSI), to create a new technique to RT link recovery. Using three systems (iTrust, Lucene, and Pooka), we show that the proposed technique statistically improves the accuracy of the recovered RT links over a technique based on LSI and the usual Term Frequency/Inverse Document Frequency (T F / I D F) weighting scheme. Finally, we compare the newly proposed D P T F / I D F with our original Domain Or Implementation/Inverse Document Frequency (D O I / I D F) weighting scheme.
引用
收藏
页码:442 / 478
页数:37
相关论文
共 50 条
  • [1] An empirical study on the importance of source code entities for requirements traceability
    Nasir Ali
    Zohreh Sharafi
    Yann-Gaël Guéhéneuc
    Giuliano Antoniol
    Empirical Software Engineering, 2015, 20 : 442 - 478
  • [2] An Empirical Study on Source Code Feature Extraction in Preprocessing of IR-Based Requirements Traceability
    Wang, Bangchao
    Deng, Yang
    Luo, Ruiqi
    Jin, Huan
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 1069 - 1078
  • [3] Supporting Requirements to Code Traceability Creation by Code Comments
    Shen, Guohua
    Wang, Haijuan
    Huang, Zhiqiu
    Yu, YaoShen
    Chen, Kai
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2021, 31 (08) : 1099 - 1118
  • [4] An Empirical Study on Requirements Traceability Using Eye-Tracking
    Ali, Nasir
    Sharafi, Zohreh
    Gueheneuc, Yann-Gael
    Antoniol, Giuliano
    2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 191 - 200
  • [5] Empirical Study of Transformers for Source Code
    Chirkova, Nadezhda
    Troshin, Sergey
    PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 703 - 715
  • [6] Recovering Traceability Links between Requirements and Source Code Using the Configuration Management Log
    Tsuchiya, Ryosuke
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    Kato, Tadahisa
    Kawakami, Masumi
    Yoshimura, Kentaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (04): : 852 - 862
  • [7] Recovering Semantic Traceability between Requirements and Source Code Using Feature Representation Techniques
    Zhang, Meng
    Tao, Chuanqi
    Guo, Hongjing
    Huang, Zhiqiu
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 873 - 882
  • [8] Constructing Traceability Links between Software Requirements and Source Code Based on Neural Networks
    Dai, Peng
    Yang, Li
    Wang, Yawen
    Jin, Dahai
    Gong, Yunzhan
    MATHEMATICS, 2023, 11 (02)
  • [9] Can method data dependencies support the assessment of traceability between requirements and source code?
    Kuang, Hongyu
    Maeder, Patrick
    Hu, Hao
    Ghabi, Achraf
    Huang, LiGuo
    Lue, Jian
    Egyed, Alexander
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2015, 27 (11) : 838 - 866
  • [10] Do Data Dependencies in Source Code complement Call Dependencies for Understanding Requirements Traceability?
    Kuang, Hongyu
    Maeder, Patrick
    Hu, Hao
    Ghabi, Achraf
    Huang, LiGuo
    Jian, Lv
    Egyed, Alexander
    2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 181 - 190