Identifying the severity of technical debt issues based on semantic and structural information

被引:0
|
作者
Dongjin Yu
Sicheng Li
Xin Chen
Tian Sun
机构
[1] Hangzhou Dianzi University,School of Computer Science and Technology
[2] Hangzhou Dianzi University,HDU
来源
Software Quality Journal | 2023年 / 31卷
关键词
Technical debt; Technical debt issues; Severity identification; Semantic information; Structural information; Code analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Technical debt (TD) refers to the phenomenon that developers choose a compromise solution from a short-term benefit perspective during design or architecture selection. TD-related issues, such as code smells, may have a critical impact on important non-functional requirements. Different severity levels of TD issues require different measures to be taken by developers in the future. Existing studies mainly focus on detecting TD in software projects through source code or comments, but usually ignore the severity degree of TD issues. As a matter of fact, it is very important to identify the severity of TD issues and clarify which TD should be prioritized. In this paper, we propose an approach that combines the semantic and structural information of the code snippets to identify their severity at method level. In the approach, we first transform each method affected by TD issues into an abstract syntax tree (AST) and use the paths in the AST to represent its semantic information. Then, we extract different code metrics to measure the size, coupling, and complexity of methods affected by TD issues to represent their structural information. Finally, we build a stacking ensemble model to identify the severity of TD issues by using Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) for the base classifiers and Support Vector Machine (SVM) for the meta-classifier. The evaluation results on the real dataset show that our approach achieves 65.77% in terms of precision, 68.18% in terms of recall, and 65.84% in terms of F1-score on average. In addition, the experimental results also demonstrate that the strategy of combining the semantic and structural information of code snippets is effective in improving the effectiveness of our approach.
引用
收藏
页码:1499 / 1526
页数:27
相关论文
共 50 条
  • [1] Identifying the severity of technical debt issues based on semantic and structural information
    Yu, Dongjin
    Li, Sicheng
    Chen, Xin
    Sun, Tian
    SOFTWARE QUALITY JOURNAL, 2023, 31 (04) : 1499 - 1526
  • [2] Identifying and formulating information requirements based on semantic theories of information
    Wang, Sij-Fen
    Feng, Jun-Kang
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 4080 - +
  • [3] Comments or Issues: Where to Document Technical Debt
    Xavier, Laerte
    Montandon, Joao Eduardo
    Valente, Marco Tulio
    IEEE SOFTWARE, 2022, 39 (05) : 84 - 91
  • [4] On the documentation of self-admitted technical debt in issues
    Laerte Xavier
    João Eduardo Montandon
    Fabio Ferreira
    Rodrigo Brito
    Marco Tulio Valente
    Empirical Software Engineering, 2022, 27
  • [5] On the documentation of self-admitted technical debt in issues
    Xavier, Laerte
    Montandon, Joao Eduardo
    Ferreira, Fabio
    Brito, Rodrigo
    Valente, Marco Tulio
    EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (07)
  • [6] Patent document categorization based on semantic structural information
    Kim, Jae-Ho
    Choi, Key-Sun
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (05) : 1200 - 1215
  • [7] Debt issues and capital structure with soft information
    Jiao, Yawen
    MANAGERIAL FINANCE, 2009, 36 (01) : 4 - 21
  • [8] Identifying Technical Debt through a Code Comment Mining Tool
    Farias, Mario Andre de F.
    Xisto, Railan
    Santos, Marcos S.
    Fontes, Raphael S.
    Colaco Junior, Methanias
    Spinola, Rodrigo
    Mendonca, Manoel
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [9] Identifying and Estimating Technical Debt for Service Composition in SaaS Cloud
    Kumar, Satish
    Bahsoon, Rami
    Chen, Tao
    Buyya, Rajkumar
    2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, : 121 - 125
  • [10] A Contextualized Vocabulary Model for Identifying Technical Debt on Code Comments
    de Freitas Farias, Mario Andre
    da Silva, Andre Batista
    de Mendonca Neto, Manoel Gomes
    Spinola, Rodrigo Oliveira
    2015 IEEE 7TH INTERNATIONAL WORKSHOP ON MANAGING TECHNICAL DEBT (MTD) PROCEEDINGS, 2015, : 25 - 32