Bug localization based on syntactical and semantic information of source code

Authors
YAN Xuefeng [1, 2]
CHENG Shasha [1]
GUO Liqin [3]
Affiliations
[1] College of Computer Science Technology, Nanjing University of Aeronautics and Astronautics
[2] Collaborative Innovation Center of Novel Software Technology and Industrialization
[3] State Key Laboratory of Intelligent Manufacturing System Technology, Beijing Institute of Electronic System Engineering
Funding
National Key Research and Development Program of China
Keywords
bug report; abstract syntax tree; code representation; software bug localization
DOI
Not available
Chinese Library Classification number
TP311.5 [Software engineering]
Abstract
Existing software bug localization models treat the source file as natural language, which loses the syntactic and structural information of the source file. A bug localization model based on syntactic and semantic information of source code is proposed. First, the abstract syntax tree (AST) is divided by node category to obtain a statement sequence. Each statement tree is encoded into a vector to capture lexical and syntactic knowledge at the statement level. Second, the source code is transformed into a vector representation by exploiting the sequential naturalness of the statements. This avoids the vanishing and exploding gradient problems caused by a large AST when the AST is used to represent the source code. Finally, the correlation between bug reports and source files is analyzed comprehensively from three aspects (syntax, semantics, and text) to locate the buggy code. Experiments show that, compared with other standard models, the proposed model improves bug localization performance as measured by mean reciprocal rank (MRR), mean average precision (MAP), and Top N rank.
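The three evaluation measures named in the abstract have standard information-retrieval definitions, although the abstract does not say exactly how the authors computed them. The following is a minimal sketch under stated assumptions: each bug report comes with a ranked list of candidate source files and a set of files known to be buggy, and average precision is normalized by the number of buggy files. The function names, file names, and report IDs are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set


def reciprocal_rank(ranked: List[str], buggy: Set[str]) -> float:
    """Return 1 / rank of the first buggy file, or 0.0 if none is ranked."""
    for position, path in enumerate(ranked, start=1):
        if path in buggy:
            return 1.0 / position
    return 0.0


def average_precision(ranked: List[str], buggy: Set[str]) -> float:
    """Sum precision@k over every rank k holding a buggy file, then divide
    by the number of buggy files (assumed normalization)."""
    if not buggy:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for position, path in enumerate(ranked, start=1):
        if path in buggy:
            hits += 1
            precision_sum += hits / position
    return precision_sum / len(buggy)


def top_n_hit(ranked: List[str], buggy: Set[str], n: int) -> bool:
    """True if at least one buggy file appears among the first n results."""
    return any(path in buggy for path in ranked[:n])


def evaluate(rankings: Dict[str, List[str]],
             truth: Dict[str, Set[str]],
             n: int) -> Dict[str, float]:
    """Average the per-report scores over all bug reports."""
    reports = list(rankings)
    count = len(reports)
    return {
        "MRR": sum(reciprocal_rank(rankings[r], truth[r]) for r in reports) / count,
        "MAP": sum(average_precision(rankings[r], truth[r]) for r in reports) / count,
        "TopN": sum(top_n_hit(rankings[r], truth[r], n) for r in reports) / count,
    }


# Hypothetical data: two bug reports, three candidate files each.
example_rankings = {
    "bug-1": ["A.java", "B.java", "C.java"],
    "bug-2": ["C.java", "A.java", "B.java"],
}
example_truth = {
    "bug-1": {"B.java"},
    "bug-2": {"C.java", "B.java"},
}

if __name__ == "__main__":
    print(evaluate(example_rankings, example_truth, n=1))
```

On this toy data the first report places its only buggy file at rank 2 and the second places its buggy files at ranks 1 and 3, so MRR is 0.75 and the Top 1 rate is 0.5. Real evaluations would use the ranked output of the localization model in place of the hand-written lists.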
Pages: 236-246
Number of pages: 11
Source
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (01): 236-246