A Novel Source Code Clone Detection Method Based on Dual-GCN and IVHFS

Cited by: 1
Authors
Yang, Haixin [1]
Li, Zhen [1]
Guo, Xinyu [1]
Affiliations
[1] Hebei University, School of Cyber Security and Computer, Baoding 071002, People's Republic of China
Keywords
source code clone detection; software engineering; graph convolutional network; interval-valued hesitant fuzzy set; abstract syntax tree; control flow graph
DOI
10.3390/electronics12061315
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Source code clone detection, which identifies code fragments with similar functions, plays a significant role in software development and quality assurance. Existing methods either extract only one kind of information (syntactic or semantic) or ignore the associations between code statements in different structures, which makes it difficult for them to detect clone pairs with similar functions effectively. In this paper, we propose a new model based on a dual graph convolutional network (GCN) and an interval-valued hesitant fuzzy set (IVHFS), which we named DG-IVHFS. Specifically, we simplified and grouped the abstract syntax tree (AST) of the source code to obtain group representations. The group representations of the AST and the control flow graph (CFG) representations were transformed into graph structures, and we then applied GCNs to them to learn the dependencies between nodes. In addition, we introduced IVHFS into the model for a more comprehensive evaluation of similarity. Our experimental results demonstrated that the precision, recall, and F1-score of DG-IVHFS reached 98%, 97%, and 97% on the BigCloneBench dataset and 98%, 93%, and 95% on the GoogleCodeJam dataset, exceeding current state-of-the-art models. Moreover, our model performed well in terms of time consumption.
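To make the phrase "applied GCNs to learn dependencies between nodes" concrete, the following is a minimal sketch of one generic graph-convolution step, using the widely used symmetric-normalization rule ReLU(D^-1/2 (A + I) D^-1/2 X W). It is not the DG-IVHFS architecture: the three-node graph, the one-hot features, the identity weight matrix, and the function names gcn_layer and matmul are all assumptions chosen for illustration.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def gcn_layer(adj, feats, weight):
    """One graph-convolution step: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    n = len(adj)
    # Add self-loops so every node also keeps its own features.
    a_hat = [
        [adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)
    ]
    # Node degrees in the self-looped graph.
    deg = [sum(row) for row in a_hat]
    # Symmetric normalization of the adjacency matrix.
    norm = [
        [a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
        for i in range(n)
    ]
    # Aggregate neighbor features, then apply the (here hypothetical) weights.
    out = matmul(matmul(norm, feats), weight)
    # ReLU non-linearity.
    return [[max(0.0, v) for v in row] for row in out]


# Hypothetical toy graph: three statements connected in a chain 0 - 1 - 2,
# standing in for nodes of an AST group graph or a CFG.
adj = [
    [0.0, 1.0, 0.0],
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]
identity = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

# One-hot node features and identity weights (illustrative only), so the
# output equals the normalized adjacency matrix itself.
out = gcn_layer(adj, identity, identity)
```

With identity features and weights, each output row shows how strongly a node mixes in its neighbors: node 0 draws on itself and node 1 but not on node 2, which it would only reach through a second layer.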
Pages: 18