A Comparative Analysis of Clone Detection Techniques on SemanticCloneBench

Cited by: 1
Authors
Rabbani, Sohaib Masood [1 ]
Gulzar, Nabeel Ahmad [1 ]
Arshad, Saad [1 ]
Abid, Shamsa [2 ]
Shamail, Shafay [1 ]
Affiliations
[1] LUMS, Dept Comp Sci, SBASSE, Lahore, Pakistan
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
Source
2022 IEEE 16TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC 2022) | 2022
Keywords
Semantic Clone Detection; SemanticCloneBench; Deep Learning; Semantic Similarity; CodeBERT; Large-Variance Clones;
DOI
10.1109/IWSC55060.2022.00011
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Semantic code clone detection is the detection of functionally similar code fragments that may otherwise be lexically, syntactically, or structurally dissimilar. It has important applications in aspect mining and product line analysis. Detecting semantic code clones accurately is challenging, and various techniques have been proposed. However, these techniques have been evaluated on different datasets, so there is no clear picture of how they perform relative to each other. SemanticCloneBench was recently introduced as a benchmark for semantic clones and makes it possible to evaluate and compare semantic code clone detection techniques on common ground. In this paper, we compare the semantic clone detection performance of three code clone detection techniques, namely FACER-CD, CodeBERT, and NIL, on Java code clones using SemanticCloneBench. FACER-CD detects clones by clustering on API usage similarity, CodeBERT is a deep-learning-based approach that uses a pre-trained programming language model, and NIL is a token-based detector of large-gapped code clones. FACER-CD, NIL, and CodeBERT achieve a recall of 64.3%, 12.7%, and 83.2%, respectively, on SemanticCloneBench. Using all three techniques together on the SemanticCloneBench dataset gives an overall recall of 95.5%, which is currently the best performance achieved on SemanticCloneBench.
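The abstract does not say how the three detectors were combined. One plausible reading, assumed here, is that a benchmark clone pair counts as found if at least one tool reports it, so the combined recall is the recall of the union of the tools' detections. The minimal sketch below illustrates that calculation on made-up data; the function names, tool labels, and pair IDs are hypothetical and the numbers are not taken from the paper.

```python
def recall(detected, reference):
    """Fraction of reference clone pairs that appear in `detected`."""
    if not reference:
        raise ValueError("reference set must not be empty")
    return len(detected & reference) / len(reference)


def combined_recall(detections, reference):
    """Recall of the union of all tools' detections (assumed combination rule)."""
    union = set().union(*detections.values())
    return recall(union, reference)


# Hypothetical toy benchmark: ten clone pairs identified by integer IDs.
reference = set(range(1, 11))

# Hypothetical detections per tool (illustrative only, not the paper's data).
detections = {
    "tool_a": {1, 2, 3, 4, 5, 6},
    "tool_b": {6, 9},
    "tool_c": {1, 2, 3, 4, 5, 6, 7, 8},
}

for name, found in detections.items():
    print(f"{name}: recall = {recall(found, reference):.1%}")
print(f"combined: recall = {combined_recall(detections, reference):.1%}")
```

Under this reading, the combined recall can exceed the best single-tool recall only to the extent that the tools find different pairs, which is consistent with the reported 95.5% being higher than each individual figure.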
Pages: 16-22
Page count: 7