Gapped Code Clone Detection with Lightweight Source Code Analysis

被引:0
|
作者
Murakami, Hiroaki [1 ]
Hotta, Keisuke [1 ]
Higo, Yoshiki [1 ]
Igaki, Hiroshi [1 ]
Kusumoto, Shinji [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
关键词
Code Clone; Program Analysis; Software Maintenance; Tool Comparison; SYSTEM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A variety of methods detecting code clones has been proposed before. In order to detect gapped code clones, AST-based technique, PDG-based technique, metric-based technique and text-based technique using the LCS algorithm have been proposed. However, each of those techniques has limitations. For example, existing AST-based techniques and PDG-based techniques require costs for transforming source files into intermediate representations such as ASTs or PDGs and comparing them. Existing metric-based techniques and text-based techniques using the LCS algorithm cannot detect code clones if methods or blocks are partially duplicated. This paper proposes a new method that detects gapped code clones using the Smith-Waterman algorithm to resolve those limitations. The Smith-Waterman algorithm is an algorithm for identifying similar alignments between two sequences even if they include some gaps. The authors developed the proposed method as a software tool named CDSW, and confirmed that the proposed method could resolve the limitations by conducting a quantitative evaluation with Bellon's benchmark.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 50 条
  • [21] Code Clone Detection using Wavelets
    Karus, Siim
    Kilgi, Karl
    2015 IEEE 9TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2015, : 8 - 14
  • [22] CCCD: Concolic Code Clone Detection
    Krutz, Daniel E.
    Shihab, Emad
    2013 20TH WORKING CONFERENCE ON REVERSE ENGINEERING (WCRE), 2013, : 489 - 490
  • [23] Challenges in Behavioral Code Clone Detection
    Su, Fang-Hsiang
    Bell, Jonathan
    Kaiser, Gail
    2016 IEEE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), VOL 3, 2016, : 21 - 22
  • [24] On Precision of Code Clone Detection Tools
    Farmahinifarahani, Farima
    Saini, Vaibhav
    Yang, Di
    Sajnani, Hitesh
    Lopes, Cristina V.
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER), 2019, : 84 - 94
  • [25] On the Robustness of Clone Detection to Code Obfuscation
    Schulze, Sandro
    Meyer, Daniel
    2013 7TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2013, : 62 - 68
  • [26] Code Clone Detection: A Literature Review
    Chen Q.-Y.
    Li S.-P.
    Yan M.
    Xia X.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (04): : 962 - 980
  • [27] Interface Driven Code Clone Detection
    Misu, Md Rakib Hossain
    Sakib, Kazi
    2017 24TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2017), 2017, : 747 - 748
  • [28] Poisoned source code detection in code models
    Ghannoum, Ehab
    Ghafari, Mohammad
    JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 226
  • [29] A Novel Code Stylometry-based Code Clone Detection Strategy
    Dong, Wenyuan
    Feng, Zhiyong
    Wei, Hua
    Luo, Hong
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 1516 - 1521
  • [30] Generic Code Cloning method for Detection of Clone Code in Software Development
    Haque, Syed Mohd Fazalul
    Srikanth, V.
    Reddy, E. Sreenivasa
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA MINING AND ADVANCED COMPUTING (SAPIENCE), 2016, : 340 - 344