Source code change analysis with deep learning based programming model

Cited by: 0
Authors
Babita Pathik
Meena Sharma
Affiliations
[1] Institute of Engineering & Technology, IT
[2] DAVV, Department of Computer Engineering
Keywords
Change impact analysis; Abstract syntax tree; Path2Vec; Deep learning; Word embedding; Distance metrics; Attention; Un-parsing
DOI
Not available
Abstract
Analyzing changes in source code is a crucial activity for object-oriented parallel programming software. This paper proposes an impact analysis method with Attention BiLSTM (IABLSTM) for detecting changes and the parts they affect in an object-oriented software system. Classical approaches to change impact analysis rely on control flow graphs, program dependence analysis, latent Dirichlet allocation, and data mining. The paper presents a Path2Vec approach that combines a deep learning technique with word embedding to analyze and identify changes. The experiment takes two versions of a Python program and generates their abstract syntax trees (ASTs). Paths are extracted from the AST to produce a token sequence, and a word embedding layer converts the token sequence into unique vectors. A BiLSTM network encodes the sequence into a vector representation, and the embedded outputs are compared using a cosine distance metric. The neural network model is trained on the embedded output. The resulting token sequence is then decoded into an AST path, and the AST path is converted back to code using an un-parsing technique. To strengthen the parallel-programming-based proposed model, an attention mechanism is added to emphasize and detect the differences in the code. The model detects code changes efficiently. The experimental results show that the proposed model's change detection accuracy increases significantly compared with conventional models for change impact analysis. The proposed method can also be applied to impact analysis of object-oriented parallel programs. The empirical evaluation shows that the model reaches approximately 85% validation accuracy in change detection.
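The non-neural steps of the pipeline described in the abstract (parse two versions, extract AST paths, form a token sequence, compare by cosine distance, un-parse) can be sketched with the Python standard library. This is an illustrative sketch only, not the authors' implementation: the learned word embedding, the BiLSTM encoder, and the attention layer are replaced here by plain token-count vectors, and the names `ast_paths`, `token_counts`, `cosine_distance`, `OLD`, and `NEW` are hypothetical. It assumes Python 3.9 or later, where `ast.unparse` is available.

```python
import ast
import math
from collections import Counter


def ast_paths(source):
    """Return every root-to-leaf path of the AST as a list of node-type names."""
    paths = []

    def walk(node, prefix):
        prefix = prefix + [type(node).__name__]
        children = list(ast.iter_child_nodes(node))
        if not children:
            paths.append(prefix)
        for child in children:
            walk(child, prefix)

    walk(ast.parse(source), [])
    return paths


def token_counts(paths):
    """Flatten the paths into one token sequence and count each token.

    Assumption: count vectors stand in for the learned embedding + BiLSTM encoding.
    """
    return Counter(token for path in paths for token in path)


def cosine_distance(a, b):
    """Cosine distance (1 - cosine similarity) between two sparse count vectors."""
    keys = set(a) | set(b)
    dot = sum(a[k] * b[k] for k in keys)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


# Two hypothetical versions of the same function.
OLD = "def area(w, h):\n    return w * h\n"
NEW = "def area(w, h):\n    return w * h + 1\n"

d_same = cosine_distance(token_counts(ast_paths(OLD)), token_counts(ast_paths(OLD)))
d_changed = cosine_distance(token_counts(ast_paths(OLD)), token_counts(ast_paths(NEW)))

# Un-parsing: convert the AST of the new version back to source code.
round_trip = ast.unparse(ast.parse(NEW))
```

Identical versions give a distance of (numerically) zero, while the edited version gives a strictly positive distance. In the approach the abstract describes, a trained network would produce the vectors being compared, so the distances from this count-based stand-in are not comparable to the paper's results.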