Source code change analysis with deep learning based programming model

Citations: 0

Authors:

Babita Pathik

Meena Sharma

Affiliations:

[1] Institute of Engineering & Technology, IT

[2] DAVV, Department of Computer Engineering

Source:

Automated Software Engineering | 2022 / Vol. 29

Keywords:

Change impact analysis; Abstract syntax tree; Path2Vec; Deep learning; Word embedding; Distance metrics; Attention; Un-parsing;

DOI:

Not available

Abstract:

Analyzing changes in source code is a crucial activity for object-oriented parallel programming software. This paper proposes an Impact Analysis method with Attention BiLSTM (IABLSTM) for detecting changes and the parts they affect in an object-oriented software system. Classical approaches to change impact analysis rely on control flow graphs, program dependence analysis, latent Dirichlet allocation, and data mining. The paper presents a Path2Vec approach that combines a deep learning technique with word embedding to analyze and identify changes. The experiment takes two versions of a Python program and generates an abstract syntax tree (AST) for each. Paths are then extracted from the AST to produce a token sequence, and a word embedding layer converts the token sequence into unique vectors. A BiLSTM network encodes the sequence into a vector representation, and the embedded outputs are compared using the cosine distance metric. The neural network model is trained on the embedded output. The resulting token sequence is then decoded into an AST path, and the AST path is converted back to code using an un-parsing technique. To strengthen the proposed model for parallel programming, an attention mechanism is added to emphasize and detect the differences in the code. The experimental results show that the proposed model detects code changes efficiently and that its change detection accuracy increases significantly compared with conventional models for change impact analysis. The proposed method can also be applied to impact analysis of object-oriented parallel programs. The empirical evaluation reports approximately 85% validation accuracy for change detection.
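
The non-neural steps of the pipeline described in the abstract (AST generation, path extraction, cosine comparison, un-parsing) can be illustrated with a minimal Python sketch. This is not the authors' IABLSTM model: the learned word embedding, BiLSTM encoder, and attention layer are replaced here by plain path-count vectors, which is an assumption made only to keep the example self-contained. The function names `root_to_leaf_paths`, `path_distance`, and `changed_statements` and the sample programs `OLD` and `NEW` are hypothetical. `ast.unparse` requires Python 3.9 or later.

```python
import ast
import math
from collections import Counter


def root_to_leaf_paths(source: str) -> list[tuple[str, ...]]:
    """Return every root-to-leaf AST path as a tuple of node type names."""
    paths = []

    def walk(node, prefix):
        prefix = prefix + (type(node).__name__,)
        children = list(ast.iter_child_nodes(node))
        if not children:
            # A node without children ends a path.
            paths.append(prefix)
        for child in children:
            walk(child, prefix)

    walk(ast.parse(source), ())
    return paths


def cosine_distance(a: Counter, b: Counter) -> float:
    """Cosine distance (1 - cosine similarity) between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def path_distance(old_source: str, new_source: str) -> float:
    """Compare two program versions by the counts of their AST paths.

    Assumption: path counts stand in for the learned embeddings of the paper.
    """
    old_vec = Counter(root_to_leaf_paths(old_source))
    new_vec = Counter(root_to_leaf_paths(new_source))
    return cosine_distance(old_vec, new_vec)


def changed_statements(old_source: str, new_source: str) -> list[str]:
    """Un-parse the top-level statements of the new version absent from the old one."""
    old_dumps = {ast.dump(stmt) for stmt in ast.parse(old_source).body}
    return [
        ast.unparse(stmt)
        for stmt in ast.parse(new_source).body
        if ast.dump(stmt) not in old_dumps
    ]


# Two hypothetical versions of the same program.
OLD = "def area(w, h):\n    return w * h\n"
NEW = "def area(w, h):\n    return w * h / 2\n"

if __name__ == "__main__":
    print("distance:", path_distance(OLD, NEW))
    for code in changed_statements(OLD, NEW):
        print(code)
```

A distance of zero means the two versions yield identical path counts; larger values indicate more structural change. A trained encoder, as in the paper, would replace the count vectors with learned representations.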
