Source code change analysis with deep learning based programming model

被引:0
|
作者
Babita Pathik
Meena Sharma
机构
[1] Institute of Engineering & Technology,IT
[2] DAVV,Department of Computer Engineering
[3] Institute of Engineering & Technology,undefined
[4] DAVV,undefined
来源
关键词
Change impact analysis; Abstract syntax tree; Path2Vec; Deep learning; Word embedding; Distance metrics; Attention; Un-parsing;
D O I
暂无
中图分类号
学科分类号
摘要
Analyzing the change in source code is a very crucial activity for object-oriented parallel programming software. This paper suggested an Impact analysis method with Attention BiLSTM (IABLSTM) for detecting the changes and their affected part in the object-oriented software system. Classical approaches based on control flow graph, program dependence analysis, latent dirichlet allocation, and data mining have been used for change impact analysis. A Path2Vec approach is presented in the paper, combining a deep learning technique with word embedding to analyze and identify the change. The paper considers two versions of a python program for experiment and generates the abstract syntax tree (AST). Then extract the path to produce a token sequence. Next, convert the token sequence into unique vectors by applying a word embedding layer. The BiLSTM network encodes the sequence into a vector representation. After that, compare the embedded output with the use of cosine distance metrics. We trained the neural network model with the embedded outcome. Then decode the resultant token sequence into a path of AST. Finally, convert the AST path back to code using the un-parsing technique. To strengthen the parallel programming based proposed model, we combined the attention mechanism to emphasize and detect the differences in the code. The model is detecting the change of code efficiently. The experimental results show that our proposed model's change detection accuracy increases significantly compared with other conventional models for change impact analysis. The proposed method can also be applied for impact analysis on object-oriented based parallel programming. The empirical evaluation shows that the model outperforms change detection with approximately 85% validation accuracy.
引用
收藏
相关论文
共 50 条
  • [41] Smali code-based deep learning model for Android malware detectionSmali code-based deep learning model for Android malware...A. Anand et al.
    Abhishek Anand
    Jyoti Prakash Singh
    Amit Kumar Singh
    The Journal of Supercomputing, 81 (4)
  • [42] Deep Learning Similarities from Different Representations of Source Code
    Tufano, Michele
    Watson, Cody
    Bavota, Gabriele
    Di Penta, Massimiliano
    White, Martin
    Poshyvanyk, Denys
    2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, : 542 - 553
  • [43] Source-Code Generation Using Deep Learning: A Survey
    Ahmed, Areeg
    Azab, Shahira
    Abdelhamid, Yasser
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14116 LNAI : 467 - 482
  • [44] Comparison of Image-Based and Text-Based Source Code Classification Using Deep Learning
    Kiyak E.O.
    Cengiz A.B.
    Birant K.U.
    Birant D.
    SN Computer Science, 2020, 1 (5)
  • [45] Source-Code Generation Using Deep Learning: A Survey
    Ahmed, Areeg
    Azab, Shahira
    Abdelhamid, Yasser
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 467 - 482
  • [46] BinDeep: Binary to Source Code Matching Using Deep Learning
    Alrabaee, Saed
    Choo, Kim-Kwang Raymond
    Qbea'h, Mohammad
    Khasawneh, Mahmoud
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1100 - 1107
  • [47] CommtPst: Deep learning source code for commenting positions prediction
    Huang, Yuan
    Hu, Xinyu
    Jia, Nan
    Chen, Xiangping
    Zheng, Zibin
    Luo, Xiapu
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 170
  • [48] A Deep Learning Approach to Identifying Source Code in Images and Video
    Ott, Jordan
    Atchison, Abigail
    Harnack, Paul
    Bergh, Adrienne
    Linstead, Erik
    2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, : 376 - 386
  • [49] Knitting Music and Programming Reflections on the Frontiers of Source Code Analysis
    Gold, Nicolas
    11TH IEEE INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2011), 2011, : 10 - 14
  • [50] Structural Analysis of Source Code Collected from Programming Contests
    Park, Bokuk
    Tak, Haesung
    Cho, Hwan-Gue
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 571 - 576