Code Aggregate Graph: Effective Representation for Graph Neural Networks to Detect Vulnerable Code

被引:0
|
作者
Nguyen, Hoang Viet [1 ]
Zheng, Junjun [2 ]
Inomata, Atsuo [2 ]
Uehara, Tetsutaro [1 ]
机构
[1] Ritsumeikan University, College of Information Science and Engineering, Kusatsu,5258577, Japan
[2] Osaka University, Graduate School of Information Science and Technology, Osaka,5650871, Japan
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning, especially graph neural networks (GNNs), provides efficient, fast, and automated methods to detect vulnerable code. However, the accuracy could be improved as previous studies were limited by existing code representations. Additionally, the diversity of embedding techniques and GNN models can make selecting the appropriate method challenging. Herein we propose Code Aggregate Graph (CAG) to improve vulnerability detection efficiency. CAG combines the principles of different code analyses such as abstract syntax tree, control flow graph, and program dependence graph with dominator and post-dominator trees. This extensive representation empowers deep graph networks for enhanced classification. We also implement different data encoding methods and neural networks to provide a multidimensional view of the system performance. Specifically, three word embedding approaches and three deep GNNs are utilized to build classifiers. Then CAG is evaluated using two datasets: a real-world open-source dataset and the software assurance reference dataset. CAG is also compared with seven state-of-the-art methods and six classic representations. CAG shows the best performance. Compared to previous studies, CAG has an increased accuracy (5.4%) and F1-score (5.1%). Additionally, experiments confirm that encoding has a positive impact on accuracy (4-6%) but the network type does not. The study should contribute to a meaningful benchmark for future research on code representations, data encoding, and GNNs. © 2013 IEEE.
引用
收藏
页码:123786 / 123800
相关论文
共 50 条
  • [31] Automatic source code summarization with graph attention networks
    Zhou, Yu
    Shen, Juanjuan
    Zhang, Xiaoqing
    Yang, Wenhua
    Han, Tingting
    Chen, Taolue
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 188
  • [32] Understanding the Representation Power of Graph Neural Networks in Learning Graph Topology
    Dehmamy, Nima
    Barabasi, Albert-Laszlo
    Yu, Rose
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [33] A comparison of statistical relational learning and graph neural networks for aggregate graph queries
    Embar, Varun
    Srinivasan, Sriram
    Getoor, Lise
    MACHINE LEARNING, 2021, 110 (07) : 1847 - 1866
  • [34] A comparison of statistical relational learning and graph neural networks for aggregate graph queries
    Varun Embar
    Sriram Srinivasan
    Lise Getoor
    Machine Learning, 2021, 110 : 1847 - 1866
  • [35] Code classification with graph neural networks: Have you ever struggled to make it work?
    Yu, Qingchen
    Xin, Liu
    Zhou, Qingguo
    Zhuge, Jianwei
    Wu, Chunming
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [36] Code Similarity Prediction Model for Industrial Management Features Based on Graph Neural Networks
    Li, Zhenhao
    Lei, Hang
    Ma, Zhichao
    Zhang, Fengyun
    ENTROPY, 2024, 26 (06)
  • [37] A General Source Code Vulnerability Detection Method via Ensemble of Graph Neural Networks
    Zeng, Ciling
    Zhou, Bo
    Dong, Huoyuan
    Wu, Haolin
    Xie, Peiyuan
    Guan, Zhitao
    FRONTIERS IN CYBER SECURITY, FCS 2023, 2024, 1992 : 560 - 574
  • [38] On Representation Knowledge Distillation for Graph Neural Networks
    Joshi, Chaitanya K.
    Liu, Fayao
    Xun, Xu
    Lin, Jie
    Foo, Chuan Sheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4656 - 4667
  • [39] Graph neural networks for molecular and materials representation
    Wu, Xing
    Wang, Hongye
    Gong, Yifei
    Fan, Dong
    Ding, Peng
    Li, Qian
    Qian, Quan
    JOURNAL OF MATERIALS INFORMATICS, 2023, 3 (02):
  • [40] Discovering the Representation Bottleneck of Graph Neural Networks
    Wu, Fang
    Li, Siyuan
    Li, Stan Z.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 7998 - 8008