VulDeePecker: A Deep Learning-Based System for Vulnerability Detection

被引:414
|
作者
Li, Zhen [1 ,2 ]
Zou, Deqing [1 ,3 ]
Xu, Shouhuai [4 ]
Ou, Xinyu [1 ]
Jin, Hai [1 ]
Wang, Sujuan [1 ]
Deng, Zhijun [1 ]
Zhong, Yuyi [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Big Data Technol & Syst Lab,Serv Comp Technol & S, Wuhan, Peoples R China
[2] Hebei Univ, Sch Cyber Secur & Comp, Baoding, Peoples R China
[3] Shenzhen Huazhong Univ Sci & Technol Res Inst, Wuhan, Peoples R China
[4] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
基金
美国国家科学基金会;
关键词
D O I
10.14722/ndss.2018.23158
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic detection of software vulnerabilities is an important research problem. However, existing solutions to this problem rely on human experts to define features and often miss many vulnerabilities (i.e., incurring high false negative rate). In this paper, we initiate the study of using deep learning-based vulnerability detection to relieve human experts from the tedious and subjective task of manually defining features. Since deep learning is motivated to deal with problems that are very different from the problem of vulnerability detection, we need some guiding principles for applying deep learning to vulnerability detection. In particular, we need to find representations of software programs that are suitable for deep learning. For this purpose, we propose using code gadgets to represent programs and then transform them into vectors, where a code gadget is a number of (not necessarily consecutive) lines of code that are semantically related to each other. This leads to the design and implementation of a deep learning-based vulnerability detection system, called Vulnerability Deep Pecker (VulDeePecker). In order to evaluate VulDeePecker, we present the first vulnerability dataset for deep learning approaches. Experimental results show that VulDeePecker can achieve much fewer false negatives (with reasonable false positives) than other approaches. We further apply VulDeePecker to 3 software products (namely Xen, Seamonkey, and Libav) and detect 4 vulnerabilities, which are not reported in the National Vulnerability Database but were "silently" patched by the vendors when releasing later versions of these products; in contrast, these vulnerabilities are almost entirely missed by the other vulnerability detection systems we experimented with.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] μVulDeePecker: A Deep Learning-Based System for Multiclass Vulnerability Detection
    Zou, Deqing
    Wang, Sujuan
    Xu, Shouhuai
    Li, Zhen
    Jin, Hai
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2224 - 2236
  • [2] A Comparative Study of Deep Learning-Based Vulnerability Detection System
    Li, Zhen
    Zou, Deqing
    Tang, Jing
    Zhang, Zhihao
    Sun, Mingqian
    Jin, Hai
    IEEE ACCESS, 2019, 7 : 103184 - 103197
  • [3] Deep learning-based methodology for vulnerability detection in smart contracts
    Wang, Zhibo
    Guoming, Liu
    Xu, Hongzhen
    You, Shengyu
    Ma, Han
    Wang, Hongling
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [4] DeeSCVHunter: A Deep Learning-Based Framework for Smart Contract Vulnerability Detection
    Yu, Xingxin
    Zhao, Haoyue
    Hou, Botao
    Ying, Zonghao
    Wu, Bin
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] DEEP LEARNING-BASED VULNERABILITY DETECTION AND MITIGATION IN VIRTUALIZATION DATA CENTER
    Manikandan, J.
    Srilakshmi, U.
    INTERNATIONAL JOURNAL OF MARITIME ENGINEERING, 2024, 1 (01): : A647 - A662
  • [6] Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets
    Chakraborty, Partha
    Arumugam, Krishna Kanth
    Alfadel, Mahmoud
    Nagappan, Meiyappan
    McIntosh, Shane
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (08) : 2163 - 2177
  • [7] COCL: An Intelligent Framework for Enhancing Deep Learning-Based Vulnerability Detection
    Li, Wenxuan
    Dou, Shihan
    Wu, Yueming
    Li, Chenxi
    Liu, Yang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) : 4953 - 4961
  • [8] Deep Learning-Based System for Automatic Melanoma Detection
    Adegun, Adekanmi A.
    Viriri, Serestina
    IEEE ACCESS, 2020, 8 : 7160 - 7172
  • [9] Deep learning-based image forgery detection system
    Suresh, Helina Rajini
    Shanmuganathan, M.
    Senthilkumar, T.
    Vidhyasagar, B. S.
    INTERNATIONAL JOURNAL OF ELECTRONIC SECURITY AND DIGITAL FORENSICS, 2024, 16 (02) : 160 - 172
  • [10] A deep learning-based approach for software vulnerability detection using code metrics
    Subhan, Fazli
    Wu, Xiaoxue
    Bo, Lili
    Sun, Xiaobing
    Rahman, Muhammad
    IET SOFTWARE, 2022, 16 (05) : 516 - 526