VulDeePecker: A Deep Learning-Based System for Vulnerability Detection

被引:414
|
作者
Li, Zhen [1 ,2 ]
Zou, Deqing [1 ,3 ]
Xu, Shouhuai [4 ]
Ou, Xinyu [1 ]
Jin, Hai [1 ]
Wang, Sujuan [1 ]
Deng, Zhijun [1 ]
Zhong, Yuyi [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Big Data Technol & Syst Lab,Serv Comp Technol & S, Wuhan, Peoples R China
[2] Hebei Univ, Sch Cyber Secur & Comp, Baoding, Peoples R China
[3] Shenzhen Huazhong Univ Sci & Technol Res Inst, Wuhan, Peoples R China
[4] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
基金
美国国家科学基金会;
关键词
D O I
10.14722/ndss.2018.23158
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic detection of software vulnerabilities is an important research problem. However, existing solutions to this problem rely on human experts to define features and often miss many vulnerabilities (i.e., incurring high false negative rate). In this paper, we initiate the study of using deep learning-based vulnerability detection to relieve human experts from the tedious and subjective task of manually defining features. Since deep learning is motivated to deal with problems that are very different from the problem of vulnerability detection, we need some guiding principles for applying deep learning to vulnerability detection. In particular, we need to find representations of software programs that are suitable for deep learning. For this purpose, we propose using code gadgets to represent programs and then transform them into vectors, where a code gadget is a number of (not necessarily consecutive) lines of code that are semantically related to each other. This leads to the design and implementation of a deep learning-based vulnerability detection system, called Vulnerability Deep Pecker (VulDeePecker). In order to evaluate VulDeePecker, we present the first vulnerability dataset for deep learning approaches. Experimental results show that VulDeePecker can achieve much fewer false negatives (with reasonable false positives) than other approaches. We further apply VulDeePecker to 3 software products (namely Xen, Seamonkey, and Libav) and detect 4 vulnerabilities, which are not reported in the National Vulnerability Database but were "silently" patched by the vendors when releasing later versions of these products; in contrast, these vulnerabilities are almost entirely missed by the other vulnerability detection systems we experimented with.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Does data sampling improve deep learning-based vulnerability detection? Yeas! and Nays!
    Yang, Xu
    Wang, Shaowei
    Li, Yi
    Wang, Shaohua
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 2287 - 2298
  • [22] Understanding and Tackling Label Errors in Deep Learning-Based Vulnerability Detection (Experience Paper)
    Nie, Xu
    Li, Ningke
    Wang, Kailong
    Wang, Shangguang
    Luo, Xiapu
    Wang, Haoyu
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 52 - 63
  • [23] Deep learning-based fall detection
    Chiang, Jason Wei Hoe
    Zhang, Li
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 891 - 898
  • [24] VulHunter: An Automated Vulnerability Detection System Based on Deep Learning and Bytecode
    Guo, Ning
    Li, Xiaoyong
    Yin, Hui
    Gao, Yali
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2019), 2020, 11999 : 199 - 218
  • [25] MAPAS: a practical deep learning-based android malware detection system
    Kim, Jinsung
    Ban, Younghoon
    Ko, Eunbyeol
    Cho, Haehyun
    Yi, Jeong Hyun
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2022, 21 (04) : 725 - 738
  • [26] A Deep Learning-Based Real-time Seizure Detection System
    Shawki, N.
    Elseify, T.
    Cap, T.
    Shah, V
    Obeid, I
    Picone, J.
    2020 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM, 2020,
  • [27] Deep Learning-Based Intrusion Detection System for Advanced Metering Infrastructure
    El Mrabet, Zakaria
    Ezzari, Mehdi
    Elghazi, Hassan
    Abou El Majd, Badr
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEMS & SECURITY (NISS19), 2019,
  • [28] Deep Learning-based Embedded Intrusion Detection System for Automotive CAN
    Khandelwal, Shashwat
    Wadhwa, Eashan
    Shreejith, Shanker
    2022 IEEE 33RD INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2022, : 88 - 92
  • [29] A Novel Deep Learning-Based Intrusion Detection System for IoT Networks
    Awajan, Albara
    COMPUTERS, 2023, 12 (02)
  • [30] Deep Learning-Based Multistage Fire Detection System and Emerging Direction
    Sultan, Tofayet
    Chowdhury, Mohammad Sayem
    Safran, Mejdl
    Mridha, M. F.
    Dey, Nilanjan
    FIRE-SWITZERLAND, 2024, 7 (12):