Few-VulD: A Few-shot learning framework for software vulnerability detection☆ ☆

被引:0
|
作者
Zheng, Tianming [1 ]
Liu, Haojun [2 ]
Xu, Hang [1 ]
Chen, Xiang [1 ]
Yi, Ping [1 ]
Wu, Yue [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA USA
基金
国家重点研发计划;
关键词
Vulnerability detection; Few-shot learning; Meta-learning; BiLSTM; Artificial intelligence; Deep learning; NEURAL-NETWORKS;
D O I
10.1016/j.cose.2024.103992
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of artificial intelligence (AI) has led to the introduction of numerous software vulnerability detection methods based on deep learning algorithms. However, a significant challenge is their dependency on large volumes of code samples for effective training. This requirement poses a considerable hurdle, particularly when adapting to diverse software application scenarios and various vulnerability types, where gathering sufficient and relevant training data for different classification tasks is often arduous. To address the challenge, this paper introduces Few-VulD, a novel framework for software vulnerability detection based on few-shot learning. This framework is designed to be efficiently trained with a minimal number of samples from a variety of existing classification tasks. Its key advantage lies in its ability to rapidly adapt to new vulnerability detection tasks, such as identifying new types of vulnerabilities, with only a small set of learning samples. This capability is particularly beneficial in scenarios where available vulnerability samples are limited. We compare Few-VulD with five state-of-the-art methods on the SySeVR and Big-Vul datasets. On the SySeVR dataset, Few-VulD outperforms all other methods, achieving a recall rate of 87.9% and showing an improvement of 11.7% to 57.8%. On the Big-Vul dataset, Few-VulD outperforms three of the methods, including one that utilizes a pretrained large language model (LLM), with recall improvements ranging from 8.5% to 40.1%. The other two methods employ pretrained LLMs from Microsoft CodeXGLUE (Lu et al., 2021). Few-VulD reaches 78.7% and 95.5% of their recall rates without the need for extensive data pretraining. The performance proves the effectiveness of Few-VulD in vulnerability detection tasks with limited samples.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Few-Shot Lifelong Learning
    Mazumder, Pratik
    Singh, Pravendra
    Rai, Piyush
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2337 - 2345
  • [22] FS-IDS: A framework for intrusion detection based on few-shot learning
    Yang, Jingcheng
    Li, Hongwei
    Shao, Shuo
    Zou, Futai
    Wu, Yue
    COMPUTERS & SECURITY, 2022, 122
  • [23] VulPrompt: Prompt-Based Vulnerability Detection Using Few-Shot Graph Learning
    Irtiza, Saquib
    Li, Xiaodi
    Zamani, Mahmoud
    Khan, Latifur
    Hamlen, Kevin W.
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXVIII, DBSEC 2024, 2024, 14901 : 221 - 240
  • [24] An Intrusion Detection Method Using Few-Shot Learning
    Yu, Yingwei
    Bian, Naizheng
    IEEE ACCESS, 2020, 8 (08): : 49730 - 49740
  • [25] Dynamic relevance learning for few-shot object detection
    Liu, Weijie
    Cai, Xiaojie
    Wang, Chong
    Li, Haohe
    Yu, Shenghao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (04)
  • [26] Few-Shot Object Detection via Metric Learning
    Zhu Min
    Zhang Chongyang
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [27] Few-shot object detection via baby learning
    Vu, Anh-Khoa Nguyen
    Nguyen, Nhat-Duy
    Nguyen, Khanh-Duy
    Nguyen, Vinh-Tiep
    Ngo, Thanh Duc
    Do, Thanh-Toan
    Nguyen, Tam V.
    IMAGE AND VISION COMPUTING, 2022, 120
  • [28] Anomaly Detection via Few-Shot Learning on Normality
    Ando, Shin
    Yamamoto, Ayaka
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 13713 : 275 - 290
  • [29] Active Few-Shot Learning for Sound Event Detection
    Wang, Yu
    Cartwright, Mark
    Bello, Juan Pablo
    INTERSPEECH 2022, 2022, : 1551 - 1555
  • [30] Few-shot Object Detection with Refined Contrastive Learning
    Shangguan, Zeyu
    Huai, Lian
    Liu, Tong
    Jiang, Xingqun
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 991 - 996