Machine Learning to Combine Static Analysis Alerts with Software Metrics to Detect Security Vulnerabilities: An Empirical Study

被引:3
|
作者
Pereira, Jose D'Abruzzo [1 ]
Campos, Joao R. [1 ]
Vieira, Marco [1 ]
机构
[1] Univ Coimbra, CISUC, DEI, Coimbra, Portugal
关键词
Security; Vulnerability Detection; Static Code Analysis; Software Metrics; ANALYSIS TOOLS;
D O I
10.1109/EDCC53658.2021.00008
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Software developers can use diverse techniques and tools to reduce the number of vulnerabilities, but the effectiveness of existing solutions in real projects is questionable. For example, Static Analysis Tools (SATs) report potential vulnerabilities by analyzing code patterns, and Software Metrics (SMs) can be used to predict vulnerabilities based on high-level characteristics of the code. In theory, both approaches can be applied from the early stages of the development process, but it is well known that they fail to detect critical vulnerabilities and raise a large number of false alarms. This paper studies the hypothesis of using Machine Learning (ML) to combine alerts from SATs with SMs to predict vulnerabilities in a large software project (under development for many years). In practice, we use four ML algorithms, alerts from two SATs, and a large number of SMs to predict whether a source code file is vulnerable or not (binary classification) and to predict the vulnerability category (multiclass classification). Results show that one can achieve either high precision or high recall, but not both at the same time. To understand the reason, we analyze and compare snippets of source code, demonstrating that vulnerable and non-vulnerable files share similar characteristics, making it hard to distinguish vulnerable from non-vulnerable code based on SAT alerts and SMs.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [41] Using Machine Learning Algorithms to Detect Malware by Applying Static and Dynamic Analysis Methods
    Palsa, Jakub
    Hurtuk, Jan
    Chovanec, Martin
    Chovancova, Eva
    ACTA POLYTECHNICA HUNGARICA, 2022, 19 (07) : 177 - 196
  • [42] Predicting Code Smells and Analysis of Predictions: Using Machine Learning Techniques and Software Metrics
    Mohammad Y. Mhawish
    Manjari Gupta
    Journal of Computer Science and Technology, 2020, 35 : 1428 - 1445
  • [43] Predicting Code Smells and Analysis of Predictions: Using Machine Learning Techniques and Software Metrics
    Mhawish, Mohammad Y.
    Gupta, Manjari
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (06) : 1428 - 1445
  • [44] An empirical study on bug severity estimation using source code metrics and static analysis
    Mashhadi, Ehsan
    Chowdhury, Shaiful
    Modaberi, Somayeh
    Hemmati, Hadi
    Uddin, Gias
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 217
  • [45] Review and Empirical Analysis of Machine Learning-Based Software Effort Estimation
    Rahman, Mizanur
    Sarwar, Hasan
    Kader, MD. Abdul
    Goncalves, Teresa
    Tin, Ting Tin
    IEEE ACCESS, 2024, 12 : 85661 - 85680
  • [46] An Empirical Study on Bug Severity Estimation Using Source Code Metrics and Static Analysis
    Mashhadi, Ehsan
    Chowdhury, Shaiful
    Modaberi, Somayeh
    Ahmadvand, Hossein
    Hemmati, Hadi
    Uddin, Gias
    SSRN, 2023,
  • [47] An Empirical Study on Bug Severity Estimation using Source Code Metrics and Static Analysis
    Mashhadi, Ehsan
    Chowdhury, Shaiful
    Modaberi, Somayeh
    Hemmati, Hadi
    Uddin, Gias
    arXiv, 2022,
  • [48] An empirical study of software entropy based bug prediction using machine learning
    Kaur A.
    Kaur K.
    Chopra D.
    International Journal of System Assurance Engineering and Management, 2017, 8 (Suppl 2) : 599 - 616
  • [49] An Empirical Study on Data Balancing in Machine Learning Based Software Traceability Methods
    Wang, Bangchao
    Wang, Zihan
    Wan, Hongyan
    Li, Xingfu
    Deng, Yang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [50] Software Estimation in the Design Stage with Statistical Models and Machine Learning: An Empirical Study
    Sanchez-Garcia, Angel J.
    Gonzalez-Hernandez, Maria Saarayim
    Cortes-Verdin, Karen
    Perez-Arriaga, Juan Carlos
    MATHEMATICS, 2024, 12 (07)