Security bug reports classification using fasttext

被引:1
|
作者
Alqahtani, Sultan S. [1 ]
机构
[1] Al Imam Mohammad Ibn Saud Islamic Univ, Comp & Informat Sci Coll, Riyadh, Saudi Arabia
关键词
Maintenance; Bug reports; Machine learning; Security; Software vulnerabilities;
D O I
10.1007/s10207-023-00793-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software developers and maintainers must address security bug reports (SBRs) before they are publicly disclosed, and their system is left vulnerable to attack. Bug tracking systems may contain securities-related reports which are unlabeled as SBRs, which makes it hard for developers to identify them. Therefore, finding unlabeled SBRs is an essential to help security expert developers identify these security issues fast and accurately. The goal of this paper is to aid software developers to better classify bug reports that identify security vulnerabilities as security bug reports through fasttext classifier. Previous work has applied text analytics and machine learning learners to classify which bug reports are security related. We improve on that work, as shown by our analysis of five open-source projects. We first collected a dataset of 45,940 bug reports from five software repositories (e.g., the work of Peters et al. and Shu et al.). Second, we conducted an experiment throughout the classification of SBRs using machine learning technique; particularly, we built fasttext classifiers. Finally, we investigated the accuracy of our built fasttext classifiers in identifying SBRs. Our experiment results show that our fasttext classifier can achieve an average F1 score of 0.81 when used to identify SBRs. Furthermore, we examined the generalizability of identifying SBRs by applying cross-project validation, and our results showed that the fasttext classifier is able to achieve an average F1 score values of 0.65. Finally, we made our data and results available at Alqahtani (fasttext implementation, 2023. https://github.com/isultane/fasttext_classifications) to help the replication of our work.
引用
收藏
页码:1347 / 1358
页数:12
相关论文
共 50 条
  • [41] TRAM: An Approach for Assigning Bug Reports using their Metadata
    Banitaan, Shadi
    Alenezi, Mamdouh
    2013 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND INFORMATION TECHNOLOGY (ICCIT), 2013, : 215 - 219
  • [42] Improving Bug Localization using Correlations in Crash Reports
    Wang, Shaohua
    Khomh, Foutse
    Zou, Ying
    2013 10TH IEEE WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2013, : 247 - 256
  • [43] Predicting Severity of Bug Reports using Implicit Tags
    Kao, Wei-Chen
    Lee, Chao-Yuan
    Yang, Chih-Chuan
    Yang, Cheng-Zen
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1846 - 1855
  • [44] Improving bug management using correlations in crash reports
    Shaohua Wang
    Foutse Khomh
    Ying Zou
    Empirical Software Engineering, 2016, 21 : 337 - 367
  • [45] Improving bug management using correlations in crash reports
    Wang, Shaohua
    Khomh, Foutse
    Zou, Ying
    EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (02) : 337 - 367
  • [46] An Analysis of Software Bug Reports Using Random Forest
    Ha Manh Tran
    Sinh Van Nguyen
    Synh Viet Uyen Ha
    Thanh Quoc Le
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2018, 2018, 11251 : 273 - 285
  • [47] Exploring Metadata in Bug Reports for Bug Localization
    Zhang, Xiaofei
    Yao, Yuan
    Wang, Yaojing
    Xu, Feng
    Lu, Jian
    2017 24TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2017), 2017, : 328 - 337
  • [48] Bug classification using program slicing metrics
    Pan, Kai
    Kim, Sunghun
    Whitehead, E. James, Jr.
    SIXTH IEEE INTERNATIONAL WORKSHOP ON SOURCE CODE ANALYSIS AND MANIPULATION, PROCEEDINGS, 2006, : 31 - +
  • [49] Experience Report: Security Vulnerability Profiles of Mission Critical Software: Empirical Analysis of Security Related Bug Reports
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2017, : 152 - 163
  • [50] An HMM-based approach for automatic detection and classification of duplicate bug reports
    Ebrahimi, Neda
    Trabelsi, Abdelaziz
    Islam, Md Shariful
    Hamou-Lhadj, Abdelwahab
    Khanmohammadi, Kobra
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 113 : 98 - 109