Security bug reports classification using fasttext

被引:1
|
作者
Alqahtani, Sultan S. [1 ]
机构
[1] Al Imam Mohammad Ibn Saud Islamic Univ, Comp & Informat Sci Coll, Riyadh, Saudi Arabia
关键词
Maintenance; Bug reports; Machine learning; Security; Software vulnerabilities;
D O I
10.1007/s10207-023-00793-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software developers and maintainers must address security bug reports (SBRs) before they are publicly disclosed, and their system is left vulnerable to attack. Bug tracking systems may contain securities-related reports which are unlabeled as SBRs, which makes it hard for developers to identify them. Therefore, finding unlabeled SBRs is an essential to help security expert developers identify these security issues fast and accurately. The goal of this paper is to aid software developers to better classify bug reports that identify security vulnerabilities as security bug reports through fasttext classifier. Previous work has applied text analytics and machine learning learners to classify which bug reports are security related. We improve on that work, as shown by our analysis of five open-source projects. We first collected a dataset of 45,940 bug reports from five software repositories (e.g., the work of Peters et al. and Shu et al.). Second, we conducted an experiment throughout the classification of SBRs using machine learning technique; particularly, we built fasttext classifiers. Finally, we investigated the accuracy of our built fasttext classifiers in identifying SBRs. Our experiment results show that our fasttext classifier can achieve an average F1 score of 0.81 when used to identify SBRs. Furthermore, we examined the generalizability of identifying SBRs by applying cross-project validation, and our results showed that the fasttext classifier is able to achieve an average F1 score values of 0.65. Finally, we made our data and results available at Alqahtani (fasttext implementation, 2023. https://github.com/isultane/fasttext_classifications) to help the replication of our work.
引用
收藏
页码:1347 / 1358
页数:12
相关论文
共 50 条
  • [1] Security bug reports classification using fasttext
    Sultan S. Alqahtani
    International Journal of Information Security, 2024, 23 : 1347 - 1358
  • [2] Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised Classification
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018, : 344 - 355
  • [3] Comparative analysis of impact of classification algorithms on security and performance bug reports
    Said, Maryyam
    Bin Faiz, Rizwan
    Aljaidi, Mohammad
    Alshammari, Muteb
    JOURNAL OF INTELLIGENT SYSTEMS, 2024, 33 (01)
  • [4] On the classification of bug reports to improve bug localization
    Fang, Fan
    Wu, John
    Li, Yanyan
    Ye, Xin
    Aljedaani, Wajdi
    Mkaouer, Mohamed Wiem
    SOFT COMPUTING, 2021, 25 (11) : 7307 - 7323
  • [5] On the classification of bug reports to improve bug localization
    Fan Fang
    John Wu
    Yanyan Li
    Xin Ye
    Wajdi Aljedaani
    Mohamed Wiem Mkaouer
    Soft Computing, 2021, 25 : 7307 - 7323
  • [6] Predicting the Severity of Bug Reports using Classification Algorithms
    Pushpalatha, M. N.
    Mrunalini, M.
    2016 INTERNATIONAL CONFERENCE ON CIRCUITS, CONTROLS, COMMUNICATIONS AND COMPUTING (I4C), 2016,
  • [7] Textual Analysis of Security Bug Reports
    Peeples, Cody R.
    Rotella, Pete
    McLaughlin, Mark-David
    2017 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2017,
  • [8] Automated Classification of Software Bug Reports
    Otoom, Ahmed Fawzi
    Al-jdaeh, Sara
    Hammad, Maen
    PROCEEDINGS OF 9TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND MANAGEMENT (ICICM 2019), 2019, : 17 - 21
  • [9] Malware Detection and Classification Using fastText and BERT
    Yesir, Salih
    Sogukpinar, Ibrahim
    9TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS'21), 2021,
  • [10] Automatic Classification of Complaint Reports in Waste Management Systems Using TF-IDF, fastText, and BERT
    Walkowiak, Tomasz
    Dabrowska, Alicja
    Giel, Robert
    Werbinska-Wojciechowska, Sylwia
    NEW ADVANCES IN DEPENDABILITY OF NETWORKS AND SYSTEMS, DEPCOS-RELCOMEX 2022, 2022, 484 : 371 - 378