A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

被引:16
|
作者
Dasgupta, Soham [2 ]
Piplai, Aritran [1 ]
Kotal, Anantaa [1 ]
Joshi, Anupam [1 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
[2] Mallya Aditi Int Sch, Bengaluru, Karnataka, India
关键词
Named Entity Recognition; Deep Learning; Cybersecurity; Artificial Intelligence;
D O I
10.1109/BigData50022.2020.9378482
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) is important in the cybersecurity domain. It helps researchers extract cyber threat information from unstructured text sources. The extracted cyber-entities or key expressions can be used to model a cyber-attack described in an open-source text. A large number of generalpurpose NER algorithms have been published that work well in text analysis. These algorithms do not perform well when applied to the cybersecurity domain. In the field of cybersecurity, the open-source text available varies greatly in complexity and underlying structure of the sentences. General-purpose NER algorithms can misrepresent domain-specific words, such as "malicious" and "javascript". In this paper, we compare the recent deep learning-based NER algorithms on a cybersecurity dataset. We created a cybersecurity dataset collected from various sources, including "Microsoft Security Bulletin" and "Adobe Security Updates". Some of these approaches proposed in literature were not used for Cybersecurity. Others are innovations proposed by us. This comparative study helps us identify the NER algorithms that are robust and can work well in sentences taken from a large number of cybersecurity sources. We tabulate their performance on the test set and identify the best NER algorithm for a cybersecurity corpus. We also discuss the different embedding strategies that aid in the process of NER for the chosen deep learning algorithms.
引用
收藏
页码:2596 / 2604
页数:9
相关论文
共 50 条
  • [41] A Hybrid Deep Learning Framework for Bacterial Named Entity Recognition
    Li, Xusheng
    Wang, Xiaoyan
    Zhong, Ran
    Zhong, Duo
    He, Tingting
    Hu, Xiaohua
    Jiang, Xingpeng
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 428 - 433
  • [42] A Comparative Study on the Drum Sound Recognition Algorithms Based Deep Learning
    Lee, Sang Wook
    Heo, Jae Hyuk
    Lee, Sung Taek
    2021 21ST ACIS INTERNATIONAL WINTER CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD-WINTER 2021), 2021, : 282 - 283
  • [43] Bengali Named Entity Recognition: A survey with deep learning benchmark
    Rifat, Md Jamiur Rahman
    Abujar, Sheikh
    Noori, Sheak Rashed Haider
    Hossain, Syed Akhter
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [44] Research Progress on Named Entity Recognition in Chinese Deep Learning
    Li, Li
    Xi, Xuefeng
    Sheng, Shengli
    Cui, Zhiming
    Xu, Jiabao
    Computer Engineering and Applications, 2023, 59 (24) : 46 - 69
  • [45] A deep learning approach for Named Entity Recognition in Urdu language
    Anam, Rimsha
    Waqas Anwar, Muhammad
    Hasan Jamal, Muhammad
    Ijaz Bajwa, Usama
    de la Torre Diez, Isabel
    Silva Alvarado, Eduardo
    Soriano Flores, Emmanuel
    Ashraf, Imran
    PLOS ONE, 2024, 19 (03):
  • [46] A Comparative Study of Named Entity Recognition on Myanmar Language
    Nandar, Tin Latt
    Soe, Thinn Lai
    Soe, Khin Mar
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 60 - 64
  • [47] Named Entity Recognition for Hungarian Using Various Machine Learning Algorithms
    Farkas, Richard
    Szarvast, Gyorgy
    Kocsor, Andras
    ACTA CYBERNETICA, 2006, 17 (03): : 633 - 646
  • [48] Deep Learning-Based Named Entity Recognition System Using Hybrid Embedding
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    CYBERNETICS AND SYSTEMS, 2024, 55 (02) : 279 - 301
  • [49] A Chinese named entity recognition method for landslide geological disasters based on deep learning
    Yang, Banghui
    Zhou, Chunlei
    Li, Suju
    Wang, Yuzhu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [50] An imConvNet-based deep learning model for Chinese medical named entity recognition
    Zheng, Yuchen
    Han, Zhenggong
    Cai, Yimin
    Duan, Xubo
    Sun, Jiangling
    Yang, Wei
    Huang, Haisong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)