Multi-features based Semantic Augmentation Networks for Named Entity Recognition in Threat Intelligence

Cited by: 9
Authors
Liu, Peipei [1, 2]
Li, Hong [1, 2]
Wang, Zuoguang [1, 2]
Liu, Jie [1, 2]
Ren, Yimo [1, 2]
Zhu, Hongsong [1, 2]
Affiliations
[1] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
Funding
National Key R&D Program of China; National Natural Science Foundation of China;
Keywords
cybersecurity; named entity recognition; multi-features; semantic augmentation; attention mechanism;
DOI
10.1109/ICPR56361.2022.9956373
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Extracting cybersecurity entities such as attackers and vulnerabilities from unstructured network texts is an important part of security analysis. However, intelligence data is sparse because cybersecurity entity names vary frequently and are highly random, which makes it difficult for current methods to extract security-related concepts and entities well. To this end, we propose a semantic augmentation method that incorporates different linguistic features to enrich the representation of input tokens, in order to detect and classify cybersecurity entity names in unstructured text. In particular, we encode and aggregate the constituent feature, the morphological feature, and the part-of-speech feature of each input token to improve the robustness of the method. Moreover, each token receives augmented semantic information from two sources: its K most similar words in a cybersecurity domain corpus, where an attentive module weights the words according to their differing contributions, and contextual clues based on a large-scale general-domain corpus. We have conducted experiments on the cybersecurity datasets DNRTI and MalwareTextDB, and the results demonstrate the effectiveness of the proposed method.
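The abstract does not specify how the K most similar words are selected, how the attentive module scores them, or how the result is merged with the token representation. The following is therefore only a minimal sketch of the general idea, not the authors' implementation. It assumes cosine similarity for neighbour selection, dot-product attention with a softmax for weighting, and concatenation for merging; the toy embedding table `TOY_EMBEDDINGS` and the function names are hypothetical.

```python
import math

# Hypothetical toy embeddings standing in for vectors trained on a
# cybersecurity domain corpus (assumption: not taken from the paper).
TOY_EMBEDDINGS = {
    "wannacry": [0.9, 0.1, 0.0],
    "ransomware": [0.8, 0.2, 0.1],
    "petya": [0.85, 0.15, 0.05],
    "firewall": [0.1, 0.9, 0.2],
    "the": [0.0, 0.1, 0.9],
}


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    norm_u = math.sqrt(dot(u, u))
    norm_v = math.sqrt(dot(v, v))
    return dot(u, v) / (norm_u * norm_v) if norm_u and norm_v else 0.0


def top_k_similar(token, embeddings, k):
    """Return the k vocabulary words most similar to `token` (excluding itself)."""
    query = embeddings[token]
    scored = [
        (cosine(query, vec), word)
        for word, vec in embeddings.items()
        if word != token
    ]
    scored.sort(reverse=True)
    return [word for _, word in scored[:k]]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def augment(token, embeddings, k=2):
    """Concatenate a token vector with an attention-weighted sum of its neighbours."""
    query = embeddings[token]
    neighbours = top_k_similar(token, embeddings, k)
    # Assumed scoring function: dot product between the token and each neighbour.
    weights = softmax([dot(query, embeddings[w]) for w in neighbours])
    context = [
        sum(weight * embeddings[word][i] for weight, word in zip(weights, neighbours))
        for i in range(len(query))
    ]
    return query + context, neighbours, weights


vector, neighbours, weights = augment("wannacry", TOY_EMBEDDINGS, k=2)
```

In this sketch the augmented vector is twice the length of the original embedding: the first half is the token itself and the second half is the weighted neighbour summary. A real model would typically learn the attention parameters and feed the result to a sequence tagger.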
Pages: 1557-1563
Page count: 7