SeMalBERT: Semantic-based malware detection with bidirectional encoder representations from transformers

Cited by: 9
Authors
Liu, Junming [1]
Zhao, Yuntao [1]
Feng, Yongxin [2]
Hu, Yutao [1]
Ma, Xiangyu [1]
Affiliations
[1] Shenyang Ligong Univ, Sch Informat Sci & Engn, Shenyang 110159, Peoples R China
[2] Shenyang Ligong Univ, Grad Sch, Shenyang 110159, Peoples R China
Keywords
Cyber security; BERT; Malware; CNN; LSTM
DOI
10.1016/j.jisa.2023.103690
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Machine learning models are widely used for identifying malicious software. However, existing models suffer from issues such as imprecise polysemous representations and a lack of contextual semantic representations, leading to the failure to recognize certain types of malicious software. In this paper, we propose a semantic-based intelligent malware detection model called SeMalBERT for identifying malicious software in Windows systems. Specifically, the model utilizes the API function sequences of malicious software as the learned features. Firstly, BERT is applied to accomplish word representation tasks and extract semantic information from the sequences. Secondly, a hybrid discriminator based on Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) is used to explore the chaining relationships between functions. Lastly, an external attention mechanism is added after the LSTM to enable the model to better focus on key information within the text. Experimental results demonstrate that SeMalBERT outperforms existing malware detection techniques in terms of accuracy, F1 score, and loss function value on a general dataset.
Pages: 12
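The abstract says the model takes API function sequences as input and uses BERT for word representation. As a rough illustration of that first step only, the sketch below turns an API-call trace into BERT-style input ids and an attention mask. It is not the authors' preprocessing: the helper names `build_vocab` and `encode`, the whole-name vocabulary (instead of a WordPiece tokenizer), and the example traces are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Special tokens follow the usual BERT convention.
PAD, UNK, CLS, SEP = "[PAD]", "[UNK]", "[CLS]", "[SEP]"


def build_vocab(traces: List[List[str]]) -> Dict[str, int]:
    """Map each special token and each API name seen in the traces to an id."""
    vocab = {tok: i for i, tok in enumerate([PAD, UNK, CLS, SEP])}
    for trace in traces:
        for api in trace:
            vocab.setdefault(api, len(vocab))
    return vocab


def encode(
    trace: List[str], vocab: Dict[str, int], max_len: int
) -> Tuple[List[int], List[int]]:
    """Return (input_ids, attention_mask), both padded to max_len (>= 2)."""
    body = trace[: max_len - 2]  # leave room for [CLS] and [SEP]
    tokens = [CLS] + body + [SEP]
    ids = [vocab.get(tok, vocab[UNK]) for tok in tokens]
    mask = [1] * len(ids)
    pad = max_len - len(ids)
    return ids + [vocab[PAD]] * pad, mask + [0] * pad


# Hypothetical traces: the names are real Windows API functions,
# but the sequences themselves are made up for this example.
traces = [
    ["CreateFileW", "WriteFile", "CloseHandle"],
    ["RegOpenKeyExW", "RegSetValueExW", "CreateFileW"],
]
vocab = build_vocab(traces)
input_ids, attention_mask = encode(traces[0], vocab, max_len=8)
```

In a pipeline like the one the abstract describes, ids of this kind would feed an embedding model such as BERT, whose contextual vectors would then go to the CNN and LSTM discriminator. Those later stages are not sketched here.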