Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

被引：25

作者：

Ullah, Farhan ^{[1
]}

Alsirhani, Amjad ^{[2
,3
]}

Alshahrani, Mohammed Mujib ^{[4
]}

Alomari, Abdullah ^{[5
]}

Naeem, Hamad ^{[6
]}

Shah, Syed Aziz ^{[7
]}

机构：

[1] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Peoples R China

[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Aljouf, Saudi Arabia

[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada

[4] Univ Bisha, Coll Comp & Informat Technol, Bisha 61361, Saudi Arabia

[5] Albaha Univ, Dept Comp Sci, Albaha 65799, Saudi Arabia

[6] Zhoukou Normal Univ, Sch Comp Sci & Technol, Zhoukou 466001, Peoples R China

[7] Coventry Univ, Fac Res Ctr Intelligent Healthcare, Coventry CV1 5RW, W Midlands, England

来源：

SENSORS | 2022年 / 22卷 / 18期

关键词：

malware analysis; transfer learning; malware visualization; explainable AI; cybersecurity; malicious; network behavior; PERMISSION;

D O I：

10.3390/s22186766

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Android has become the leading mobile ecosystem because of its accessibility and adaptability. It has also become the primary target of widespread malicious apps. This situation needs the immediate implementation of an effective malware detection system. In this study, an explainable malware detection system was proposed using transfer learning and malware visual features. For effective malware detection, our technique leverages both textual and visual features. First, a pre-trained model called the Bidirectional Encoder Representations from Transformers (BERT) model was designed to extract the trained textual features. Second, the malware-to-image conversion algorithm was proposed to transform the network byte streams into a visual representation. In addition, the FAST (Features from Accelerated Segment Test) extractor and BRIEF (Binary Robust Independent Elementary Features) descriptor were used to efficiently extract and mark important features. Third, the trained and texture features were combined and balanced using the Synthetic Minority Over-Sampling (SMOTE) method; then, the CNN network was used to mine the deep features. The balanced features were then input into the ensemble model for efficient malware classification and detection. The proposed method was analyzed extensively using two public datasets, CICMalDroid 2020 and CIC-InvesAndMal2019. To explain and validate the proposed methodology, an interpretable artificial intelligence (AI) experiment was conducted.

引用

页数：22

共 50 条

[21] Multi-Model Switching Based Fault Detection for the Suspension System of Maglev Train
Wang, Ping
Long, Zhiqiang
Dang, Ning
IEEE ACCESS, 2019, 7 : 6831 - 6841
[22] An Opcode-Based Malware Detection Model Using Supervised Learning Algorithms
Samantray, Om Prakash
Tripathy, Satya Narayan
INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2021, 15 (04) : 18 - 30
[23] Multi-Model Fusion Framework Using Deep Learning for Visual-Textual Sentiment Classification
Al-Tameemi, Israa K. Salman
Feizi-Derakhshi, Mohammad-Reza
Pashazadeh, Saeed
Asadpour, Mohammad
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2145 - 2177
[24] Multi-Model Fusion Framework Using Deep Learning for Visual-Textual Sentiment Classification
Salman Al-Tameemi I.K.
Feizi-Derakhshi M.-R.
Pashazadeh S.
Asadpour M.
Computers, Materials and Continua, 2023, 76 (02): : 2145 - 2177
[25] Heartbeat Classification and Arrhythmia Detection Using a Multi-Model Deep-Learning Technique
Irfan, Saad
Anjum, Nadeem
Althobaiti, Turke
Alotaibi, Abdullah Alhumaidi
Siddiqui, Abdul Basit
Ramzan, Naeem
SENSORS, 2022, 22 (15)
[26] Malware Detection based on Dynamic Multi-feature using Ensemble Learning at Hypervisor
Zhang, Jian
Gao, Cheng
Gong, Liangyi
Gu, Zhaojun
Man, Dapeng
Yang, Wu
Du, Xiaojiang
2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
[27] An explainable machine learning approach for hospital emergency department visits forecasting using continuous training and multi-model regression
Pelaez-Rodriguez, C.
Torres-Lopez, R.
Perez-Aracil, J.
Lopez-Laguna, N.
Sanchez-Rodriguez, S.
Salcedo-Sanz, S.
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 245
[28] Autism spectrum disorder identification using multi-model deep ensemble classifier with transfer learning
Herath, Lakmini
Meedeniya, Dulani
Marasinghe, Janaka
Weerasinghe, Vajira
Tan, Tele
EXPERT SYSTEMS, 2025, 42 (02)
[29] Research on sentiment analysis method of opinion mining based on multi-model fusion transfer learning
Zhongnan Zhao
Wenjing Liu
Kun Wang
Journal of Big Data, 10
[30] Multi-model deep learning system for screening human monkeypox using skin images
Gupta, Kapil
Bajaj, Varun
Jain, Deepak Kumar
Hussain, Amir
EXPERT SYSTEMS, 2024, 41 (10)

← 1 2 3 4 5 →