Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

被引：25

作者：

Ullah, Farhan ^{[1
]}

Alsirhani, Amjad ^{[2
,3
]}

Alshahrani, Mohammed Mujib ^{[4
]}

Alomari, Abdullah ^{[5
]}

Naeem, Hamad ^{[6
]}

Shah, Syed Aziz ^{[7
]}

机构：

[1] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Peoples R China

[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Aljouf, Saudi Arabia

[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada

[4] Univ Bisha, Coll Comp & Informat Technol, Bisha 61361, Saudi Arabia

[5] Albaha Univ, Dept Comp Sci, Albaha 65799, Saudi Arabia

[6] Zhoukou Normal Univ, Sch Comp Sci & Technol, Zhoukou 466001, Peoples R China

[7] Coventry Univ, Fac Res Ctr Intelligent Healthcare, Coventry CV1 5RW, W Midlands, England

来源：

SENSORS | 2022年 / 22卷 / 18期

关键词：

malware analysis; transfer learning; malware visualization; explainable AI; cybersecurity; malicious; network behavior; PERMISSION;

D O I：

10.3390/s22186766

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Android has become the leading mobile ecosystem because of its accessibility and adaptability. It has also become the primary target of widespread malicious apps. This situation needs the immediate implementation of an effective malware detection system. In this study, an explainable malware detection system was proposed using transfer learning and malware visual features. For effective malware detection, our technique leverages both textual and visual features. First, a pre-trained model called the Bidirectional Encoder Representations from Transformers (BERT) model was designed to extract the trained textual features. Second, the malware-to-image conversion algorithm was proposed to transform the network byte streams into a visual representation. In addition, the FAST (Features from Accelerated Segment Test) extractor and BRIEF (Binary Robust Independent Elementary Features) descriptor were used to efficiently extract and mark important features. Third, the trained and texture features were combined and balanced using the Synthetic Minority Over-Sampling (SMOTE) method; then, the CNN network was used to mine the deep features. The balanced features were then input into the ensemble model for efficient malware classification and detection. The proposed method was analyzed extensively using two public datasets, CICMalDroid 2020 and CIC-InvesAndMal2019. To explain and validate the proposed methodology, an interpretable artificial intelligence (AI) experiment was conducted.

引用

页数：22

共 50 条

[41] A speech denoising demonstration system using multi-model deep-learning neural networks
Lu, Ching-Ta
Shen, Jun-Hong
Castiglione, Aniello
Chung, Cheng-Han
Lu, Yen-Yu
MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
[42] Multi-model Ontology-based Hybrid Recommender System in E-learning Domain
Zhuhadar, Leyla
Nasraoui, Olfa
Wyatt, Robert
Romero, Elizabeth
2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 91 - +
[43] Fault detection of nonlinear systems using multi-model structure: Application to a ship propulsion system
Habib, Dallagi
Lecture Notes in Mechanical Engineering, 2013, 1 : 151 - 158
[44] Email-Based Cyberstalking Detection On Textual Data Using Multi-Model Soft Voting Technique Of Machine Learning Approach
Gautam, Arvind Kumar
Bansal, Abhishek
JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2023, 63 (06) : 1362 - 1381
[45] Toward Semantic-Based Android Malware Detection Using Model Checking and Machine Learning
El Hatib, Souad
Ricaud, Loic
Desharnais, Josee
Tawbi, Nadia
RISKS AND SECURITY OF INTERNET AND SYSTEMS (CRISIS 2020), 2021, 12528 : 289 - 307
[46] NMal-Droid: network-based android malware detection system using transfer learning and CNN-BiGRU ensemble
Ullah, Farhan
Ullah, Shamsher
Srivastava, Gautam
Lin, Jerry Chun-Wei
Zhao, Yue
WIRELESS NETWORKS, 2024, 30 (06) : 6177 - 6198
[47] Type II Solar Radio Burst Segmentation and Detection using Multi-Model Deep Learning Networks
Scully, Jeremiah
Flynn, Ronan
Gallagher, Peter
Daly, Mark
2023 34TH IRISH SIGNALS AND SYSTEMS CONFERENCE, ISSC, 2023,
[48] Multi-scale Learning based Malware Variant Detection using Spatial Pyramid Pooling Network
Sriram, S.
Vinayakumar, R.
Sowmya, V
Alazab, Mamoun
Soman, K. P.
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 740 - 745
[49] Explainable AI for binary and multi-class classification of leukemia using a modified transfer learning ensemble model
Deshpande, Nilkanth Mukund
Gite, Shilpa
Pradhan, Biswajeet
INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
[50] Hourly-Similarity Based Solar Forecasting Using Multi-Model Machine Learning Blending
Feng, Cong
Zhang, Jie
2018 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2018,

← 1 2 3 4 5 →