Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

被引:25
|
作者
Ullah, Farhan [1 ]
Alsirhani, Amjad [2 ,3 ]
Alshahrani, Mohammed Mujib [4 ]
Alomari, Abdullah [5 ]
Naeem, Hamad [6 ]
Shah, Syed Aziz [7 ]
机构
[1] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Peoples R China
[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Aljouf, Saudi Arabia
[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[4] Univ Bisha, Coll Comp & Informat Technol, Bisha 61361, Saudi Arabia
[5] Albaha Univ, Dept Comp Sci, Albaha 65799, Saudi Arabia
[6] Zhoukou Normal Univ, Sch Comp Sci & Technol, Zhoukou 466001, Peoples R China
[7] Coventry Univ, Fac Res Ctr Intelligent Healthcare, Coventry CV1 5RW, W Midlands, England
关键词
malware analysis; transfer learning; malware visualization; explainable AI; cybersecurity; malicious; network behavior; PERMISSION;
D O I
10.3390/s22186766
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Android has become the leading mobile ecosystem because of its accessibility and adaptability. It has also become the primary target of widespread malicious apps. This situation needs the immediate implementation of an effective malware detection system. In this study, an explainable malware detection system was proposed using transfer learning and malware visual features. For effective malware detection, our technique leverages both textual and visual features. First, a pre-trained model called the Bidirectional Encoder Representations from Transformers (BERT) model was designed to extract the trained textual features. Second, the malware-to-image conversion algorithm was proposed to transform the network byte streams into a visual representation. In addition, the FAST (Features from Accelerated Segment Test) extractor and BRIEF (Binary Robust Independent Elementary Features) descriptor were used to efficiently extract and mark important features. Third, the trained and texture features were combined and balanced using the Synthetic Minority Over-Sampling (SMOTE) method; then, the CNN network was used to mine the deep features. The balanced features were then input into the ensemble model for efficient malware classification and detection. The proposed method was analyzed extensively using two public datasets, CICMalDroid 2020 and CIC-InvesAndMal2019. To explain and validate the proposed methodology, an interpretable artificial intelligence (AI) experiment was conducted.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Cyber-Threat Detection System Using a Hybrid Approach of Transfer Learning and Multi-Model Image Representation
    Ullah, Farhan
    Ullah, Shamsher
    Naeem, Muhammad Rashid
    Mostarda, Leonardo
    Rho, Seungmin
    Cheng, Xiaochun
    SENSORS, 2022, 22 (15)
  • [2] Malware detection using image representation of malware data and transfer learning
    Rustam, Furqan
    Ashraf, Imran
    Jurcut, Anca Delia
    Bashir, Ali Kashif
    Bin Zikria, Yousaf
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 172 : 32 - 50
  • [3] A multi-model ensemble learning framework for imbalanced android malware detection
    Zhu, Hui-juan
    Li, Yang
    Wang, Liang-min
    Sheng, Victor S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 234
  • [4] Fake User Detection Based on Multi-Model Joint Representation
    Li, Jun
    Jiang, Wentao
    Zhang, Jianyi
    Shao, Yanhua
    Zhu, Wei
    INFORMATION, 2024, 15 (05)
  • [5] Vision Transformers, Ensemble Model, and Transfer Learning Leveraging Explainable AI for Brain Tumor Detection and Classification
    Hossain, Shahriar
    Chakrabarty, Amitabha
    Gadekallu, Thippa Reddy
    Alazab, Mamoun
    Piran, Md. Jalil
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1261 - 1272
  • [6] An Early Detection of Android Malware Using System Calls based Machine Learning Model
    Zhang, Xinrun
    Mathur, Akshay
    Zhao, Lei
    Rahmat, Safia
    Niyaz, Quamar
    Javaid, Ahmad
    Yang, Xiaoli
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, ARES 2022, 2022,
  • [7] A New Framework for Visual Classification of Multi-Channel Malware Based on Transfer Learning
    Zhao, Zilin
    Yang, Shumian
    Zhao, Dawei
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [8] Explainable Transfer Learning-Based Deep Learning Model for Pelvis Fracture Detection
    Kassem, Mohamed A. A.
    Naguib, Soaad M. M.
    Hamza, Hanaa M. M.
    Fouda, Mostafa M. M.
    Saleh, Mohamed K. K.
    Hosny, Khalid M. M.
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [9] BAITRADAR: A MULTI-MODEL CLICKBAIT DETECTION ALGORITHM USING DEEP LEARNING
    Gamage, Bhanuka
    Labib, Adnan
    Joomun, Aisha
    Lim, Chern Hong
    Wong, KokSheik
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2665 - 2669
  • [10] Multi-model imaging detection using a learning feature fusion module
    Gao, Sihao
    Cao, Yu
    Zhang, Wenjing
    Dai, Qian
    Li, Jun
    Xu, Xiaojun
    SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166