Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

被引:25
|
作者
Ullah, Farhan [1 ]
Alsirhani, Amjad [2 ,3 ]
Alshahrani, Mohammed Mujib [4 ]
Alomari, Abdullah [5 ]
Naeem, Hamad [6 ]
Shah, Syed Aziz [7 ]
机构
[1] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Peoples R China
[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Aljouf, Saudi Arabia
[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[4] Univ Bisha, Coll Comp & Informat Technol, Bisha 61361, Saudi Arabia
[5] Albaha Univ, Dept Comp Sci, Albaha 65799, Saudi Arabia
[6] Zhoukou Normal Univ, Sch Comp Sci & Technol, Zhoukou 466001, Peoples R China
[7] Coventry Univ, Fac Res Ctr Intelligent Healthcare, Coventry CV1 5RW, W Midlands, England
关键词
malware analysis; transfer learning; malware visualization; explainable AI; cybersecurity; malicious; network behavior; PERMISSION;
D O I
10.3390/s22186766
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Android has become the leading mobile ecosystem because of its accessibility and adaptability. It has also become the primary target of widespread malicious apps. This situation needs the immediate implementation of an effective malware detection system. In this study, an explainable malware detection system was proposed using transfer learning and malware visual features. For effective malware detection, our technique leverages both textual and visual features. First, a pre-trained model called the Bidirectional Encoder Representations from Transformers (BERT) model was designed to extract the trained textual features. Second, the malware-to-image conversion algorithm was proposed to transform the network byte streams into a visual representation. In addition, the FAST (Features from Accelerated Segment Test) extractor and BRIEF (Binary Robust Independent Elementary Features) descriptor were used to efficiently extract and mark important features. Third, the trained and texture features were combined and balanced using the Synthetic Minority Over-Sampling (SMOTE) method; then, the CNN network was used to mine the deep features. The balanced features were then input into the ensemble model for efficient malware classification and detection. The proposed method was analyzed extensively using two public datasets, CICMalDroid 2020 and CIC-InvesAndMal2019. To explain and validate the proposed methodology, an interpretable artificial intelligence (AI) experiment was conducted.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A speech denoising demonstration system using multi-model deep-learning neural networks
    Lu, Ching-Ta
    Shen, Jun-Hong
    Castiglione, Aniello
    Chung, Cheng-Han
    Lu, Yen-Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
  • [42] Multi-model Ontology-based Hybrid Recommender System in E-learning Domain
    Zhuhadar, Leyla
    Nasraoui, Olfa
    Wyatt, Robert
    Romero, Elizabeth
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 91 - +
  • [43] Fault detection of nonlinear systems using multi-model structure: Application to a ship propulsion system
    Habib, Dallagi
    Lecture Notes in Mechanical Engineering, 2013, 1 : 151 - 158
  • [44] Email-Based Cyberstalking Detection On Textual Data Using Multi-Model Soft Voting Technique Of Machine Learning Approach
    Gautam, Arvind Kumar
    Bansal, Abhishek
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2023, 63 (06) : 1362 - 1381
  • [45] Toward Semantic-Based Android Malware Detection Using Model Checking and Machine Learning
    El Hatib, Souad
    Ricaud, Loic
    Desharnais, Josee
    Tawbi, Nadia
    RISKS AND SECURITY OF INTERNET AND SYSTEMS (CRISIS 2020), 2021, 12528 : 289 - 307
  • [46] NMal-Droid: network-based android malware detection system using transfer learning and CNN-BiGRU ensemble
    Ullah, Farhan
    Ullah, Shamsher
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    Zhao, Yue
    WIRELESS NETWORKS, 2024, 30 (06) : 6177 - 6198
  • [47] Type II Solar Radio Burst Segmentation and Detection using Multi-Model Deep Learning Networks
    Scully, Jeremiah
    Flynn, Ronan
    Gallagher, Peter
    Daly, Mark
    2023 34TH IRISH SIGNALS AND SYSTEMS CONFERENCE, ISSC, 2023,
  • [48] Multi-scale Learning based Malware Variant Detection using Spatial Pyramid Pooling Network
    Sriram, S.
    Vinayakumar, R.
    Sowmya, V
    Alazab, Mamoun
    Soman, K. P.
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 740 - 745
  • [49] Explainable AI for binary and multi-class classification of leukemia using a modified transfer learning ensemble model
    Deshpande, Nilkanth Mukund
    Gite, Shilpa
    Pradhan, Biswajeet
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
  • [50] Hourly-Similarity Based Solar Forecasting Using Multi-Model Machine Learning Blending
    Feng, Cong
    Zhang, Jie
    2018 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2018,