Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

被引:25
|
作者
Ullah, Farhan [1 ]
Alsirhani, Amjad [2 ,3 ]
Alshahrani, Mohammed Mujib [4 ]
Alomari, Abdullah [5 ]
Naeem, Hamad [6 ]
Shah, Syed Aziz [7 ]
机构
[1] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Peoples R China
[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Aljouf, Saudi Arabia
[3] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[4] Univ Bisha, Coll Comp & Informat Technol, Bisha 61361, Saudi Arabia
[5] Albaha Univ, Dept Comp Sci, Albaha 65799, Saudi Arabia
[6] Zhoukou Normal Univ, Sch Comp Sci & Technol, Zhoukou 466001, Peoples R China
[7] Coventry Univ, Fac Res Ctr Intelligent Healthcare, Coventry CV1 5RW, W Midlands, England
关键词
malware analysis; transfer learning; malware visualization; explainable AI; cybersecurity; malicious; network behavior; PERMISSION;
D O I
10.3390/s22186766
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Android has become the leading mobile ecosystem because of its accessibility and adaptability. It has also become the primary target of widespread malicious apps. This situation needs the immediate implementation of an effective malware detection system. In this study, an explainable malware detection system was proposed using transfer learning and malware visual features. For effective malware detection, our technique leverages both textual and visual features. First, a pre-trained model called the Bidirectional Encoder Representations from Transformers (BERT) model was designed to extract the trained textual features. Second, the malware-to-image conversion algorithm was proposed to transform the network byte streams into a visual representation. In addition, the FAST (Features from Accelerated Segment Test) extractor and BRIEF (Binary Robust Independent Elementary Features) descriptor were used to efficiently extract and mark important features. Third, the trained and texture features were combined and balanced using the Synthetic Minority Over-Sampling (SMOTE) method; then, the CNN network was used to mine the deep features. The balanced features were then input into the ensemble model for efficient malware classification and detection. The proposed method was analyzed extensively using two public datasets, CICMalDroid 2020 and CIC-InvesAndMal2019. To explain and validate the proposed methodology, an interpretable artificial intelligence (AI) experiment was conducted.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Research on sentiment analysis method of opinion mining based on multi-model fusion transfer learning
    Zhao, Zhongnan
    Liu, Wenjing
    Wang, Kun
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [32] Permission-Based Malware Detection System for Android Using Machine Learning Techniques
    Arslan, Recep Sinan
    Dogru, Ibrahim Alper
    Barisci, Necaattin
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (01) : 43 - 61
  • [33] Malware-Detection Model Using Learning-Based Discovery of Static Features
    Hsiao, Shou-Ching
    Kao, Da-Yu
    Tso, Raylin
    2018 IEEE CONFERENCE ON APPLICATION, INFORMATION AND NETWORK SECURITY (AINS 2018), 2018, : 54 - 59
  • [34] Malware Detection Based on Multi-level and Dynamic Multi-feature Using Ensemble Learning at Hypervisor
    Zhang, Jian
    Gao, Cheng
    Gong, Liangyi
    Gu, Zhaojun
    Man, Dapeng
    Yang, Wu
    Li, Wenzhen
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (04): : 1668 - 1685
  • [35] Malware Detection Based on Multi-level and Dynamic Multi-feature Using Ensemble Learning at Hypervisor
    Jian Zhang
    Cheng Gao
    Liangyi Gong
    Zhaojun Gu
    Dapeng Man
    Wu Yang
    Wenzhen Li
    Mobile Networks and Applications, 2021, 26 : 1668 - 1685
  • [36] PlantDet: A Robust Multi-Model Ensemble Method Based on Deep Learning For Plant Disease Detection
    Shovon, Md. Sakib Hossain
    Mozumder, Shakrin Jahan
    Pal, Osim Kumar
    Mridha, M. F.
    Asai, Nobuyoshi
    Shin, Jungpil
    IEEE ACCESS, 2023, 11 : 34846 - 34859
  • [37] Visual Trunk Detection Using Transfer Learning and a Deep Learning-Based Coprocessor
    Aguiar, Andre Silva
    Dos Santos, Filipe Neves
    Miranda De Sousa, Armando Jorge
    Oliveira, Paulo Moura
    Santos, Luis Carlos
    IEEE ACCESS, 2020, 8 : 77308 - 77320
  • [38] A multi-label waste detection model based on transfer learning
    Zhang, Qiang
    Yang, Qifan
    Zhang, Xujuan
    Wei, Wei
    Bao, Qiang
    Su, Jinqi
    Liu, Xueyan
    RESOURCES CONSERVATION AND RECYCLING, 2022, 181
  • [39] Texture Analysis of Tongue Coating in Traditional Chinese Medicine Based on Transfer Learning and Multi-Model Decision
    Qingxin Xiao
    Hui Zhang
    Jing Zhang
    Li Zhuo
    Sensing and Imaging, 2021, 22
  • [40] Texture Analysis of Tongue Coating in Traditional Chinese Medicine Based on Transfer Learning and Multi-Model Decision
    Xiao, Qingxin
    Zhang, Hui
    Zhang, Jing
    Zhuo, Li
    SENSING AND IMAGING, 2021, 22 (01):