Effective and Efficient Hybrid Android Malware Classification Using Pseudo-Label Stacked Auto-Encoder

被引:74
|
作者
Mahdavifar, Samaneh [1 ]
Alhadidi, Dima [2 ]
Ghorbani, Ali. A. [1 ]
机构
[1] Univ New Brunswick, Canadian Inst Cybersecur CIC, Fac Comp Sci, Fredericton, NB, Canada
[2] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
关键词
Android malware; Category; Classification; Hybrid analysis; Semi-supervised learning; Stacked auto-encoder; Deep learning;
D O I
10.1007/s10922-021-09634-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.
引用
收藏
页数:34
相关论文
共 50 条
  • [31] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Nahed Tawfik
    Heba A. Elnemr
    Mahmoud Fakhr
    Moawad I. Dessouky
    Fathi E. Abd El-Samie
    Journal of Digital Imaging, 2022, 35 : 1308 - 1325
  • [32] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Tawfik, Nahed
    Elnemr, Heba A.
    Fakhr, Mahmoud
    Dessouky, Moawad I.
    Abd El-Samie, Fathi E.
    JOURNAL OF DIGITAL IMAGING, 2022, 35 (05) : 1308 - 1325
  • [33] An Anomaly Detection Method to Detect Web Attacks Using Stacked Auto-Encoder
    Vartouni, Ali Moradi
    Kashi, Saeed Sedighian
    Teshnehlab, Mohammad
    2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 131 - 134
  • [34] TOWARDS EFFICIENT VARIATIONAL AUTO-ENCODER USING WASSERSTEIN DISTANCE
    Chen, Zichuan
    Liu, Peng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 81 - 85
  • [35] Stacked supervised auto-encoder with graph regularization for feature extraction and fault classification in chemical processes
    Li, Dazi
    Liu, Jianxun
    Ma, Xin
    Jin, Qibing
    JOURNAL OF PROCESS CONTROL, 2023, 127
  • [36] Design of Ensemble Stacked Auto-Encoder for Classification of Horse Gaits with MEMS Inertial Sensor Technology
    Lee, Jae-Neung
    Byeon, Yeong-Hyeon
    Kwak, Keun-Chang
    MICROMACHINES, 2018, 9 (08):
  • [37] Familial Classification of Android Malware using Hybrid Analysis
    Cavli, Omer Faruk Turan
    Sen, Sevil
    2020 INTERNATIONAL CONFERENCE ON INFORMATION SECURITY AND CRYPTOLOGY (ISCTURKEY 2020), 2020, : 62 - 67
  • [38] Effective and Efficient Android Malware Detection and Category Classification Using the Enhanced KronoDroid Dataset
    Waheed, Mudassar
    Qadir, Sana
    Security and Communication Networks, 2024, 2024
  • [39] An effective fault diagnosis approach for bearing using stacked de-noising auto-encoder with structure adaptive adjustment
    Chen, Lerui
    Ma, Yidan
    Hu, Heyu
    Khan, Umer Sadiq
    MEASUREMENT, 2023, 214
  • [40] Automatic Personality Perception Using Auto-encoder And Hierarchical Fuzzy Classification
    Zaferani, Effat Jalaeian
    Teshnehlab, Mohammad
    Vali, Mansour
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,