DroidEncoder: Malware detection using auto-encoder based feature extractor and machine learning algorithms

被引:27
|
作者
Bakir, Halit [1 ]
Bakir, Rezan [1 ]
机构
[1] Sivas Univ Sci & Technol, Fac Engn & Nat Sci, Comp Engn Dept, Sivas, Turkiye
关键词
Auto encoder; Malware detection; Android application; Code analysis; HYBRID ANALYSIS; EFFICIENT; ENSEMBLE;
D O I
10.1016/j.compeleceng.2023.108804
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Android Malware detection became a hot topic over the last several years. Although considerable studies have been conducted utilizing machine learning-based methods, little attention has been dedicated to the feature extraction importance which considers an essential factor when using machine learning methods. Thus, in this study, we proposed a new feature extraction method based on the auto-encoder structure. Particularly, we propose DroidEncoder, a novel autoencoder-based model to classify Android malware applications. On the grounds of this, an image-based Android app dataset composed of 3000 malicious apps and 3000 benign apps is constructed. Then, three different auto-encoders, namely ANN-based auto-encoder, CNN-based auto-encoder, and VGG19-based auto-encoder have been proposed to extract features from the visualized Malware dataset. Three different experiments were conducted for extracting features in order to train multiple machine learning algorithms, such as decision tree, extra tree, k-nearest neighbors, LightGBM, XGBoost, Random forest, linear regression, and support vector machine. Furthermore, cross-validation alongside multiple metrics was used for evaluating the performance of the proposed models. According to the obtained results, the proposed method approved its affectivity with superior performance in terms of all metrics.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Feature Extraction using Variational Auto-encoder for Radar-based Posture Detection Systems
    Casmin, Eugene
    Oliveira, Rodolfo
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [12] Anomaly-based Intrusion Detection Using Auto-encoder
    Nguimbous, Yves Nsoga
    Ksantini, Riadh
    Bouhoula, Adel
    2019 27TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2019, : 505 - 509
  • [13] Unsupervised feature selection using sparse manifold learning: Auto-encoder approach
    Moslemi, Amir
    Jamshidi, Mina
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [14] Unsupervised Feature Learning Of Gearbox Fault Using Stacked Wavelet Auto-encoder
    Shao, Haidong
    Jiang, Hongkai
    2018 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2018,
  • [15] A deep learning method for lincRNA detection using auto-encoder algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    BMC BIOINFORMATICS, 2017, 18 : 511
  • [16] Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder
    Wang, H.
    Zhao, Y.
    Liu, X. F.
    Xu, X. N.
    Wang, L.
    Zhou, N.
    Xu, Y. M.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, AUTOMATION AND MECHANICAL ENGINEERING (EAME 2015), 2015, 13 : 342 - 345
  • [17] A deep learning method for lincRNA detection using auto-encoder algorithm
    Ning Yu
    Zeng Yu
    Yi Pan
    BMC Bioinformatics, 18
  • [18] Online deep learning based on auto-encoder
    Zhang, Si-si
    Liu, Jian-wei
    Zuo, Xin
    Lu, Run-kun
    Lian, Si-ming
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5420 - 5439
  • [19] Auto-Encoder based Structured Dictinoary Learning
    Liu, Deyin
    Wu, Yuanbo Lin
    Liu, Liangchen
    Hu, Qichang
    Qi, Lin
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [20] Online deep learning based on auto-encoder
    Si-si Zhang
    Jian-wei Liu
    Xin Zuo
    Run-kun Lu
    Si-ming Lian
    Applied Intelligence, 2021, 51 : 5420 - 5439