DroidEncoder: Malware detection using auto-encoder based feature extractor and machine learning algorithms

Cited by: 27
Authors
Bakir, Halit [1]
Bakir, Rezan [1]
Affiliations
[1] Sivas Univ Sci & Technol, Fac Engn & Nat Sci, Comp Engn Dept, Sivas, Turkiye
Keywords
Auto encoder; Malware detection; Android application; Code analysis; Hybrid analysis; Efficient; Ensemble
DOI
10.1016/j.compeleceng.2023.108804
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Android malware detection has become a hot topic over the last several years. Although many studies have applied machine learning-based methods, little attention has been paid to feature extraction, which is an essential factor when using such methods. In this study, we therefore propose a new feature extraction method based on the auto-encoder structure. In particular, we propose DroidEncoder, a novel auto-encoder-based model for classifying Android malware applications. To this end, an image-based Android app dataset composed of 3000 malicious apps and 3000 benign apps is constructed. Then, three auto-encoders (ANN-based, CNN-based, and VGG19-based) are proposed to extract features from the visualized malware dataset. Three experiments were conducted in which the extracted features were used to train multiple machine learning algorithms, such as decision tree, extra tree, k-nearest neighbors, LightGBM, XGBoost, random forest, linear regression, and support vector machine. Cross-validation with multiple metrics was used to evaluate the performance of the proposed models. According to the obtained results, the proposed method proved its effectiveness, with superior performance in terms of all metrics.
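The pipeline outlined in the abstract (train an auto-encoder, keep the encoder output as the feature vector, then evaluate a classical classifier with cross-validation) can be sketched roughly as follows. This is a minimal pure-Python illustration, not the authors' implementation: the toy data generator, the one-hidden-layer `TinyAutoEncoder`, the bottleneck size, the learning rate, and the k-NN settings are all assumptions chosen for brevity, whereas the paper works with ANN-, CNN-, and VGG19-based auto-encoders on 6000 app images.

```python
import math
import random


def make_toy_images(n_per_class, n_pixels, rng):
    """Hypothetical stand-in for the visualized apps: flattened grayscale vectors."""
    images, labels = [], []
    for label, mean in ((0, 0.2), (1, 0.8)):  # 0 = benign, 1 = malicious
        for _ in range(n_per_class):
            images.append([rng.gauss(mean, 0.1) for _ in range(n_pixels)])
            labels.append(label)
    return images, labels


class TinyAutoEncoder:
    """One tanh hidden layer (the bottleneck) followed by a linear decoder."""

    def __init__(self, n_in, n_hidden, rng):
        self.w1 = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.1, 0.1) for _ in range(n_hidden)] for _ in range(n_in)]
        self.b2 = [0.0] * n_in

    def encode(self, x):
        return [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
                for row, b in zip(self.w1, self.b1)]

    def decode(self, h):
        return [sum(w * v for w, v in zip(row, h)) + b
                for row, b in zip(self.w2, self.b2)]

    def train_step(self, x, lr):
        """One SGD step on 0.5 * squared reconstruction error for a single sample."""
        h = self.encode(x)
        err = [o - t for o, t in zip(self.decode(h), x)]
        # Backpropagate through the decoder before its weights are updated.
        dz = [sum(e * self.w2[i][j] for i, e in enumerate(err)) * (1.0 - h[j] ** 2)
              for j in range(len(h))]
        for i, e in enumerate(err):
            for j, hj in enumerate(h):
                self.w2[i][j] -= lr * e * hj
            self.b2[i] -= lr * e
        for j, g in enumerate(dz):
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * g * xi
            self.b1[j] -= lr * g

    def mse(self, data):
        total = sum((o - t) ** 2 for x in data
                    for o, t in zip(self.decode(self.encode(x)), x))
        return total / (len(data) * len(data[0]))


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k nearest training points (squared Euclidean distance)."""
    def sq_dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    nearest = sorted(range(len(train_x)), key=lambda i: sq_dist(train_x[i], query))[:k]
    return 1 if 2 * sum(train_y[i] for i in nearest) > k else 0


def cross_val_accuracy(features, labels, n_folds, rng):
    """Plain k-fold cross-validation of the k-NN classifier on the encoded features."""
    order = list(range(len(features)))
    rng.shuffle(order)
    correct = 0
    for fold in range(n_folds):
        test_idx = order[fold::n_folds]
        held_out = set(test_idx)
        train_idx = [i for i in order if i not in held_out]
        train_x = [features[i] for i in train_idx]
        train_y = [labels[i] for i in train_idx]
        correct += sum(knn_predict(train_x, train_y, features[i]) == labels[i]
                       for i in test_idx)
    return correct / len(features)


rng = random.Random(0)
images, labels = make_toy_images(n_per_class=30, n_pixels=16, rng=rng)
model = TinyAutoEncoder(n_in=16, n_hidden=4, rng=rng)
mse_before = model.mse(images)
for _ in range(50):  # epochs; the auto-encoder never sees the labels
    for idx in rng.sample(range(len(images)), len(images)):
        model.train_step(images[idx], lr=0.02)
mse_after = model.mse(images)
features = [model.encode(x) for x in images]  # bottleneck activations as features
accuracy = cross_val_accuracy(features, labels, n_folds=5, rng=rng)
print(f"reconstruction MSE: {mse_before:.4f} -> {mse_after:.4f}")
print(f"5-fold k-NN accuracy on encoded features: {accuracy:.3f}")
```

For simplicity the sketch fits the auto-encoder on all samples before cross-validating the classifier. A stricter setup would refit the encoder inside each fold, and any of the classifiers listed in the abstract could take the place of k-NN.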
Pages: 18