DroidEncoder: Malware detection using auto-encoder based feature extractor and machine learning algorithms

被引:27
|
作者
Bakir, Halit [1 ]
Bakir, Rezan [1 ]
机构
[1] Sivas Univ Sci & Technol, Fac Engn & Nat Sci, Comp Engn Dept, Sivas, Turkiye
关键词
Auto encoder; Malware detection; Android application; Code analysis; HYBRID ANALYSIS; EFFICIENT; ENSEMBLE;
D O I
10.1016/j.compeleceng.2023.108804
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Android Malware detection became a hot topic over the last several years. Although considerable studies have been conducted utilizing machine learning-based methods, little attention has been dedicated to the feature extraction importance which considers an essential factor when using machine learning methods. Thus, in this study, we proposed a new feature extraction method based on the auto-encoder structure. Particularly, we propose DroidEncoder, a novel autoencoder-based model to classify Android malware applications. On the grounds of this, an image-based Android app dataset composed of 3000 malicious apps and 3000 benign apps is constructed. Then, three different auto-encoders, namely ANN-based auto-encoder, CNN-based auto-encoder, and VGG19-based auto-encoder have been proposed to extract features from the visualized Malware dataset. Three different experiments were conducted for extracting features in order to train multiple machine learning algorithms, such as decision tree, extra tree, k-nearest neighbors, LightGBM, XGBoost, Random forest, linear regression, and support vector machine. Furthermore, cross-validation alongside multiple metrics was used for evaluating the performance of the proposed models. According to the obtained results, the proposed method approved its affectivity with superior performance in terms of all metrics.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms
    Namita Dabas
    Prachi Ahlawat
    Prabha Sharma
    Arabian Journal for Science and Engineering, 2023, 48 : 9749 - 9767
  • [22] An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms
    Dabas, Namita
    Ahlawat, Prachi
    Sharma, Prabha
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 9749 - 9767
  • [23] An Imputation Method for Missing Data Based on an Extreme Learning Machine Auto-Encoder
    Lu, Cheng-Bo
    Mei, Ying
    IEEE ACCESS, 2018, 6 : 52930 - 52935
  • [24] Nonlocal feature learning based on a variational graph auto-encoder network for small area change detection using SAR imagery
    Su, Hang
    Zhang, Xinzheng
    Luo, Yuqing
    Zhang, Ce
    Zhou, Xichuan
    Atkinson, Peter M.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 193 : 137 - 149
  • [25] Calibration transfer via an extreme learning machine auto-encoder
    Chen, Wo-Ruo
    Bin, Jun
    Lu, Hong-Mei
    Zhang, Zhi-Min
    Liang, Yi-Zeng
    ANALYST, 2016, 141 (06) : 1973 - 1980
  • [26] Malware Detection Using Machine Learning Algorithms in Android
    Sri, Kovvuri Ramya
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, MACHINE LEARNING AND APPLICATIONS, VOL 1, ICDSMLA 2023, 2025, 1273 : 561 - 568
  • [27] A Hybrid Algorithm of Extreme Learning Machine and Sparse Auto-Encoder
    Lin, Yu
    Liang, Yanchun
    Yoshida, Shinichi
    Feng, Xiaoyue
    Guan, Renchu
    SMART COMPUTING AND COMMUNICATION, SMARTCOM 2016, 2017, 10135 : 194 - 204
  • [28] Malware Analysis and Detection Using Machine Learning Algorithms
    Akhtar, Muhammad Shoaib
    Feng, Tao
    SYMMETRY-BASEL, 2022, 14 (11):
  • [29] An efficient and effective convolutional auto-encoder extreme learning machine network for 3d feature learning
    Wang, Yueqing
    Xie, Zhige
    Xu, Kai
    Dou, Yong
    Lei, Yuanwu
    NEUROCOMPUTING, 2016, 174 : 988 - 998
  • [30] Building Auto-Encoder Intrusion Detection System based on random forest feature selection
    Li, XuKui
    Chen, Wei
    Zhang, Qianru
    Wu, Lifa
    COMPUTERS & SECURITY, 2020, 95 (95)