Empirical Analysis of Learning-based Malware Detection Methods using Image Visualization

被引:1
|
作者
Sheneamer, Abdullah [1 ]
Alhazmi, Essa [1 ]
Henrydoss, James [2 ]
机构
[1] Jazan Univ, Dept Comp Sci, Jazan, Saudi Arabia
[2] Univ Colorado, Vis & Secur Technol Lab, Colorado Springs, CO 80907 USA
关键词
Malware detection; malware analysis; deep learning; machine learning; malware features;
D O I
10.14569/IJACSA.2022.01304106
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Malware, a short name for malicious software is an emerging cyber threat. Various researchers have proposed ways to build advanced malware detectors that can mitigate threat actors and enable effective cybersecurity decisions in the past. Recent research implements malware detectors based on visualized images of malware executable files. In this framework, a malware binary is converted into an image, and by extracting image features and applying machine learning methods, the malware is identified based on image similarity. In this research work, we implement the Image visualization-based malware detection method and conduct an empirical analysis of vari-ous learners for selecting a candidate learning classifier that can provide better prediction performance. We evaluate our framework using the following malware datasets, Search And RetrieVAl of Malware (SARVAM), Xue-dataset, and Canadian Institutes for Cyber Security (CIC) datasets. Our experiments include the following learning algorithms, Linear Regression, Random Forest, K-Nearest Neighbor (KNN), Classification and Decision Tree (CART), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), and deep learning-based Convolutional Neural Network (CNN). This image-visualization-based method proves to be effective in terms of prediction accuracy. Some conclusions emerge from our initial study and find that a Con-volutional Neural Network (CNN) algorithm provides relatively better performance when used against SARvAM and various malware datasets. The CNN model achieved a high performance of F1-score and accuracy in the binary classification task reaching 95.70% and 99.50%, consecutively. The model in the multi-classification task achieved of 95.96% and 99.30% (F1-score and accuracy) for detecting malware types. We find that the KNN model outperforms other traditional classifiers.
引用
收藏
页码:925 / 936
页数:12
相关论文
共 50 条
  • [21] Automatic fire pixel detection using image processing: a comparative analysis of rule-based and machine learning-based methods
    Toulouse, Tom
    Rossi, Lucile
    Celik, Turgay
    Akhloufi, Moulay
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) : 647 - 654
  • [22] HybriDroid: an empirical analysis on effective malware detection model developed using ensemble methods
    Mahindru, Arvind
    Sangal, A. L.
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (08): : 8209 - 8251
  • [23] HybriDroid: an empirical analysis on effective malware detection model developed using ensemble methods
    Arvind Mahindru
    A. L. Sangal
    The Journal of Supercomputing, 2021, 77 : 8209 - 8251
  • [24] A Learning-based Static Malware Detection System with Integrated Feature
    Chen, Zhiguo
    Zhang, Xiaorui
    Kim, Sungryul
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 27 (03): : 891 - 908
  • [25] A lightweight deep learning-based android malware detection framework
    Ma, Runze
    Yin, Shangnan
    Feng, Xia
    Zhu, Huijuan
    Sheng, Victor S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [26] Leveraging Machine Learning-Based PDF Malware Detection in Snort
    Chbib, Fadlallah
    Mustafa, Ali
    Khatoun, Rida
    International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2024, 2024,
  • [27] A survey on machine learning-based malware detection in executable files
    Singh, Jagsir
    Singh, Jaswinder
    JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 112
  • [28] Transferability of Adversarial Examples in Machine Learning-based Malware Detection
    Hu, Yang
    Wang, Ning
    Chen, Yimin
    Lou, Wenjing
    Hou, Y. Thomas
    2022 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2022, : 28 - 36
  • [29] An Adversarial Learning-based Tor Malware Traffic Detection Model
    Hu, Xiaoyan
    Gao, Yishu
    Cheng, Guang
    Wu, Hua
    Li, Ruidong
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 74 - 79
  • [30] A hybrid deep learning image-based analysis for effective malware detection
    Venkatraman, Sitalakshmi
    Alazab, Mamoun
    Vinayakumar, R.
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2019, 47 : 377 - 389