Empirical Analysis of Learning-based Malware Detection Methods using Image Visualization

被引：1

作者：

Sheneamer, Abdullah ^{[1
]}

Alhazmi, Essa ^{[1
]}

Henrydoss, James ^{[2
]}

机构：

[1] Jazan Univ, Dept Comp Sci, Jazan, Saudi Arabia

[2] Univ Colorado, Vis & Secur Technol Lab, Colorado Springs, CO 80907 USA

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2022年 / 13卷 / 04期

关键词：

Malware detection; malware analysis; deep learning; machine learning; malware features;

D O I：

10.14569/IJACSA.2022.01304106

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Malware, a short name for malicious software is an emerging cyber threat. Various researchers have proposed ways to build advanced malware detectors that can mitigate threat actors and enable effective cybersecurity decisions in the past. Recent research implements malware detectors based on visualized images of malware executable files. In this framework, a malware binary is converted into an image, and by extracting image features and applying machine learning methods, the malware is identified based on image similarity. In this research work, we implement the Image visualization-based malware detection method and conduct an empirical analysis of vari-ous learners for selecting a candidate learning classifier that can provide better prediction performance. We evaluate our framework using the following malware datasets, Search And RetrieVAl of Malware (SARVAM), Xue-dataset, and Canadian Institutes for Cyber Security (CIC) datasets. Our experiments include the following learning algorithms, Linear Regression, Random Forest, K-Nearest Neighbor (KNN), Classification and Decision Tree (CART), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), and deep learning-based Convolutional Neural Network (CNN). This image-visualization-based method proves to be effective in terms of prediction accuracy. Some conclusions emerge from our initial study and find that a Con-volutional Neural Network (CNN) algorithm provides relatively better performance when used against SARvAM and various malware datasets. The CNN model achieved a high performance of F1-score and accuracy in the binary classification task reaching 95.70% and 99.50%, consecutively. The model in the multi-classification task achieved of 95.96% and 99.30% (F1-score and accuracy) for detecting malware types. We find that the KNN model outperforms other traditional classifiers.

引用

页码：925 / 936

页数：12

共 50 条

[21] Automatic fire pixel detection using image processing: a comparative analysis of rule-based and machine learning-based methods
Toulouse, Tom
Rossi, Lucile
Celik, Turgay
Akhloufi, Moulay
SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) : 647 - 654
[22] HybriDroid: an empirical analysis on effective malware detection model developed using ensemble methods
Mahindru, Arvind
Sangal, A. L.
JOURNAL OF SUPERCOMPUTING, 2021, 77 (08): : 8209 - 8251
[23] HybriDroid: an empirical analysis on effective malware detection model developed using ensemble methods
Arvind Mahindru
A. L. Sangal
The Journal of Supercomputing, 2021, 77 : 8209 - 8251
[24] A Learning-based Static Malware Detection System with Integrated Feature
Chen, Zhiguo
Zhang, Xiaorui
Kim, Sungryul
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 27 (03): : 891 - 908
[25] A lightweight deep learning-based android malware detection framework
Ma, Runze
Yin, Shangnan
Feng, Xia
Zhu, Huijuan
Sheng, Victor S.
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
[26] Leveraging Machine Learning-Based PDF Malware Detection in Snort
Chbib, Fadlallah
Mustafa, Ali
Khatoun, Rida
International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2024, 2024,
[27] A survey on machine learning-based malware detection in executable files
Singh, Jagsir
Singh, Jaswinder
JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 112
[28] Transferability of Adversarial Examples in Machine Learning-based Malware Detection
Hu, Yang
Wang, Ning
Chen, Yimin
Lou, Wenjing
Hou, Y. Thomas
2022 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2022, : 28 - 36
[29] An Adversarial Learning-based Tor Malware Traffic Detection Model
Hu, Xiaoyan
Gao, Yishu
Cheng, Guang
Wu, Hua
Li, Ruidong
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 74 - 79
[30] A hybrid deep learning image-based analysis for effective malware detection
Venkatraman, Sitalakshmi
Alazab, Mamoun
Vinayakumar, R.
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2019, 47 : 377 - 389

← 1 2 3 4 5 →