Automated, Reliable Zero-Day Malware Detection Based on Autoencoding Architecture

被引：12

作者：

Kim, Chiho ^{[1
]}

Chang, Sang-Yoon ^{[2
]}

Kim, Jonghyun ^{[3
]}

Lee, Dongeun ^{[1
]}

Kim, Jinoh ^{[1
]}

机构：

[1] Texas A&M Univ, Dept Comp Sci, Commerce, TX 75428 USA

[2] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80918 USA

[3] Elect Telecommun Res Inst, Cybersecur Res Div, Daejeon 34129, South Korea

来源：

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2023年 / 20卷 / 03期

关键词：

Zero-day detection; malware detection; evasion attacks; adversarial attacks; autoencoder; one-class classification; semi-supervised learning; ATTACKS;

D O I：

10.1109/TNSM.2023.3251282

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

While a body of studies has been carried out for malware detection with its significance, they are often limited to known malware patterns due to the reliance on signature-based or supervised learning approaches. The semi-supervised learning approach would be an option for identifying previously unseen patterns (i.e., zero-day detection); however, our preliminary study reveals critical limitations from existing methods, including (i) the profiling-based approach using an autoencoder can provide better detection but is sensitive to the threshold setting, and (ii) one-class (OC) classification does not require a manual threshold discovery but may be limited with low detection rates. In this paper, we present a new detection method incorporating the concept of autoencoding and OC classification, designed to benefit from strong abstraction by neural networks (using an autoencoder) and the removal of the complex threshold selection (using an OC classifier). For this combined architecture, a challenge is concurrent training of the autoencoder and the OC classifier, which may cause an ill-suited learner due to no reference to malware instances. To this end, we introduce a new model selection method that discovers well-optimized models from a variety of combinations. The experimental results performed with public malware datasets (Meraz'18 and Drebin) show the effectiveness of our presented methods with up to 97.1% accuracy, comparable to the supervised learning-based detection. We also examine the impact of evading attacks using adversarial attack tools, the result of which shows resilience to malware variants with over 99% detection rates.

引用

页码：3900 / 3914

页数：15

共 50 条

[1] Zero-day Malware Detection using Threshold-free Autoencoding Architecture
Kim, Chiho
Chang, Sang-Yoon
Kim, Jonghyun
Lee, Dongeun
Kim, Jinoh
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1279 - 1284
[2] Zero-Day Malware Detection
Gandotra, Ekta
Bansal, Divya
Sofat, Sanjccv
2016 SIXTH INTERNATIONAL SYMPOSIUM ON EMBEDDED COMPUTING AND SYSTEM DESIGN (ISED 2016), 2016, : 171 - 175
[3] Detection of Zero-day Malware Based on the Analysis of Opcode Sequences
Zolotukhin, Mikhail
Hamalainen, Timo
2014 IEEE 11TH CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2014,
[4] Big Data Framework for Zero-Day Malware Detection
Gupta, Deepak
Rani, Rinkle
CYBERNETICS AND SYSTEMS, 2018, 49 (02) : 103 - 121
[5] Use of Data Visualisation for Zero-Day Malware Detection
Venkatraman, Sitalakshmi
Alazab, Mamoun
SECURITY AND COMMUNICATION NETWORKS, 2018,
[6] CNN based zero-day malware detection using small binary segments
Wen, Qiaokun
Chow, K. P.
FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2021, 38
[7] A survey of zero-day malware attacks and its detection methodology
Radhakrishnan, Kiran
Menon, Rajeev R.
Nath, Hiran V.
PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 533 - 539
[8] Deep Learning for Zero-day Malware Detection and Classification: A Survey
Deldar, Fatemeh
Abadi, Mahdi
ACM COMPUTING SURVEYS, 2024, 56 (02)
[9] Zero-Day Malware Classification and Detection Using Machine Learning
Kumar J.
Rajendran B.
Sudarsan S.D.
SN Computer Science, 5 (1)
[10] Combining Supervised and Unsupervised Learning for Zero-Day Malware Detection
Comar, Prakash Mandayam
Liu, Lei
Saha, Sabyasachi
Tan, Pang-Ning
Nucci, Antonio
2013 PROCEEDINGS IEEE INFOCOM, 2013, : 2022 - 2030

← 1 2 3 4 5 →