Memory-efficient detection of large-scale obfuscated malware

被引:0
|
作者
Wang Y. [1 ]
Zhang M. [1 ]
机构
[1] College of Computer Science and Technology, Jilin University, Jilin, Changchun
关键词
algorithm; malware; Naïve Bayes;
D O I
10.1504/IJWMC.2024.136586
中图分类号
学科分类号
摘要
Obfuscation techniques are frequently used in malicious programs to evade detection. However, current effective methods often require much memory space during training. This paper proposes a machine-learning-based solution to the malware detection problem that consumes fewer memory resources. We use hash and sparse matrix to build a text bag of words to reduce memory usage during training. Experiments show that our approach reduces the memory footprint by 95% when using 110,000 text data for confusion recognition training compared to the existing model. In the de-obfuscation step, our method improves the recognition accuracy of the import table function by 40%. Our model achieves shallow memory usage during confusion recognition training and enhances the accuracy of imported table recognition. Additionally, the confusion recognition accuracy is only about 10% lower than the confusion recognition model before the improvement. Copyright © 2024 Inderscience Enterprises Ltd.
引用
收藏
页码:48 / 60
页数:12
相关论文
共 50 条
  • [21] RealDroid: Large-Scale Evasive Malware Detection on "Real Devices"
    Liu, Lang
    Gu, Yacong
    Li, Qi
    Su, Purui
    2017 26TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN 2017), 2017,
  • [22] EFFICIENT MEMORY ACCESS IN LARGE-SCALE COMPUTATION
    VITTER, JS
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 480 : 26 - 41
  • [23] A Heuristic Approach for Detection of Obfuscated Malware
    Treadwell, Scott
    Zhou, Mian
    ISI: 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS, 2009, : 291 - 299
  • [24] Malware Propagation in Large-Scale Networks
    Yu, Shui
    Gu, Guofei
    Barnawi, Ahmed
    Guo, Song
    Stojmenovic, Ivan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (01) : 170 - 179
  • [25] Hi-Corrector: a fast, scalable and memory-efficient package for normalizing large-scale Hi-C data
    Li, Wenyuan
    Gong, Ke
    Li, Qingjiao
    Alber, Frank
    Zhou, Xianghong Jasmine
    BIOINFORMATICS, 2015, 31 (06) : 960 - 962
  • [26] Efficient Large-Scale Stance Detection in Tweets
    Yan, Yilin
    Chen, Jonathan
    Shyu, Mei-Ling
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2018, 9 (03): : 1 - 16
  • [27] Extraordinarily Time-and Memory-Efficient Large-Scale Canonical Correlation Analysis in Fourier Domain: From Shallow to Deep
    Shen, Xiang-Jun
    Xu, Zhaorui
    Wang, Liangjun
    Li, Zechao
    Liu, Guangcan
    Fan, Jianping
    Zha, ZhengJun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14989 - 15003
  • [28] MAD-DDS: Memory-efficient automatic discovery data distribution service for large-scale distributed control network
    Nwadiugwu, Williams-Paul
    Kim, Dong-Seong
    Ejaz, Waleed
    Anpalagan, Alagan
    IET COMMUNICATIONS, 2023, 17 (12) : 1432 - 1446
  • [29] Detecting Obfuscated Malware using Memory Feature Engineering
    Carrier, Tristan
    Victor, Princy
    Tekeoglu, Ali
    Lashkari, Arash Habibi
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY (ICISSP), 2021, : 177 - 188
  • [30] Enhanced detection of obfuscated malware in memory dumps: a machine learning approach for advanced cybersecurity
    Md. Alamgir Hossain
    Md. Saiful Islam
    Cybersecurity, 7