Static PE Malware Detection Using Gradient Boosting Decision Trees Algorithm

被引:16
|
作者
Huu-Danh Pham [1 ]
Tuan Dinh Le [2 ]
Thanh Nguyen Vu [1 ]
机构
[1] Vietnam Natl Univ Ho Chi Minh City, Univ Informat Technol, Ho Chi Minh City, Vietnam
[2] Long An Univ Econ & Ind, Tan An, Long An Provinc, Vietnam
关键词
Malware detection; Machine learning; PE file format; Gradient boosting decision trees; EMBER dataset;
D O I
10.1007/978-3-030-03192-3_17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Static malware detection is an essential layer in a security suite, which attempts to classify samples as malicious or benign before execution. However, most of the related works incur the scalability issues, for examples, methods using neural networks usually take a lot of training time [13], or use imbalanced datasets [17, 20], which makes validation metrics misleading in reality. In this study, we apply a static malware detection method by Portable Executable analysis and Gradient Boosting Decision Tree algorithm. We manage to reduce the training time by appropriately reducing the feature dimension. The experiment results show that our proposed method can achieve up to 99.394% detection rate at 1% false alarm rate, and score results in less than 0.1% false alarm rate at a detection rate 97.572%, based on more than 600,000 training and 200,000 testing samples from Endgame Malware BEnchmark for Research (EMBER) dataset [1].
引用
收藏
页码:228 / 236
页数:9
相关论文
共 50 条
  • [31] A simple boosting algorithm using multi-way branching decision trees
    Hatano, K
    THEORY OF COMPUTING SYSTEMS, 2004, 37 (04) : 503 - 518
  • [32] A Simple Boosting Algorithm Using Multi-Way Branching Decision Trees
    Kohei Hatano
    Theory of Computing Systems, 2004, 37 : 503 - 518
  • [33] PARALLEL IMPLEMENTATION OF PREDICTION ALGORITHM IN GRADIENT BOOSTING TREES METHOD
    Druzhkov, P. N.
    Zolotykh, N. Yu.
    Polovinkin, A. N.
    BULLETIN OF THE SOUTH URAL STATE UNIVERSITY SERIES-MATHEMATICAL MODELLING PROGRAMMING & COMPUTER SOFTWARE, 2011, (10): : 82 - 89
  • [34] Windows PE Malware Detection Using Ensemble Learning
    Azeez, Nureni Ayofe
    Odufuwa, Oluwanifise Ebunoluwa
    Misra, Sanjay
    Oluranti, Jonathan
    Damasevicius, Robertas
    INFORMATICS-BASEL, 2021, 8 (01):
  • [35] Boosting constraint satisfaction using decision trees
    O'Sullivan, B
    Ferguson, A
    Freuder, EC
    ICTAI 2004: 16TH IEEE INTERNATIONALCONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, : 646 - 651
  • [36] Predicting Phenotypes From High-Dimensional Genomes Using Gradient Boosting Decision Trees
    Yu, Tingxi
    Wang, Li
    Zhang, Wuping
    Xing, Guofang
    Han, Jiwan
    Li, Fuzhong
    Cao, Chunqing
    IEEE ACCESS, 2022, 10 : 48126 - 48140
  • [37] Android Malware Detection Based on Network Traffic Using Decision Tree Algorithm
    Zulkifli, Aqil
    Hamid, Isredza Rahmi A.
    Shah, Wahidah Md
    Abdullah, Zubaile
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2018), 2018, 700 : 485 - 494
  • [38] Fractal Based Adaptive Boosting Algorithm for Cognitive Detection of Computer Malware
    Khan, Muhammad Salman
    Siddiqui, Sana
    McLeod, Robert D.
    Ferens, Ken
    Kinsner, Witold
    2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, : 50 - 59
  • [39] Perceptron-Based Ensembles and Binary Decision Trees for Malware Detection
    Vatamanu, Cristina
    Cosovan, Doina
    Gavrilut, Dragos
    Luchian, Henri
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 250 - 259
  • [40] Predicting the Gas Storage Capacity in Shale Formations Using the Extreme Gradient Boosting Decision Trees Method
    Wang, Jiaheng
    Li, Nong
    Huo, Xiangyu
    Yang, Mingli
    Zhang, Li
    ENERGY TECHNOLOGY, 2024, 12 (10)