Static PE Malware Detection Using Gradient Boosting Decision Trees Algorithm

被引:16
|
作者
Huu-Danh Pham [1 ]
Tuan Dinh Le [2 ]
Thanh Nguyen Vu [1 ]
机构
[1] Vietnam Natl Univ Ho Chi Minh City, Univ Informat Technol, Ho Chi Minh City, Vietnam
[2] Long An Univ Econ & Ind, Tan An, Long An Provinc, Vietnam
关键词
Malware detection; Machine learning; PE file format; Gradient boosting decision trees; EMBER dataset;
D O I
10.1007/978-3-030-03192-3_17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Static malware detection is an essential layer in a security suite, which attempts to classify samples as malicious or benign before execution. However, most of the related works incur the scalability issues, for examples, methods using neural networks usually take a lot of training time [13], or use imbalanced datasets [17, 20], which makes validation metrics misleading in reality. In this study, we apply a static malware detection method by Portable Executable analysis and Gradient Boosting Decision Tree algorithm. We manage to reduce the training time by appropriately reducing the feature dimension. The experiment results show that our proposed method can achieve up to 99.394% detection rate at 1% false alarm rate, and score results in less than 0.1% false alarm rate at a detection rate 97.572%, based on more than 600,000 training and 200,000 testing samples from Endgame Malware BEnchmark for Research (EMBER) dataset [1].
引用
收藏
页码:228 / 236
页数:9
相关论文
共 50 条
  • [21] Histogram-Based Algorithm for Building Gradient Boosting Ensembles of Piecewise Linear Decision Trees
    Guryanov, Aleksei
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2019, 2019, 11832 : 39 - 50
  • [22] DYNAMIC ANALYSIS OF MALWARE USING DECISION TREES
    Ravula, Ravinder R.
    Liszka, Kathy J.
    Chan, Chien-Chung
    KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 74 - 83
  • [23] PDF Malware Detection Based on Optimizable Decision Trees
    Abu Al-Haija, Qasem
    Odeh, Ammar
    Qattous, Hazem
    ELECTRONICS, 2022, 11 (19)
  • [24] DeepMalNet: Evaluating shallow and deep networks for static PE malware detection
    Vinayakumar, R.
    Soman, K. P.
    ICT EXPRESS, 2018, 4 (04): : 255 - 258
  • [25] Root Cause Identification for Road Network Congestion Using the Gradient Boosting Decision Trees
    Chen, Yue
    Li, Changle
    Yue, Wenwei
    Zhang, Hehe
    Mao, Guoqiang
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [26] Gradient boosting decision trees for cyber security threats detection based on network events logs
    Quang Hieu Vu
    Ruta, Dymitr
    Cen, Ling
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5921 - 5928
  • [27] Investigating the contributors to hit-and-run crashes using gradient boosting decision trees
    Han, Baorui
    Huang, Haibo
    Li, Gen
    Jiang, Chenming
    Yang, Zhen
    Zhu, Zhenjun
    PLOS ONE, 2025, 20 (01):
  • [28] Booster: An Accelerator for Gradient Boosting Decision Trees Training and Inference
    He, Mingxuan
    Thottethodi, Mithuna
    Vijaykumar, T. N.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1051 - 1062
  • [29] Trojan attribute inference attack on gradient boosting decision trees
    Ito, Kunihiro
    Enkhtaivan, Batnyam
    Teranishi, Isamu
    Sakuma, Jun
    9TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY, EUROS&P 2024, 2024, : 542 - 559
  • [30] FDPBoost: Federated differential privacy gradient boosting decision trees
    Li, Yingjie
    Feng, Yan
    Qian, Quan
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2023, 74