Static PE Malware Detection Using Gradient Boosting Decision Trees Algorithm

被引:16
|
作者
Huu-Danh Pham [1 ]
Tuan Dinh Le [2 ]
Thanh Nguyen Vu [1 ]
机构
[1] Vietnam Natl Univ Ho Chi Minh City, Univ Informat Technol, Ho Chi Minh City, Vietnam
[2] Long An Univ Econ & Ind, Tan An, Long An Provinc, Vietnam
关键词
Malware detection; Machine learning; PE file format; Gradient boosting decision trees; EMBER dataset;
D O I
10.1007/978-3-030-03192-3_17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Static malware detection is an essential layer in a security suite, which attempts to classify samples as malicious or benign before execution. However, most of the related works incur the scalability issues, for examples, methods using neural networks usually take a lot of training time [13], or use imbalanced datasets [17, 20], which makes validation metrics misleading in reality. In this study, we apply a static malware detection method by Portable Executable analysis and Gradient Boosting Decision Tree algorithm. We manage to reduce the training time by appropriately reducing the feature dimension. The experiment results show that our proposed method can achieve up to 99.394% detection rate at 1% false alarm rate, and score results in less than 0.1% false alarm rate at a detection rate 97.572%, based on more than 600,000 training and 200,000 testing samples from Endgame Malware BEnchmark for Research (EMBER) dataset [1].
引用
收藏
页码:228 / 236
页数:9
相关论文
共 50 条
  • [41] An Efficient Approach For Malware Detection Using PE Header Specifications
    Rezaei, Tina
    Hamze, Ali
    2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 234 - 239
  • [42] Improving Accuracy and Speed of Network-based Intrusion Detection using Gradient Boosting Trees
    Terado, Ryosuke
    Hayashida, Morihiro
    ICISSP: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2020, : 490 - 497
  • [43] An improved anomaly detection model for IoT security using decision tree and gradient boosting
    Maryam Douiba
    Said Benkirane
    Azidine Guezzaz
    Mourade Azrour
    The Journal of Supercomputing, 2023, 79 : 3392 - 3411
  • [44] Feet Fidgeting Detection Based on Accelerometers Using Decision Tree Learning and Gradient Boosting
    Esseiva, Julien
    Caon, Maurizio
    Mugellini, Elena
    Abou Khaled, Omar
    Aminian, Kamiar
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2018), PT II, 2019, 10814 : 75 - 84
  • [45] An improved anomaly detection model for IoT security using decision tree and gradient boosting
    Douiba, Maryam
    Benkirane, Said
    Guezzaz, Azidine
    Azrour, Mourade
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (03): : 3392 - 3411
  • [46] Effects of Driving Behavior on Fuel Consumption with Explainable Gradient Boosting Decision Trees
    Konstantinou, Christos
    Fafoutellis, Panagiotis
    Mantouka, Eleni G.
    Chalkiadakis, Charis
    Fortsakis, Petro S.
    Vlahogianni, Eleni I.
    2023 8TH INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS, MT-ITS, 2023,
  • [47] Gradient boosting decision trees to study laboratory and field performance in pavement management
    Berangi, Mohammadjavad
    Lontra, Bernardo Mota
    Anupam, Kumar
    Erkens, Sandra
    Van Vliet, Dave
    Snippe, Almar
    Moenielal, Mahesh
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2025, 40 (01) : 3 - 32
  • [48] A mobile recommendation system based on Logistic Regression and Gradient Boosting Decision Trees
    Wang, Yaozheng
    Feng, Dawei
    Ii, Dongsheng
    Chen, Xinyuan
    Zhac, Yunxiang
    Niu, Xin
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1896 - 1902
  • [49] Explainable Steel Quality Prediction System Based on Gradient Boosting Decision Trees
    Takalo-Mattila, Janne
    Heiskanen, Mikko
    Kyllonen, Vesa
    Maatta, Leena
    Bogdanoff, Agne
    IEEE ACCESS, 2022, 10 : 68099 - 68110
  • [50] Retrieval-Based Gradient Boosting Decision Trees for Disease Risk Assessment
    Ma, Handong
    Cao, Jiahang
    Fang, Yuchen
    Zhang, Weinan
    Sheng, Wenbo
    Zhang, Shaodian
    Yu, Yong
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3468 - 3476