Detection of Algorithmically Generated Domain Names Using SMOTE and Hybrid Neural Network

被引:0
|
作者
Zhang, Yudong [1 ,2 ]
Chen, Yuzhong [1 ,2 ]
Lin, Yangyang [1 ,2 ]
Zhang, Yankun [1 ,2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Peoples R China
[2] Fujian Prov Key Lab Network Comp & Intelligent In, Fuzhou 350116, Peoples R China
关键词
Domain name generation; SMOTE; LSTM; CNN; Malicious domain name detection; DGA-BASED BOTNET;
D O I
10.1007/978-981-15-1377-0_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain generation algorithms (DGA) provide methods that use specific parameters as random seeds to generate a large number of random domain names for preventing malicious domain name detection, which greatly increases the difficulty of detecting and defending botnets and malware. State-of-the-art models for detecting algorithmically generated domain names are generally based on the principle of analyzing the statistical characteristics of the domain name and building a classifier to locate the algorithmically generated ones. However, most current models have problems of requiring the manual construction of feature sets for classification, as they are sensitive to the imbalance of the sample distribution in the domain name dataset and are difficult to adapt to frequent changes of the domain name algorithm. To address this issue, we propose a hybrid model that combines a convolutional neural network (CNN) and a bidirectional long-term memory network (BLSTM). First, to solve the problem of the number of domain names generated by DGAs being relatively small and the sample distribution being unbalanced, which consequently decreases detection accuracy, the borderline synthetic minority over sampling technique is employed to optimize the sample balance of the domain name dataset. Second, a hybrid deep neural network that combines CNN and BLSTM is introduced to extract the semantic and context-dependency features from the domain names. The experimental results from different domain-name datasets demonstrate that the proposed model achieves significant improvement over state-of-the-art models with regard to precision and robustness.
引用
收藏
页码:738 / 751
页数:14
相关论文
共 50 条
  • [1] Detection of Algorithmically Generated Domain Names using LSTM
    Vij, Palak
    Nikam, Sayali
    Bhatia, Ashutosh
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [2] Detection of Algorithmically Generated Domain Names Using the Recurrent Convolutional Neural Network with Spatial Pyramid Pooling
    Liu, Zhanghui
    Zhang, Yudong
    Chen, Yuzhong
    Fan, Xinwen
    Dong, Chen
    ENTROPY, 2020, 22 (09)
  • [3] AHDom: Algorithmically generated domain detection using attribute heterogeneous graph neural network
    Hu, Xiaoyan
    Li, Di
    Li, Miao
    Cheng, Guang
    Li, Ruidong
    Wu, Hua
    COMPUTER NETWORKS, 2024, 254
  • [4] Algorithmically Generated Domain Names Detection Using Gated Recurrent Unit Deep Learning
    Nadagoudar, Ranjana B.
    Ramakrishna, M.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (07) : 469 - 481
  • [5] Detection of algorithmically generated malicious domain names using masked N-grams
    Selvi, Jose
    Rodriguez, Ricardo J.
    Soria-Olivas, Emilio
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 156 - 163
  • [6] UMUDGA: A dataset for profiling algorithmically generated domain names in botnet detection
    Zago, Mattia
    Gil Perez, Manuel
    Martinez Perez, Gregorio
    DATA IN BRIEF, 2020, 30
  • [7] Detecting algorithmically generated malicious domain names
    Department of Electrical and Computer Engineering, Texas A and M University, College Station, TX 77843, United States
    不详
    Proc. ACM SIGCOMM Internet Meas. Conf. IMC, (48-61):
  • [8] Toward Optimal LSTM Neural Networks for Detecting Algorithmically Generated Domain Names
    Selvi, Jose
    Rodriguez, Ricardo J.
    Soria-Olivas, Emilio
    IEEE ACCESS, 2021, 9 : 126446 - 126456
  • [9] Dictionary Extraction and Detection of Algorithmically Generated Domain Names in Passive DNS Traffic
    Pereira, Mayana
    Coleman, Shaun
    Yu, Bin
    DeCock, Martine
    Nascimento, Anderson
    RESEARCH IN ATTACKS, INTRUSIONS, AND DEFENSES, RAID 2018, 2018, 11050 : 295 - 314
  • [10] Detection of Algorithmically Generated Domain Names used by Botnets: A Dual Arms Race.
    Spooren, Jan
    Preuveneers, Davy
    Desmet, Lieven
    Janssen, Peter
    Joosen, Wouter
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 1916 - 1923