N-Trans: Parallel Detection Algorithm for DGA Domain Names

被引:7
|
作者
Yang, Cheng [1 ]
Lu, Tianliang [1 ]
Yan, Shangyi [1 ]
Zhang, Jianling [1 ]
Yu, Xingzhan [1 ]
机构
[1] Peoples Publ Secur Univ China, Coll Informat & Cyber Secur, Beijing 100038, Peoples R China
关键词
malicious domain name; DGA; parallel detection model; N-gram; Transformer model; N-Trans;
D O I
10.3390/fi14070209
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain name generation algorithms are widely used in malware, such as botnet binaries, to generate large sequences of domain names of which some are registered by cybercriminals. Accurate detection of malicious domains can effectively defend against cyber attacks. The detection of such malicious domain names by the use of traditional machine learning algorithms has been explored by many researchers, but still is not perfect. To further improve on this, we propose a novel parallel detection model named N-Trans that is based on the N-gram algorithm with the Transformer model. First, we add flag bits to the first and last positions of the domain name for the parallel combination of the N-gram algorithm and Transformer framework to detect a domain name. The model can effectively extract the letter combination features and capture the position features of letters in the domain name. It can capture features such as the first and last letters in the domain name and the position relationship between letters. In addition, it can accurately distinguish between legitimate and malicious domain names. In the experiment, the dataset is the legal domain name of Alexa and the malicious domain name collected by the 360 Security Lab. The experimental results show that the parallel detection model based on N-gram and Transformer achieves 96.97% accuracy for DGA malicious domain name detection. It can effectively and accurately identify malicious domain names and outperforms the mainstream malicious domain name detection algorithms.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Effective DGA-Domain Detection and Classification with TextCNN and Additional Features
    Hwang, Chanwoong
    Kim, Hyosik
    Lee, Hooki
    Lee, Taejin
    ELECTRONICS, 2020, 9 (07) : 1 - 18
  • [32] Domain-Embeddings Based DGA Detection with Incremental Training Method
    Fang, Xin
    Sun, Xiaoqing
    Yang, Jiahai
    Liu, Xinran
    2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 185 - 190
  • [33] Algorithmically generated malicious domain names detection based on n-grams features
    Cucchiarelli, Alessandro
    Morbidoni, Christian
    Spalazzi, Luca
    Baldi, Marco
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 170
  • [34] Detection of algorithmically generated malicious domain names using masked N-grams
    Selvi, Jose
    Rodriguez, Ricardo J.
    Soria-Olivas, Emilio
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 156 - 163
  • [35] DGA Domain Name Detection Model Based on Gated Convolution and LSTM
    Jiang, Kui
    Wu, Siwei
    Huang, Ruibin
    Deng, Zhaorui
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2025, 19 (03): : 987 - 1006
  • [36] A Semi-Supervised Learning Scheme to Detect Unknown DGA Domain Names based on Graph Analysis
    Yan, Fan
    Liu, Jia
    Gu, Liang
    Chen, Zelong
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1578 - 1583
  • [37] DGA Domain Name Detection and Classification Using Deep Learning Models
    Nadagoudar, Ranjana B.
    Ramakrishna, M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 306 - 315
  • [38] A Machine Learning Framework for Studying Domain Generation Algorithm (DGA)-Based Malware
    Chin, Tommy
    Xiong, Kaiqi
    Hu, Chengbin
    Li, Yi
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2018, PT I, 2018, 254 : 433 - 448
  • [39] Far from classification algorithm: dive into the preprocessing stage in DGA detection
    Tong, Mingkai
    Li, Guo
    Zhang, Runzi
    Xue, Jianxin
    Liu, Wenmao
    Yang, Jiahai
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 468 - 474
  • [40] From Real Malicious Domains to Possible False Positives in DGA Domain Detection
    Shahzad, Haleh
    Sattar, Abdul Rahman
    Skandaraniyam, Janahan
    2021 IEEE 13TH INTERNATIONAL CONFERENCE ON COMPUTER RESEARCH AND DEVELOPMENT (ICCRD 2021), 2021, : 6 - 10