Entropy and likelihood-based detection of DGA generated domain names and their families

被引:1
|
作者
Bhatia A. [1 ]
Vishvakarma D.K. [2 ]
Kaushik R. [3 ]
Agrawal A. [1 ]
机构
[1] Department of Computer Science, BITS, Rajasthan, Pilani
[2] Center for Artificial Intelligence and Robotics (CAIR), DRDO, Karnataka, Bangalore
[3] Department of Electronics and Communication Engineering, Maulana Azad National Institute of Technology, Madhya Pradesh, Bhopal
关键词
botnets; C&C server; command and control servers; domain generations algorithms; domain name system;
D O I
10.1504/IJSN.2022.125512
中图分类号
学科分类号
摘要
Botnet is a network of hosts (bots) infected by a common malware and controlled by command and control (C&C) servers. Once the malware is found in an infected host, it is easy to get the domain of its C&C server and block it. To counter such detection, many malware families use probabilistic algorithms, known as domain generation algorithms (DGAs), to generate domain names for the C&C servers. In this paper, we propose a probabilistic approach to identify the domain names that are likely to be generated by malware using DGAs. The proposed solution is based on the hypothesis that the entropy of human-generated domain names should be lesser than the entropy of DGA generated domain names. Results show that the percentage of false negatives in the detection of DGA generated domain names using the proposed method is less than 29% across 39 DGA families considered by us in our experimentation. Copyright © 2022 Inderscience Enterprises Ltd.
引用
收藏
页码:147 / 192
页数:45
相关论文
共 50 条
  • [31] An empirical likelihood-based CUSUM for on-line model change detection
    Verdier, Ghislain
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2020, 49 (08) : 1818 - 1839
  • [32] UMUDGA: A dataset for profiling algorithmically generated domain names in botnet detection
    Zago, Mattia
    Gil Perez, Manuel
    Martinez Perez, Gregorio
    DATA IN BRIEF, 2020, 30
  • [33] Using Auxiliary Inputs in Deep Learning Models for Detecting DGA-based Domain Names
    Ghosh, Indraneel
    Kumar, Subham
    Bhatia, Ashutosh
    Vishwakarma, Deepak Kumar
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 391 - 396
  • [34] Detecting Machine Generated Domain Names Based on Morpheme Features
    Zhang Wei-wei
    Gong Jian
    Liu Qian
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON CLOUD COMPUTING AND INFORMATION SECURITY (CCIS 2013), 2013, 52 : 408 - 411
  • [35] Domain-Embeddings Based DGA Detection with Incremental Training Method
    Fang, Xin
    Sun, Xiaoqing
    Yang, Jiahai
    Liu, Xinran
    2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 185 - 190
  • [36] DGA Domain Name Detection Model Based on Gated Convolution and LSTM
    Jiang, Kui
    Wu, Siwei
    Huang, Ruibin
    Deng, Zhaorui
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2025, 19 (03): : 987 - 1006
  • [37] Likelihood-Based Tree Search for Low Complexity Detection in Large MIMO Systems
    Agarwal, Saksham
    Sah, Abhay Kumar
    Chaturvedi, A. K.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2017, 6 (04) : 450 - 453
  • [38] Performance Analysis of Likelihood-Based Multiple Access for Detection Over Fading Channels
    Cohen, Kobi
    Leshem, Amir
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (04) : 2471 - 2481
  • [39] A pairwise likelihood-based approach for changepoint detection in multivariate time series models
    Ma, Ting Fung
    Yau, Chun Yip
    BIOMETRIKA, 2016, 103 (02) : 409 - 421
  • [40] GNSS spoofing detection using a maximum likelihood-based sliding window method
    Jeong, Seongkyun
    PLOS ONE, 2020, 15 (08):