CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引:18
|
作者
Liu, Dongjie [1 ,2 ]
Lee, Jong-Hyouk [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea
关键词
Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;
D O I
10.1109/ACCESS.2020.2995157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.
引用
收藏
页码:97258 / 97266
页数:9
相关论文
共 50 条
  • [21] Malicious Domain Name Detection Model Based on CNN-GRU-Attention
    Jiang, Yanshu
    Jia, Mingqi
    Zhang, Biao
    Deng, Liwei
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1602 - 1607
  • [22] Malicious URL Detection Based on Multiple Feature Fusion
    Wu, Sen-Yan
    Luo, Xi
    Wang, Wei-Ping
    Qin, Yan
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (09): : 2916 - 2934
  • [23] Real-time detection of cloud tenant malicious behavior based on CNN
    Chen, Hao
    Xiao, Ruizhi
    Jin, Shuyuan
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 998 - 1005
  • [24] Evasive Malicious Website Detection by Leveraging Redirection Subgraph Similarities
    Shibahara, Toshiki
    Takata, Yuta
    Akiyama, Mitsuaki
    Yagi, Takeshi
    Hato, Kunio
    Murata, Masayuki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (03) : 430 - 443
  • [25] Detection of malicious and non-malicious website visitors using unsupervised neural network learning
    Stevanovic, Dusan
    Vlajic, Natalija
    An, Aijun
    APPLIED SOFT COMPUTING, 2013, 13 (01) : 698 - 708
  • [26] Malicious DNS detection by combining improved transformer and CNN
    Li, Heyu
    Li, Zhangmeizhi
    Zhang, Shuyan
    Pu, Xiao
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [27] Web Security in the Digital Age: Artificial Intelligence Solution for Malicious Website Classification
    Krishna, Sujatha
    Natarajan, Rajesh
    Flammini, Francesco
    Alfurhood, Badria Sulaiman
    Janhavi, V.
    Gupta, Shashi Kant
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2025, 21 (01)
  • [28] Web-Based Android Malicious Software Detection and Classification System
    Dogru, Ibrahim Alper
    Kiraz, Omer
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [29] CNN Based Image Classification of Malicious UAVs
    Brown, Jason
    Gharineiat, Zahra
    Raj, Nawin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [30] MALICIOUS URL RECOGNITION AND DETECTION USING ATTENTION-BASED CNN-LSTM
    Peng, Yongfang
    Tian, Shengwei
    Yu, Long
    Lv, Yalong
    Wang, Ruijin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (11) : 5580 - 5593