CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引:18
|
作者
Liu, Dongjie [1 ,2 ]
Lee, Jong-Hyouk [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea
关键词
Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;
D O I
10.1109/ACCESS.2020.2995157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.
引用
收藏
页码:97258 / 97266
页数:9
相关论文
共 50 条
  • [31] Phishing Website Detection Based on Effective CSS Features of Web Pages
    Mao, Jian
    Tian, Wenqian
    Li, Pei
    Wei, Tao
    Liang, Zhenkai
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2017, 2017, 10251 : 804 - 815
  • [32] JABBERWOCK: A Tool for WebAssembly Dataset Generation towards Malicious Website Detection
    Komiya, Chika
    Yanai, Naoto
    Yamashita, Kyosuke
    Okamura, Shingo
    2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS, DSN-W, 2023, : 36 - 39
  • [33] Malicious Website Detection Using Probabilistic Data Structure Bloom Filter
    Nandhini, K.
    Balasubramaniam, Ramesh
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 311 - 316
  • [34] Malicious web content detection by machine learning
    Hou, Yung-Tsung
    Chang, Yimeng
    Chen, Tsuhan
    Laih, Chi-Sung
    Chen, Chia-Mei
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (01) : 55 - 60
  • [35] A WEB PAGE MALICIOUS SCRIPT DETECTION SYSTEM
    Zhang, Siyue
    Wang, Weiguang
    Chen, Zhao
    Gu, Heng
    Liu, Jianyi
    Wang, Cong
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 394 - 399
  • [36] DETECTION OF MALICIOUS DNS AND WEB SERVERS USING GRAPH-BASED APPROACHES
    Jia, Jinyuan
    Dong, Zheng
    Li, Jie
    Stokes, Jack W.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2625 - 2629
  • [37] Malicious User Nodes Detection by Web Mining Based Artificial Intelligence Technique
    Kumar, Gaurav
    Rishiwal, V.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (01) : 1 - 24
  • [38] A Solution for Automatically Malicious Web Shell and Web Application Vulnerability Detection
    Van-Giap Le
    Huu-Tung Nguyen
    Dang-Nhac Lu
    Ngoc-Hoa Nguyen
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT I, 2016, 9875 : 367 - 378
  • [39] Exploiting Feature Interactions for Malicious Website Detection with Overhead-accuracy Tradeoff
    Shen, Shuaiqi
    Yu, Chong
    Zhang, Kuan
    Ci, Song
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [40] Malicious Web Content Detection Using Machine Leaning
    Desai, Anand
    Jatakia, Janvi
    Naik, Rohit
    Raul, Nataasha
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1432 - 1436