CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引：18

作者：

Liu, Dongjie ^{[1
,2
]}

Lee, Jong-Hyouk ^{[3
]}

机构：

[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China

[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;

D O I：

10.1109/ACCESS.2020.2995157

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.

引用

页码：97258 / 97266

页数：9

共 50 条

[31] Phishing Website Detection Based on Effective CSS Features of Web Pages
Mao, Jian
Tian, Wenqian
Li, Pei
Wei, Tao
Liang, Zhenkai
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2017, 2017, 10251 : 804 - 815
[32] JABBERWOCK: A Tool for WebAssembly Dataset Generation towards Malicious Website Detection
Komiya, Chika
Yanai, Naoto
Yamashita, Kyosuke
Okamura, Shingo
2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS, DSN-W, 2023, : 36 - 39
[33] Malicious Website Detection Using Probabilistic Data Structure Bloom Filter
Nandhini, K.
Balasubramaniam, Ramesh
PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 311 - 316
[34] Malicious web content detection by machine learning
Hou, Yung-Tsung
Chang, Yimeng
Chen, Tsuhan
Laih, Chi-Sung
Chen, Chia-Mei
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (01) : 55 - 60
[35] A WEB PAGE MALICIOUS SCRIPT DETECTION SYSTEM
Zhang, Siyue
Wang, Weiguang
Chen, Zhao
Gu, Heng
Liu, Jianyi
Wang, Cong
2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 394 - 399
[36] DETECTION OF MALICIOUS DNS AND WEB SERVERS USING GRAPH-BASED APPROACHES
Jia, Jinyuan
Dong, Zheng
Li, Jie
Stokes, Jack W.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2625 - 2629
[37] Malicious User Nodes Detection by Web Mining Based Artificial Intelligence Technique
Kumar, Gaurav
Rishiwal, V.
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (01) : 1 - 24
[38] A Solution for Automatically Malicious Web Shell and Web Application Vulnerability Detection
Van-Giap Le
Huu-Tung Nguyen
Dang-Nhac Lu
Ngoc-Hoa Nguyen
COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT I, 2016, 9875 : 367 - 378
[39] Exploiting Feature Interactions for Malicious Website Detection with Overhead-accuracy Tradeoff
Shen, Shuaiqi
Yu, Chong
Zhang, Kuan
Ci, Song
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[40] Malicious Web Content Detection Using Machine Leaning
Desai, Anand
Jatakia, Janvi
Naik, Rohit
Raul, Nataasha
2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1432 - 1436

← 1 2 3 4 5 →