Building a Multi-class Prediction App for Malicious URLs

Cited by: 0

Authors:

Sundaram, Vijayaraj [1]

Abhi, Shinu [1]

Agarwal, Rashmi [1]

Affiliations:

[1] REVA Univ, REVA Acad Corp Excellence, Bangalore, Karnataka, India

Source:

ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2022, PT II | 2023, Vol. 1798

Keywords:

Multiclass classification; Malicious URLs; Ensemble learning; Nonparametric models; Prediction

DOI:

10.1007/978-3-031-28183-9_32

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

A malicious host URL is a page that houses a malicious snippet that could misuse a user's computing resources, steal confidential data, or carry out other forms of attack. Such URLs are distributed across the World Wide Web under various usage categories such as spam, malware, and phishing. Although numerous methods for identifying malicious URLs have been developed in recent years, cyberattacks continue to occur. This study contributes towards implementing three tiers of a system for detecting and protecting against harmful URLs. The first tier focuses on evaluating the performance of discriminative features in model creation. Discriminative features are derived from URL details and "Whois" webpage information, which helps improve detection performance with low latency and low computational complexity. The influence of feature variation on the detection results of parametric (neural network) and non-parametric classifiers is assessed to narrow down the most prominent features to adopt in the best model for multi-category URL identification. The study reveals that non-parametric ensemble models such as LightGBM, XGBoost, and Random Forest performed well, with a detection accuracy of over 95%, which facilitated building a real-time detection system that differentiates multiple attack types (such as malware, phishing, and spam). The second tier focuses on validation against a global database to check whether an entered URL has already been reported as suspicious by various detection engines. If not, it lets the user update the global database with details of URLs that are new and not yet reported. Finally, the two modules are integrated into a web application built with Streamlit that provides full system protection against malicious URLs.
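
The abstract mentions features derived from URL details but does not list them. As a purely illustrative sketch of what such features could look like, the following computes a few common lexical properties of a URL with the Python standard library. The function name `extract_url_features` and every feature in it are assumptions chosen for illustration; they are not the authors' feature set, and "Whois" lookups are omitted.

```python
import re
from urllib.parse import urlparse


def extract_url_features(url: str) -> dict:
    """Return a few hypothetical lexical features for a URL string."""
    # Add a scheme when missing so urlparse can locate the host part.
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_digits": sum(ch.isdigit() for ch in url),
        # Rough check for a host written as a dotted IPv4 address.
        "has_ip_host": bool(re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host)),
        "uses_https": parsed.scheme == "https",
        # Number of non-empty path segments.
        "path_depth": len([seg for seg in parsed.path.split("/") if seg]),
    }


if __name__ == "__main__":
    print(extract_url_features("http://192.168.0.1/login/verify?id=1"))
```

A dictionary like this could be turned into a numeric row and passed to any multi-class classifier; which features and models actually perform best is the question the paper itself evaluates.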


Pages: 455-475

Page count: 21

