Building a Multi-class Prediction App for Malicious URLs

Cited by: 0
Authors
Sundaram, Vijayaraj [1]
Abhi, Shinu [1]
Agarwal, Rashmi [1]
Affiliations
[1] REVA Univ, REVA Acad Corp Excellence, Bangalore, Karnataka, India
Keywords
Multiclass classification; Malicious URLs; Ensemble learning; Nonparametric models; Prediction;
DOI
10.1007/978-3-031-28183-9_32
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A malicious host URL is a page that houses a malicious snippet capable of misusing a user's computing resources, stealing confidential data, or carrying out other forms of attack. Such URLs are distributed across the World Wide Web under various usage categories, such as spam, malware, and phishing. Although numerous methods for identifying malicious URLs have been developed in recent years, cyberattacks continue to occur. This study contributes towards implementing three tiers of a system for detecting and protecting against harmful URLs. The first tier focuses on evaluating the performance of discriminative features in model creation. Discriminative features are derived from URL details and "Whois" webpage information, which helps improve detection performance with less latency and low computational complexity. The influence of feature variation on the detection results of parametric (neural network) and non-parametric classifiers is assessed to narrow down the most prominent features to be adopted in the best model for multi-class URL identification. The study reveals that non-parametric ensemble models such as LightGBM, XGBoost, and Random Forest performed well, with a detection accuracy of over 95%, which facilitated building a real-time detection system that differentiates multiple attack types (such as malware, phishing, and spam). The second tier focuses on validation against a global database to determine whether an entered URL has already been reported as suspicious by various detection engines. If not, it enables the user to update the global database with details of URLs that are new and not yet reported. Finally, the two modules are integrated into a web application built with Streamlit that provides full system protection against malicious URLs.
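The abstract mentions discriminative features derived from URL details. As a rough illustration only, the sketch below computes a few common lexical features from a URL string using the Python standard library. The function name `extract_url_features` and every feature in it are hypothetical choices made for this example; the abstract does not list the paper's actual feature set, and the "Whois" lookups are not covered here.

```python
import re
from urllib.parse import urlparse


def extract_url_features(url: str) -> dict:
    """Compute a few illustrative lexical features from a URL.

    These features are assumptions for demonstration purposes; they are
    not taken from the paper.
    """
    # urlparse only fills in the host when a scheme is present,
    # so prepend a default scheme for bare inputs like "example.com/x".
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_digits": sum(ch.isdigit() for ch in url),
        # Host written as a dotted IPv4 address instead of a domain name.
        "has_ip_host": bool(re.fullmatch(r"(\d{1,3}\.){3}\d{1,3}", host)),
        "uses_https": parsed.scheme == "https",
        # Number of non-empty path segments.
        "path_depth": len([seg for seg in parsed.path.split("/") if seg]),
    }
```

A dictionary like this could be turned into one row of a feature matrix and passed to a multi-class classifier such as the ensemble models named in the abstract; that step is left out here because the abstract gives no details on training data or hyperparameters.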
Pages: 455-475
Number of pages: 21