Building a Multi-class Prediction App for Malicious URLs

被引:0
|
作者
Sundaram, Vijayaraj [1 ]
Abhi, Shinu [1 ]
Agarwal, Rashmi [1 ]
机构
[1] REVA Univ, REVA Acad Corp Excellence, Bangalore, Karnataka, India
关键词
Multiclass classification; Malicious URLs; Ensemble learning; Nonparametric models; Prediction;
D O I
10.1007/978-3-031-28183-9_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The page that houses a malicious snippet that could misuse a user's computing resources, steal confidential data, or carry out other forms of assaults is known as a malicious host URL. They are generally distributed across the world wide web under various usage categories like spam, malware, phishing, etc. Although numerous methods or fixes (to identify URLs) have been developed in recent years, still cyberattacks continue to occur. This study contributes towards implementing three tiers of the system for detection and protection from harmful URLs. The first tier focuses on evaluating the performance of discriminative features in model creation. Discriminative features are derived fromURL details and "Whois" webpage information that helps in improving detection performance with less latency and low computational complexity. The influence of feature variation on Parametric (neural network) and non-parametric classifier detection results are assessed to narrow down to the most prominent features to be adapted in the best model for the task of identifying URLs with multi-categorization. The study reveals that non-parametric ensemble models like Light GBM, XGBoost, and Random Forest performed well with a detection accuracy of over 95%, which facilitated building a real-time detection system and differentiating multiple attack types (such as Malware, Phishing, and spam). The second tier focuses on validationwith a global database to know, if entered URL is reported as suspicious by various detection engines already. If not, it enables the user in updating the global database with URL details that are new and not reported yet. Finally, the two modules are integrated to create a web application using Streamlit that provides full system protection against malicious URLs.
引用
收藏
页码:455 / 475
页数:21
相关论文
共 50 条
  • [21] The Influence of Multi-class Feature Selection on the Prediction of Diagnostic Phenotypes
    Ludwig Lausser
    Robin Szekely
    Lyn-Rouven Schirra
    Hans A. Kestler
    Neural Processing Letters, 2018, 48 : 863 - 880
  • [22] Back to Basics: An Interpretable Multi-Class Grade Prediction Framework
    Basma Alharbi
    Arabian Journal for Science and Engineering, 2022, 47 : 2171 - 2186
  • [23] Dynamic and Probabilistic Multi-class Prediction of Tunnel Squeezing Intensity
    Chen, Yu
    Li, Tianbin
    Zeng, Peng
    Ma, Junjie
    Patelli, Edoardo
    Edwards, Ben
    ROCK MECHANICS AND ROCK ENGINEERING, 2020, 53 (08) : 3521 - 3542
  • [24] Dynamic and Probabilistic Multi-class Prediction of Tunnel Squeezing Intensity
    Yu Chen
    Tianbin Li
    Peng Zeng
    Junjie Ma
    Edoardo Patelli
    Ben Edwards
    Rock Mechanics and Rock Engineering, 2020, 53 : 3521 - 3542
  • [25] Back to Basics: An Interpretable Multi-Class Grade Prediction Framework
    Alharbi, Basma
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 2171 - 2186
  • [26] Efficient set-valued prediction in multi-class classification
    Thomas Mortier
    Marek Wydmuch
    Krzysztof Dembczyński
    Eyke Hüllermeier
    Willem Waegeman
    Data Mining and Knowledge Discovery, 2021, 35 : 1435 - 1469
  • [27] A prediction method for multi-class systems based on limited data
    Kuznetsov, VA
    Knott, GD
    FOURTEENTH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2001, : 279 - 284
  • [28] Trajectory prediction for multi-class target based on preferred speed
    Ye H.
    Liu M.
    Zheng W.
    Liu H.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2017, 45 (10): : 100 - 104
  • [29] A Comparison of MCC and CEN Error Measures in Multi-Class Prediction
    Jurman, Giuseppe
    Riccadonna, Samantha
    Furlanello, Cesare
    PLOS ONE, 2012, 7 (08):
  • [30] Multi-instance iris remote authentication using private multi-class perceptron on malicious cloud server
    Mahesh Kumar Morampudi
    Sowmya Veldandi
    Munaga V. N. K. Prasad
    U. S. N. Raju
    Applied Intelligence, 2020, 50 : 2848 - 2866