Building a Multi-class Prediction App for Malicious URLs

Cited by: 0
Authors
Sundaram, Vijayaraj [1]
Abhi, Shinu [1]
Agarwal, Rashmi [1]
Affiliations
[1] REVA Univ, REVA Acad Corp Excellence, Bangalore, Karnataka, India
Keywords
Multiclass classification; Malicious URLs; Ensemble learning; Nonparametric models; Prediction;
DOI
10.1007/978-3-031-28183-9_32
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A malicious host URL is the address of a page that houses a malicious snippet able to misuse a user's computing resources, steal confidential data, or carry out other forms of attack. Such URLs are distributed across the World Wide Web under various usage categories such as spam, malware, and phishing. Although numerous methods for identifying malicious URLs have been developed in recent years, cyberattacks continue to occur. This study contributes towards implementing three tiers of a system for detecting and protecting against harmful URLs. The first tier evaluates the performance of discriminative features in model creation. These features are derived from URL details and "Whois" webpage information, which helps improve detection performance with low latency and low computational complexity. The influence of feature variation on the detection results of parametric (neural network) and non-parametric classifiers is assessed to narrow down the most prominent features to adopt in the best model for identifying URLs across multiple categories. The study reveals that non-parametric ensemble models such as LightGBM, XGBoost, and Random Forest performed well, with a detection accuracy of over 95%, which facilitated building a real-time detection system that differentiates multiple attack types (such as malware, phishing, and spam). The second tier validates the entered URL against a global database to check whether various detection engines have already reported it as suspicious. If not, it enables the user to update the global database with the details of the new, not-yet-reported URL. Finally, the two modules are integrated into a web application built with Streamlit that provides full system protection against malicious URLs.
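As an illustration of what "discriminative features derived from URL details" might look like, the following is a minimal sketch in Python using only the standard library. The function name `lexical_url_features` and the specific features chosen are assumptions for illustration; the abstract does not list the features the authors actually used, and the "Whois" features are not covered here.

```python
import re
from urllib.parse import urlparse


def lexical_url_features(url: str) -> dict:
    """Compute a few illustrative (hypothetical) lexical features of a URL."""
    # Prepend a scheme if missing so that urlparse can locate the host.
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots_in_host": host.count("."),
        "num_digits": sum(ch.isdigit() for ch in url),
        # Characters often inspected in URL-based detection (assumed set).
        "num_special": len(re.findall(r"[@\-_?=&%]", url)),
        # True when the host looks like a dotted-quad IPv4 address.
        "has_ip_host": bool(re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host)),
        "uses_https": parsed.scheme == "https",
        # Number of non-empty path segments.
        "path_depth": len([seg for seg in parsed.path.split("/") if seg]),
    }


features = lexical_url_features("http://192.168.0.1/login/update?id=5")
print(features)
```

A dictionary like this could be turned into one row of a feature table and passed to any multi-class classifier; which features carry the most signal is exactly what the study's first tier sets out to evaluate.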
Pages: 455-475
Number of pages: 21