Building a Multi-class Prediction App for Malicious URLs

Cited by: 0
Authors
Sundaram, Vijayaraj [1]
Abhi, Shinu [1]
Agarwal, Rashmi [1]
Affiliations
[1] REVA Univ, REVA Acad Corp Excellence, Bangalore, Karnataka, India
Keywords
Multiclass classification; Malicious URLs; Ensemble learning; Nonparametric models; Prediction
DOI
10.1007/978-3-031-28183-9_32
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A malicious host URL is a page that houses a malicious snippet that could misuse a user's computing resources, steal confidential data, or carry out other forms of attack. Such URLs are distributed across the World Wide Web under various usage categories such as spam, malware, and phishing. Although numerous methods for identifying malicious URLs have been developed in recent years, cyberattacks continue to occur. This study contributes towards implementing three tiers of a system for detecting and protecting against harmful URLs. The first tier focuses on evaluating the performance of discriminative features in model creation. Discriminative features are derived from URL details and "Whois" webpage information, which helps improve detection performance with low latency and low computational complexity. The influence of feature variation on the detection results of parametric (neural network) and non-parametric classifiers is assessed to narrow down the most prominent features to adopt in the best model for identifying URLs with multi-categorization. The study reveals that non-parametric ensemble models such as LightGBM, XGBoost, and Random Forest performed well, with a detection accuracy of over 95%, which facilitated building a real-time detection system that differentiates multiple attack types (such as malware, phishing, and spam). The second tier focuses on validation against a global database to check whether the entered URL has already been reported as suspicious by various detection engines. If not, it enables the user to update the global database with details of URLs that are new and not yet reported. Finally, the two modules are integrated into a web application built with Streamlit that provides full system protection against malicious URLs.
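The first tier described in the abstract derives discriminative features from URL details. The abstract does not list the features themselves, so the following is only a minimal sketch of what lexical URL feature extraction might look like; the function name `lexical_features` and every feature in it are illustrative assumptions, not the authors' feature set.

```python
import re
from urllib.parse import urlparse


def lexical_features(url: str) -> dict:
    """Compute a few simple lexical features from a URL string.

    Hypothetical feature set for illustration only; the paper's
    actual features (including Whois-derived ones) are not shown here.
    """
    # Add a scheme if missing so urlparse can identify the host part.
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots_in_host": host.count("."),
        "num_digits": sum(ch.isdigit() for ch in url),
        "num_special_chars": len(re.findall(r"[@\-_?=&%]", url)),
        # A raw IPv4 address in place of a domain name.
        "has_ip_host": bool(re.fullmatch(r"(\d{1,3}\.){3}\d{1,3}", host)),
        "uses_https": parsed.scheme == "https",
        # Number of non-empty path segments.
        "path_depth": len([seg for seg in parsed.path.split("/") if seg]),
    }


if __name__ == "__main__":
    print(lexical_features("http://192.168.0.1/login/update?id=1"))
```

A feature dictionary like this could be turned into a numeric vector and passed to a multi-class classifier such as the ensemble models named in the abstract.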
Pages: 455-475
Page count: 21