Identification of Phishing URLs Using Machine Learning Models

被引:0
|
作者
Vivek, Meghashyam [1 ]
Premjith, Nithin [1 ]
Johnson, Aaron Antonio [1 ]
Maurya, Ashutosh Kumar [1 ]
Jingle, I. Diana Jeba [1 ]
机构
[1] Christ, Bangalore, Karnataka, India
关键词
XGBoost; Phishing; Prediction; Machine learning; Classifier;
D O I
10.1007/978-981-99-9043-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we provide a machine learning-based method for identifying phishing URLs. Sixteen features, including Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label, were extracted from the 600,000 URLs we gathered as a dataset of legitimate and phishing URLs. We then used this dataset to train a variety of machine learning models. These included standalone models such Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models like Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU) and Convolutional Neural Networks (CNN). On evaluation of performance metrics like accuracy, precision, recall, train time and prediction time it was found that XGBoost provides the best performance across all categories.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 50 条
  • [31] Detecting Phishing Domains Using Machine Learning
    Alnemari, Shouq
    Alshammari, Majid
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [32] Detecting Phishing Website Using Machine Learning
    Alkawaz, Mohammed Hazim
    Steven, Stephanie Joanne
    Hajamydeen, Asif Iqbal
    2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), 2020, : 111 - 114
  • [33] Prediction of phishing websites using machine learning
    Mithilesh Kumar Pandey
    Munindra Kumar Singh
    Saurabh Pal
    B. B. Tiwari
    Spatial Information Research, 2023, 31 : 157 - 166
  • [34] Addressing Phishing Threats Using A Metaheuristic Perspective On Machine Learning Classification Models Code
    Hu, Bo
    Zhang, Sainan
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2025, 28 (07): : 1503 - 1514
  • [35] An ensemble learning approach for detecting phishing URLs in encrypted TLS traffic
    Kondaiah, Cheemaladinne
    Pais, Alwyn Roshan
    Rao, Routhu Srinivasa
    TELECOMMUNICATION SYSTEMS, 2024, 87 (04) : 1015 - 1031
  • [36] A Novel Algorithm to Detect Phishing URLs
    Hawanna, Varsharani Ramdas
    Kulkarni, V. Y.
    Rane, R. A.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 548 - 552
  • [37] Detecting Phishing Sites Using URLs Collected from Emails
    Wang, Chuan-Sheng
    Hsu, Fu-Hau
    Chen, Shih-Jen
    Hwang, Yan-Ling
    Wu, Min-Hao
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 916 - +
  • [38] Detection of Phishing Websites by Investigating Their URLs using LSTM Algorithm
    Alanzi, Barah Mohammed
    Uliyan, Diaa Mohammed
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 419 - 428
  • [39] Phishing URL Detection Using Machine Learning and Deep Learning
    Ferdaws, Rawshon
    Majd, Nahid Ebrahimi
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0485 - 0490
  • [40] Machine learning models for phishing detection from TLS traffic
    Munish Kumar
    Cheemaladinne Kondaiah
    Alwyn Roshan Pais
    Routhu Srinivasa Rao
    Cluster Computing, 2023, 26 : 3263 - 3277