Identification of Phishing URLs Using Machine Learning Models

被引:0
|
作者
Vivek, Meghashyam [1 ]
Premjith, Nithin [1 ]
Johnson, Aaron Antonio [1 ]
Maurya, Ashutosh Kumar [1 ]
Jingle, I. Diana Jeba [1 ]
机构
[1] Christ, Bangalore, Karnataka, India
关键词
XGBoost; Phishing; Prediction; Machine learning; Classifier;
D O I
10.1007/978-981-99-9043-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we provide a machine learning-based method for identifying phishing URLs. Sixteen features, including Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label, were extracted from the 600,000 URLs we gathered as a dataset of legitimate and phishing URLs. We then used this dataset to train a variety of machine learning models. These included standalone models such Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models like Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU) and Convolutional Neural Networks (CNN). On evaluation of performance metrics like accuracy, precision, recall, train time and prediction time it was found that XGBoost provides the best performance across all categories.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 50 条
  • [21] Classifying Phishing URLs Using Recurrent Neural Networks
    Correa Bahnsen, Alejandro
    Contreras Bohorquez, Eduardo
    Villegas, Sergio
    Vargas, Javier
    Gonzalez, Fabio A.
    PROCEEDINGS OF THE 2017 APWG SYMPOSIUM ON ELECTRONIC CRIME RESEARCH (ECRIME), 2017, : 1 - 8
  • [22] Prediction of phishing websites using machine learning
    Pandey, Mithilesh Kumar
    Singh, Munindra Kumar
    Pal, Saurabh
    Tiwari, B. B.
    SPATIAL INFORMATION RESEARCH, 2023, 31 (02) : 157 - 166
  • [23] Phishing attack detection using Machine Learning
    Pandiyan S S.
    Selvaraj P.
    Burugari V.K.
    Benadit P J.
    P K.
    Measurement: Sensors, 2022, 24
  • [24] Detecting Phishing Websites Using Machine Learning
    Alswailem, Amani
    Alabdullah, Bashayr
    Alrumayh, Norah
    Alsedrani, Aram
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
  • [25] Phishing and Smishing Detection Using Machine Learning
    El Karhani, Hadi
    Al Jamal, Riad
    Samra, Yorgo Bou
    Elhajj, Imad H.
    Kayssi, Ayman
    2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 206 - 211
  • [26] Predicting Phishing Vulnerabilities Using Machine Learning
    Rutherford, Sarah
    Lin, Kevin
    Blaine, Raymond W.
    SOUTHEASTCON 2022, 2022, : 779 - 786
  • [27] Detection of Phishing Websites Using Machine Learning
    Abbas, Ahmed Raad
    Singh, Sukhvir
    Kau, Mandeep
    INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019, 2020, 89 : 1307 - 1314
  • [28] Phishing Websites Detection using Machine Learning
    Kulkarni, Arun
    Brown, Leonard L., III
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (07) : 8 - 13
  • [29] Detection of phishing websites using machine learning
    Razaque, Abdul
    Frej, Mohamed Ben Haj
    Sabyrov, Dauren
    Shaikhyn, Aidana
    Amsaad, Fathi
    Oun, Ahmed
    Proceedings - 2020 IEEE Cloud Summit, Cloud Summit 2020, 2020, : 103 - 107
  • [30] Detection of Phishing Websites using Machine Learning
    Razaque, Abdul
    Frej, Mohamed Ben Haj
    Sabyrov, Dauren
    Shaikhyn, Aidana
    Amsaad, Fathi
    Oun, Ahmed
    2020 IEEE CLOUD SUMMIT, 2020, : 103 - 107