Rotation Forest-Based Logistic Model Tree for Website Phishing Detection

被引:6
|
作者
Balogun, Abdullateef O. [1 ,2 ]
Akande, Noah O. [3 ]
Usman-Hamza, Fatimah E. [1 ]
Adeyemo, Victor E. [4 ]
Mabayoje, Modinat A. [1 ]
Ameen, Ahmed O. [1 ]
机构
[1] Univ Ilorin, Dept Comp Sci, PMB 1515, Ilorin, Nigeria
[2] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Bandar Seri Iskandar 32610, Perak, Malaysia
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Kwara State, Nigeria
[4] Leeds Beckett Univ, Sch Built Environm Engn & Comp, Headingley Campus, Leeds LS6 3QS, W Yorkshire, England
关键词
Cybersecurity; Logistic Model Tree; Machine learning; Phishing attack; Rotation forest; FEATURE-SELECTION; CLASS IMBALANCE; ALGORITHM; ENSEMBLE;
D O I
10.1007/978-3-030-87013-3_12
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The emergence of web and internet technology has led to its use in a broad array of services ranging from financial to educational services. This has led to a spike in the number of cybersecurity problems over the years, the most notable of which is the phishing attack, in which malicious websites imitate legitimate websites to capture gullible users' details needed for unauthorized access. However, current mitigation strategies, such as anti-phishing applications and Machine Learning (ML) methods, have been effective for detecting phishing activities. Hackers, on the other hand, are developing new ways to circumvent these countermeasures. Nevertheless, given the dynamism of phishing attempts, there is a continual demand for innovative and efficient solutions for website phishing detection. This study proposes a Rotation Forest-based Logistic Model Trees (RF-LMT) for website phishing detection. LMT is a technique that combines logistic regression and tree inference into a single model tree. Three datasets of different instance distributions, both balanced and imbalanced, are used to investigate the proposed RF-LMT. From the results, it was observed that LMT performed better than the selected baseline classifiers. This finding revealed that LMT can perform comparably to baseline classifiers. However, in comparison to LMT and experimented baseline classifiers, the proposed RF-LMT method showed superior performance in website phishing detection. Specifically, RF-LMT had a high detection accuracy (98.24%), AUC (0.998), f-measure (0.982) values with a low false-positive rate (0.018). Furthermore, RF-LMT outperformed existing ML-based phishing attack models. As a result, the proposed RF-LMT method is recommended for dealing with complex phishing attacks.
引用
收藏
页码:154 / 169
页数:16
相关论文
共 50 条
  • [41] RANDOM FOREST-BASED ECG PREMATURE VENTRICULAR CONTRACTION DETECTION
    Gao, Qin
    Du, Fanyu
    Xu, Wansong
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2025, 25 (02)
  • [42] Random forest-based robust classification for lithographic hotspot detection
    Dewar, Rohit
    Barai, Samit
    Kumar, Pardeep
    Srinivasan, Babji
    Mohapatra, Nihar R.
    JOURNAL OF MICRO-NANOLITHOGRAPHY MEMS AND MOEMS, 2019, 18 (02):
  • [43] Intelligent Ensemble Learning Approach for Phishing Website Detection Based on Weighted Soft Voting
    Taha, Altyeb
    MATHEMATICS, 2021, 9 (21)
  • [44] Enhancing Phishing Website Detection via Feature Selection in URL-Based Analysis
    Qasim M.A.
    Flayh N.A.
    Informatica (Slovenia), 2023, 47 (09): : 145 - 155
  • [45] Evolutionary Algorithm with Deep Auto Encoder Network Based Website Phishing Detection and Classification
    Alqahtani, Hamed
    Alotaibi, Saud S.
    Alrayes, Fatma S.
    Al-Turaiki, Isra
    Alissa, Khalid A.
    Aziz, Amira Sayed A.
    Maray, Mohammed
    Al Duhayyim, Mesfer
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [46] Towards benchmark datasets for machine learning based website phishing detection: An experimental study
    Hannousse, Abdelhakim
    Yahiouche, Salima
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [47] Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers: A Comparative Study
    Almomani, Ammar
    Alauthman, Mohammad
    Shatnawi, Mohd Taib
    Alweshah, Mohammed
    Alrosan, Ayat
    Alomoush, Waleed
    Gupta, Brij B.
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2022, 18 (01)
  • [48] AN INTEGRATED CORPORATE-PLANNING MODEL FOR FOREST-BASED INDUSTRIES
    HAUSMAN, WH
    SEPEHRI, M
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1985, 13 (01): : 29 - 38
  • [49] Deep forest-based hypertension and OSAHS patient screening model
    Wang P.-P.
    Ma L.
    Lv Y.-H.
    Xiang Y.
    Shao D.-G.
    Xiong X.
    International Journal of Information and Communication Technology, 2020, 16 (02) : 112 - 122
  • [50] Landslide Detection and Susceptibility Modeling on Cameron Highlands (Malaysia): A Comparison between Random Forest, Logistic Regression and Logistic Model Tree Algorithms
    Viet-Ha Nhu
    Mohammadi, Ayub
    Shahabi, Himan
    Bin Ahmad, Baharin
    Al-Ansari, Nadhir
    Shirzadi, Ataollah
    Geertsema, Marten
    Kress, Victoria R.
    Karimzadeh, Sadra
    Kamran, Khalil Valizadeh
    Chen, Wei
    Nguyen, Hoang
    FORESTS, 2020, 11 (08):