Insider Threat Detection Using Supervised Machine Learning Algorithms on an Extremely Imbalanced Dataset

被引:21
|
作者
Sheykhkanloo, Naghmeh Moradpoor [1 ,2 ,3 ,4 ]
Hall, Adam [5 ]
机构
[1] Edinburgh Napier Univ, Sch Comp, Cybersecur & Networks, Edinburgh, Midlothian, Scotland
[2] Edinburgh Napier Univ, Sch Comp, MSc Adv Secur & Cybercrime, Edinburgh, Midlothian, Scotland
[3] Edinburgh Napier Univ, Sch Comp, Ctr Distributed Comp Networking & Cybersecur, Edinburgh, Midlothian, Scotland
[4] Edinburgh Napier Univ, Sch Comp, Cyber Acad, Edinburgh, Midlothian, Scotland
[5] Edinburgh Napier Univ, Edinburgh, Midlothian, Scotland
关键词
Data Pre-Processing; Imbalanced Dataset; Insider Threat; Spread Subsample; Supervised Machine Learning;
D O I
10.4018/IJCWT.2020040101
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
An insider threat can take on many forms and fall under different categories. This includes malicious insider, careless/unaware/uneducated/naive employee, and the third-party contractor. Machine learning techniques have been studied in published literature as a promising solution for such threats. However, they can be biased and/or inaccurate when the associated dataset is hugely imbalanced. Therefore, this article addresses the insider threat detection on an extremely imbalanced dataset which includes employing a popular balancing technique known as spread subsample. The results show that although balancing the dataset using this technique did not improve performance metrics, it did improve the time taken to build the model and the time taken to test the model. Additionally, the authors realised that running the chosen classifiers with parameters other than the default ones has an impact on both balanced and imbalanced scenarios, but the impact is significantly stronger when using the imbalanced dataset.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [21] Target detection using supervised machine learning algorithms for GPR data
    Smitha, N.
    Singh, Vipula
    SENSING AND IMAGING, 2020, 21 (01):
  • [22] Target detection using supervised machine learning algorithms for GPR data
    N. Smitha
    Vipula Singh
    Sensing and Imaging, 2020, 21
  • [23] Fall Detection Using Supervised Machine Learning Algorithms: A Comparative Study
    Zerrouki, Nabil
    Harrou, Fouzi
    Houacine, Amrane
    Sun, Ying
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 665 - 670
  • [24] Insider Threat Detection Based on NLP Word Embedding and Machine Learning
    Haq, Mohd Anul
    Khan, Mohd Abdul Rahim
    Alshehri, Mohammed
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (01): : 619 - 635
  • [25] Research Opportunity of Insider Threat Detection based on Machine Learning Methods
    Prajitno, Noer Tjahja Moekthi
    Hadiyanto, H.
    Rochim, Adian Fatchur
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 292 - 296
  • [26] Deep Temporal Graph Infomax for Imbalanced Insider Threat Detection
    Gao, Peng
    Zhang, Haotian
    Wang, Ming
    Yang, Weiyong
    Wei, Xinshen
    Lv, Zhuo
    Ma, Zengzhou
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2025, 65 (01) : 108 - 118
  • [27] Use of Machine Learning in Big Data Analytics for Insider Threat Detection
    Mayhew, Michael
    Atighetchi, Michael
    Adler, Aaron
    Greenstadt, Rachel
    2015 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2015), 2015, : 915 - 922
  • [28] Performance Assessment Using Supervised Machine Learning Algorithms of Opinion Mining on Social Media Dataset
    Susmitha, M.
    Pranitha, R. Laxmi
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 419 - 427
  • [29] Imbalanced Seismic Event Discrimination Using Supervised Machine Learning
    Ahn, Hyeongki
    Kim, Sangkyeum
    Lee, Kyunghyun
    Choi, Ahyeong
    You, Kwanho
    SENSORS, 2022, 22 (06)
  • [30] Contrastive Learning for Insider Threat Detection
    Vinay, M. S.
    Yuan, Shuhan
    Wu, Xintao
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 395 - 403