Fraud-BERT: transformer based context aware online recruitment fraud detection

被引:0
|
作者
Taneja, Khushboo [1 ]
Vashishtha, Jyoti [1 ]
Ratnoo, Saroj [1 ]
机构
[1] Guru Jambheshwar Univ Sci & Technol, Dept Comp Sci & Engn, Hisar 125001, India
关键词
Fake job detection; Text classification; Transformer; Large language model; Transfer learning;
D O I
10.1007/s10791-025-09502-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online recruitment facilitates the automatic hiring process for recruiters and provides convenience to job seekers via online job platforms. Parallelly, it has given rise to malicious use of such platforms by fraudsters who post fake jobs and steal money and personal information from innocent job seekers. It is difficult to detect fake jobs manually, as these are meticulously crafted to mimic legitimate ones. Previously, various machine learning approaches have employed Bag-of-Words (BoW) and Term Frequency-Inverse Document Frequency (TF-IDF) feature extraction methods for this task. However, these methods are non-contextual and show skewness in results due to imbalances in data distribution. This paper presents Fraud-BERT, a transformer-based contextual framework leveraging Bidirectional Encoder Representations from Transformers (BERT) via transfer learning approach and evaluates it on a highly imbalanced fake job dataset, which is popularly named as Employment Scam Aegean Dataset (EMSCAD). The dataset is available on Kaggle website. The superiority of the proposed method is demonstrated by a comparative analysis with conventional methods. The results of the study conclude that the proposed method is more robust in tackling imbalanced data and, it has significantly out-performed existing state-of-the-art studies with F1 score of 0.93 and 99% accuracy.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Online Credit Payment Fraud Detection via Structure-Aware Hierarchical Recurrent Neural Network
    Lin, Wangli
    Sun, Li
    Zhong, Qiwei
    Liu, Can
    Feng, Jinghua
    Ao, Xiang
    Yang, Hao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3670 - 3676
  • [42] Context-Aware Sarcasm Detection Using BERT
    Baruah, Arup
    Das, Kaushik Amar
    Barbhuiya, Ferdous Ahmed
    Dey, Kuntal
    FIGURATIVE LANGUAGE PROCESSING, 2020, : 83 - 87
  • [43] Financial Statement Fraud Detection Using Published Data Based on Fraud Triangle Theory
    Parlindungan, Ricardo
    Africano, Fernando
    Elizabeth, P. Sri Megawati
    ADVANCED SCIENCE LETTERS, 2017, 23 (08) : 7054 - 7058
  • [44] An online fuzzy fraud detection framework for credit card transactions
    Charizanos, Georgios
    Demirhan, Haydar
    Icen, Duygu
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [45] Invariant Diversity as a Proactive Fraud Detection Mechanism for Online Merchants
    Laurens, Roy
    Jusak, Jusak
    Zou, Cliff C.
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [46] A Novel GNN Model for Fraud Detection in Online Trading Activities
    Long, Jing
    Fang, Fei
    Luo, Haibo
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT II, 2022, 13156 : 603 - 614
  • [47] The detection and management of attempted fraud during an online randomised trial
    Thomas A. Willis
    Alexandra Wright-Hughes
    Clare Skinner
    Amanda J. Farrin
    Suzanne Hartley
    Rebecca Walwyn
    Ana Weller
    Mohamed Althaf
    Stephanie Wilson
    Chris P. Gale
    Robbie Foy
    Trials, 24
  • [48] An Intelligent Arabic Model for Recruitment Fraud Detection Using Machine Learning
    Sofy, Mohamed A.
    Khafagy, Mohammed H.
    Badry, Rasha M.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (01) : 102 - 111
  • [49] Click fraud detection for online advertising using machine learning
    Aljabri, Malak
    Mohammad, Rami Mustafa A.
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 341 - 350
  • [50] User Behavior Pre-training for Online Fraud Detection
    Liu, Can
    Gao, Yuncong
    Sun, Li
    Feng, Jinghua
    Yang, Hao
    Ao, Xiang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3357 - 3365