Federated learning model for credit card fraud detection with data balancing techniques

被引：0

作者：

Mustafa Abdul Salam

Khaled M. Fouad

Doaa L. Elbably

Salah M. Elsayed

机构：

[1] Benha University,Faculty of Computers and Artificial Intelligence

[2] Arab Open University,Faculty of Computer Studies

[3] New Mansoura University,Faculty of Computer Science and Engineering

[4] ElShorouk,Higher Institute for Computers & Information Technology

来源：

Neural Computing and Applications | 2024年 / 36卷

关键词：

Credit card fraud detection (CCFD); Federated learning; Data privacy; Class imbalance; Undersampling; Oversampling;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In recent years, credit card transaction fraud has resulted in massive losses for both consumers and banks. Subsequently, both cardholders and banks need a strong fraud detection system to reduce cardholder losses. Credit card fraud detection (CCFD) is an important method of fraud prevention. However, there are many challenges in developing an ideal fraud detection system for banks. First off, due to data security and privacy concerns, various banks and other financial institutions are typically not permitted to exchange their transaction datasets. These issues make traditional systems find it difficult to learn and detect fraud depictions. Therefore, this paper proposes federated learning for CCFD over different frameworks (TensorFlow federated, PyTorch). Second, there is a significant imbalance in credit card transactions across all banks, with a small percentage of fraudulent transactions outweighing the majority of valid ones. In order to demonstrate the urgent need for a comprehensive investigation of class imbalance management techniques to develop a powerful model to identify fraudulent transactions, the dataset must be balanced. In order to address the issue of class imbalance, this study also seeks to give a comparative analysis of several individual and hybrid resampling techniques. In several experimental studies, the effectiveness of various resampling techniques in combination with classification approaches has been compared. In this study, it is found that the hybrid resampling methods perform well for machine learning classification models compared to deep learning classification models. The experimental results show that the best accuracy for the Random Forest (RF); Logistic Regression; K-Nearest Neighbors (KNN); Decision Tree (DT), and Gaussian Naive Bayes (NB) classifiers are 99,99%; 94,61%; 99.96%; 99,98%, and 91,47%, respectively. The comparative results show that the RF outperforms with high performance parameters (accuracy, recall, precision and f score) better than NB; RF; DT and KNN. RF achieve the minimum loss values with all resampling techniques, and the results, when utilizing the proposed models on the entire skewed dataset, achieved preferable outcomes to the unbalanced dataset. Furthermore, the PyTorch framework achieves higher prediction accuracy for the federated learning model than the TensorFlow federated framework but with more computational time.

引用

页码：6231 / 6256

页数：25

共 50 条

[41] Improving the Data Quality for Credit Card Fraud Detection
Jing, Rongrong
Tian, Hu
Li, Yidi
Zhang, Xingwei
Zheng, Xiaolong
Zhang, Zhu
Zeng, Daniel
2020 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2020, : 175 - 180
[42] Exploratory Data Analysis for Credit Card Fraud Detection
Kirar, Jyoti Singh
Kumar, Dhiraj
Chatterjee, Diptirtha
Patel, Prasoon Singh
Yadav, Shailendra Nath
2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 157 - 161
[43] Machine Learning Model for Credit Card Fraud Detection- A Comparative Analysis
Sharma, Pratyush
Banerjee, Souradeep
Tiwari, Devyanshi
Patni, Jagdish Chandra
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (06) : 789 - 796
[44] Developing a Credit Card Fraud Detection Model using Machine Learning Approaches
Khan, Shahnawaz
Mishra, Bharavi
Alourani, Abdullah
Ali, Ashraf
Kamal, Mustafa
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 411 - 418
[45] Credit Card Fraud Detection Model-based Machine Learning Algorithms
Idrees, Amira M.
Elhusseny, Nermin Samy
Ouf, Shimaa
IAENG International Journal of Computer Science, 2024, 51 (10) : 1649 - 1662
[46] Credit Card Fraud Detection: Personalized or Aggregated Model
Alowais, Mohammed Ibrahim
Soon, Lay-Ki
2012 THIRD FTRA INTERNATIONAL CONFERENCE ON MOBILE, UBIQUITOUS, AND INTELLIGENT COMPUTING (MUSIC), 2012, : 114 - 119
[47] FUZZGY: A Hybrid Model for Credit Card Fraud Detection
HaratiNik, Mohammad Reza
Akrami, Mahdi
Khadivi, Shahram
Shajari, Mahdi
2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 1088 - 1093
[48] Fraud Feature Boosting Mechanism and Spiral Oversampling Balancing Technique for Credit Card Fraud Detection
Ni, Lina
Li, Jufeng
Xu, Huixin
Wang, Xiangbo
Zhang, Jinquan
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1615 - 1630
[49] Federated Learning-Based Credit Card Fraud Detection: Performance Analysis with Sampling Methods and Deep Learning Algorithms
Aurna, Nahid Ferdous
Hossain, Md Delwar
Taenaka, Yuzo
Kadobayashi, Youki
2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 180 - 186
[50] A Survey on GAN Techniques for Data Augmentation to Address the Imbalanced Data Issues in Credit Card Fraud Detection
Strelcenia, Emilija
Prakoonwit, Simant
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 304 - 329

← 1 2 3 4 5 →