Federated learning model for credit card fraud detection with data balancing techniques

Cited by: 0
Authors
Mustafa Abdul Salam
Khaled M. Fouad
Doaa L. Elbably
Salah M. Elsayed
Affiliations
[1] Benha University, Faculty of Computers and Artificial Intelligence
[2] Arab Open University, Faculty of Computer Studies
[3] New Mansoura University, Faculty of Computer Science and Engineering
[4] ElShorouk, Higher Institute for Computers & Information Technology
Source
Keywords
Credit card fraud detection (CCFD); Federated learning; Data privacy; Class imbalance; Undersampling; Oversampling
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
In recent years, credit card transaction fraud has resulted in massive losses for both consumers and banks. Consequently, both cardholders and banks need a strong fraud detection system to reduce cardholder losses. Credit card fraud detection (CCFD) is an important method of fraud prevention. However, there are many challenges in developing an ideal fraud detection system for banks. First, due to data security and privacy concerns, banks and other financial institutions are typically not permitted to exchange their transaction datasets. This makes it difficult for traditional systems to learn and detect fraud patterns. Therefore, this paper proposes federated learning for CCFD over different frameworks (TensorFlow Federated, PyTorch). Second, credit card transactions are highly imbalanced across all banks: fraudulent transactions make up only a small percentage, and valid transactions form the vast majority. The dataset must therefore be balanced, which creates an urgent need for a comprehensive investigation of class imbalance management techniques in order to develop a powerful model for identifying fraudulent transactions. To address the class imbalance, this study also presents a comparative analysis of several individual and hybrid resampling techniques. Several experimental studies compare the effectiveness of various resampling techniques in combination with classification approaches. The study finds that hybrid resampling methods perform better with machine learning classification models than with deep learning classification models. The experimental results show that the best accuracies for the Random Forest (RF), Logistic Regression (LR), K-Nearest Neighbors (KNN), Decision Tree (DT), and Gaussian Naive Bayes (NB) classifiers are 99.99%, 94.61%, 99.96%, 99.98%, and 91.47%, respectively.
The comparative results show that RF outperforms the other classifiers, including NB, DT, and KNN, on the performance metrics (accuracy, recall, precision, and F-score). RF achieves the minimum loss values with all resampling techniques, and the proposed models achieve better outcomes on the resampled dataset than on the unbalanced dataset. Furthermore, the PyTorch framework achieves higher prediction accuracy for the federated learning model than the TensorFlow Federated framework, but at the cost of more computational time.
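The combination described in the abstract, local class balancing followed by federated training, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain NumPy instead of TensorFlow Federated or PyTorch, synthetic data instead of a real transaction dataset, random oversampling as a stand-in for the resampling techniques compared in the paper, a logistic regression as the local model, and weighted federated averaging as an assumed aggregation rule. All function names and parameters here are hypothetical.

```python
import numpy as np


def make_client_data(rng, n=2000, fraud_rate=0.02):
    """Synthetic stand-in for one bank's transactions (assumption, not real data)."""
    y = (rng.random(n) < fraud_rate).astype(float)   # 1.0 marks a fraudulent row
    X = rng.normal(size=(n, 2)) + 2.0 * y[:, None]   # fraud rows are shifted
    X = np.hstack([X, np.ones((n, 1))])              # constant column acts as bias
    return X, y


def random_oversample(X, y, rng):
    """Duplicate rows of smaller classes at random until all classes match the largest."""
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    parts = []
    for c, n in zip(classes, counts):
        idx = np.flatnonzero(y == c)
        extra = rng.choice(idx, size=target - n, replace=True)
        parts.append(np.concatenate([idx, extra]))
    keep = np.concatenate(parts)
    return X[keep], y[keep]


def local_train(w, X, y, lr=0.1, epochs=50):
    """Logistic regression trained by full-batch gradient descent on one client."""
    w = w.copy()
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))
        w -= lr * X.T @ (p - y) / len(y)
    return w


def fed_avg(client_weights, client_sizes):
    """Average client models, weighting each by its number of training rows."""
    sizes = np.asarray(client_sizes, dtype=float)
    return np.average(np.stack(client_weights), axis=0, weights=sizes)


rng = np.random.default_rng(0)
clients = [make_client_data(rng) for _ in range(3)]  # three hypothetical banks
w_global = np.zeros(3)

for _ in range(5):                                   # communication rounds
    updates, sizes = [], []
    for X, y in clients:
        Xb, yb = random_oversample(X, y, rng)        # balance locally; raw data stays put
        updates.append(local_train(w_global, Xb, yb))
        sizes.append(len(yb))
    w_global = fed_avg(updates, sizes)               # only model weights are shared

# Fraud recall of the global model on the original (unbalanced) client data.
X_all = np.vstack([X for X, _ in clients])
y_all = np.concatenate([y for _, y in clients])
pred = (X_all @ w_global) > 0
recall = (pred & (y_all == 1)).sum() / (y_all == 1).sum()
print("fraud recall:", recall)
```

Only the weight vectors leave each client in this sketch, which mirrors the privacy motivation stated in the abstract. Swapping `random_oversample` for an undersampling or hybrid routine would be one way to compare resampling strategies under the same training loop.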
Pages: 6231-6256
Number of pages: 25
Related papers
50 in total
  • [41] Improving the Data Quality for Credit Card Fraud Detection
    Jing, Rongrong
    Tian, Hu
    Li, Yidi
    Zhang, Xingwei
    Zheng, Xiaolong
    Zhang, Zhu
    Zeng, Daniel
    2020 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2020, : 175 - 180
  • [42] Exploratory Data Analysis for Credit Card Fraud Detection
    Kirar, Jyoti Singh
    Kumar, Dhiraj
    Chatterjee, Diptirtha
    Patel, Prasoon Singh
    Yadav, Shailendra Nath
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 157 - 161
  • [43] Machine Learning Model for Credit Card Fraud Detection- A Comparative Analysis
    Sharma, Pratyush
    Banerjee, Souradeep
    Tiwari, Devyanshi
    Patni, Jagdish Chandra
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (06) : 789 - 796
  • [44] Developing a Credit Card Fraud Detection Model using Machine Learning Approaches
    Khan, Shahnawaz
    Mishra, Bharavi
    Alourani, Abdullah
    Ali, Ashraf
    Kamal, Mustafa
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 411 - 418
  • [45] Credit Card Fraud Detection Model-based Machine Learning Algorithms
    Idrees, Amira M.
    Elhusseny, Nermin Samy
    Ouf, Shimaa
    IAENG International Journal of Computer Science, 2024, 51 (10) : 1649 - 1662
  • [46] Credit Card Fraud Detection: Personalized or Aggregated Model
    Alowais, Mohammed Ibrahim
    Soon, Lay-Ki
    2012 THIRD FTRA INTERNATIONAL CONFERENCE ON MOBILE, UBIQUITOUS, AND INTELLIGENT COMPUTING (MUSIC), 2012, : 114 - 119
  • [47] FUZZGY: A Hybrid Model for Credit Card Fraud Detection
    HaratiNik, Mohammad Reza
    Akrami, Mahdi
    Khadivi, Shahram
    Shajari, Mahdi
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 1088 - 1093
  • [48] Fraud Feature Boosting Mechanism and Spiral Oversampling Balancing Technique for Credit Card Fraud Detection
    Ni, Lina
    Li, Jufeng
    Xu, Huixin
    Wang, Xiangbo
    Zhang, Jinquan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1615 - 1630
  • [49] Federated Learning-Based Credit Card Fraud Detection: Performance Analysis with Sampling Methods and Deep Learning Algorithms
    Aurna, Nahid Ferdous
    Hossain, Md Delwar
    Taenaka, Yuzo
    Kadobayashi, Youki
    2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 180 - 186
  • [50] A Survey on GAN Techniques for Data Augmentation to Address the Imbalanced Data Issues in Credit Card Fraud Detection
    Strelcenia, Emilija
    Prakoonwit, Simant
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01) : 304 - 329