Fraud Detection in Healthcare Insurance Claims Using Machine Learning

被引:7
|
作者
Nabrawi, Eman [1 ,2 ]
Alanazi, Abdullah [1 ,2 ]
机构
[1] King Saud Ibn Abdulaziz Univ Hlth Sci, Hlth Informat Dept, POB 3660, Riyadh 11481, Saudi Arabia
[2] King Abdullah Int Med Res Ctr, Riyadh 14611, Saudi Arabia
关键词
fraud; insurance claims; artificial neural networks (ANN); logistic regression (LR); random forest (RF); Saudi Arabia;
D O I
10.3390/risks11090160
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Healthcare fraud is intentionally submitting false claims or producing misinterpretation of facts to obtain entitlement payments. Thus, it wastes healthcare financial resources and increases healthcare costs. Subsequently, fraud poses a substantial financial challenge. Therefore, supervised machine and deep learning analytics such as random forest, logistic regression, and artificial neural networks are successfully used to detect healthcare insurance fraud. This study aims to develop a health model that automatically detects fraud from health insurance claims in Saudi Arabia. The model indicates the greatest contributing factor to fraud with optimal accuracy. The labeled imbalanced dataset used three supervised deep and machine learning methods. The dataset was obtained from three healthcare providers in Saudi Arabia. The applied models were random forest, logistic regression, and artificial neural networks. The SMOT technique was used to balance the dataset. Boruta object feature selection was applied to exclude insignificant features. Validation metrics were accuracy, precision, recall, specificity, F1 score, and area under the curve (AUC). Random forest classifiers indicated policy type, education, and age as the most significant features with an accuracy of 98.21%, 98.08% precision, 100% recall, an F1 score of 99.03%, specificity of 80%, and an AUC of 90.00%. Logistic regression resulted in an accuracy of 80.36%, 97.62% precision, 80.39% recall, an F1 score of 88.17%, specificity of 80%, and an AUC of 80.20%. ANN revealed an accuracy of 94.64%, 98.00% precision, 96.08% recall, an F1 score of 97.03%, a specificity of 80%, and an AUC of 88.04%. This predictive analytics study applied three successful models, each of which yielded acceptable accuracy and validation metrics; however, further research on a larger dataset is advised.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Classification of Machine and Deep learning Techniques for Financial Fraud Detection of Healthcare Industry
    Shah, Harsh
    Pandya, Darsh
    Panchal, Krish
    More, Nilkamal Prashant
    2022 International Conference on Futuristic Technologies, INCOFT 2022, 2022,
  • [32] Machine Learning in Forecasting Motor Insurance Claims
    Poufinas, Thomas
    Gogas, Periklis
    Papadimitriou, Theophilos
    Zaganidis, Emmanouil
    RISKS, 2023, 11 (09)
  • [33] Simulation and Detection of Healthcare Fraud in German Inpatient Claims Data
    Schrupp, Bernhard
    Klede, Kai
    Raab, Rene
    Eskofier, Bjoern
    COMPUTATIONAL SCIENCE, ICCS 2024, PT IV, 2024, 14835 : 239 - 246
  • [34] Insurance fraud detection with unsupervised deep learning
    Gomes, Chamal
    Jin, Zhuo
    Yang, Hailiang
    JOURNAL OF RISK AND INSURANCE, 2021, 88 (03) : 591 - 624
  • [35] Auto Insurance Fraud Detection with Multimodal Learning
    Yang, Jiaxi
    Chen, Kui
    Ding, Kai
    Na, Chongning
    Wang, Meng
    DATA INTELLIGENCE, 2023, 5 (02) : 388 - 412
  • [36] Auto Insurance Fraud Detection with Multimodal Learning
    Jiaxi Yang
    Kui Chen
    Kai Ding
    Chongning Na
    Meng Wang
    Data Intelligence, 2023, 5 (02) : 388 - 412
  • [37] A Survey on Machine Learning Techniques for Insurance Fraud Prediction
    Patil, Komal S.
    Godbole, Anand
    HELIX, 2018, 8 (06): : 4358 - 4363
  • [38] Empirical Oversampling Threshold Strategy for Machine Learning Performance Optimisation in Insurance Fraud Detection
    Itri, Bouzgarne
    Mohamed, Youssfi
    Omar, Bouattane
    Mohamed, Qbadou
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (10) : 432 - 437
  • [39] Digital Verification: An Efficient Fraud Document Detection for Insurance Claims using IoT and ML Approaches
    Kumar, Swamp
    Kishan, R. N. Vamsi
    Devulapalli, Sal Surya Saketh
    Rajan, Lisa Alexander
    Kundu, Ayushi
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 242 - 247
  • [40] Detection of automobile insurance fraud with discrete choice models and misclassified claims
    Artís, M
    Ayuso, M
    Guillén, M
    JOURNAL OF RISK AND INSURANCE, 2002, 69 (03) : 325 - 340