Fraud Detection in Healthcare Insurance Claims Using Machine Learning

被引:7
|
作者
Nabrawi, Eman [1 ,2 ]
Alanazi, Abdullah [1 ,2 ]
机构
[1] King Saud Ibn Abdulaziz Univ Hlth Sci, Hlth Informat Dept, POB 3660, Riyadh 11481, Saudi Arabia
[2] King Abdullah Int Med Res Ctr, Riyadh 14611, Saudi Arabia
关键词
fraud; insurance claims; artificial neural networks (ANN); logistic regression (LR); random forest (RF); Saudi Arabia;
D O I
10.3390/risks11090160
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Healthcare fraud is intentionally submitting false claims or producing misinterpretation of facts to obtain entitlement payments. Thus, it wastes healthcare financial resources and increases healthcare costs. Subsequently, fraud poses a substantial financial challenge. Therefore, supervised machine and deep learning analytics such as random forest, logistic regression, and artificial neural networks are successfully used to detect healthcare insurance fraud. This study aims to develop a health model that automatically detects fraud from health insurance claims in Saudi Arabia. The model indicates the greatest contributing factor to fraud with optimal accuracy. The labeled imbalanced dataset used three supervised deep and machine learning methods. The dataset was obtained from three healthcare providers in Saudi Arabia. The applied models were random forest, logistic regression, and artificial neural networks. The SMOT technique was used to balance the dataset. Boruta object feature selection was applied to exclude insignificant features. Validation metrics were accuracy, precision, recall, specificity, F1 score, and area under the curve (AUC). Random forest classifiers indicated policy type, education, and age as the most significant features with an accuracy of 98.21%, 98.08% precision, 100% recall, an F1 score of 99.03%, specificity of 80%, and an AUC of 90.00%. Logistic regression resulted in an accuracy of 80.36%, 97.62% precision, 80.39% recall, an F1 score of 88.17%, specificity of 80%, and an AUC of 80.20%. ANN revealed an accuracy of 94.64%, 98.00% precision, 96.08% recall, an F1 score of 97.03%, a specificity of 80%, and an AUC of 88.04%. This predictive analytics study applied three successful models, each of which yielded acceptable accuracy and validation metrics; however, further research on a larger dataset is advised.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Fraud Claims Detection in Insurance Using Machine Learning
    Kalra, Hritik
    Singh, Ranvir
    Kumar, T. Senthil
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 327 - 331
  • [2] Fraud detection in healthcare claims using machine learning: A systematic review
    du Preez, Anli
    Bhattacharya, Sanmitra
    Beling, Peter
    Bowen, Edward
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2025, 160
  • [3] DETECTING INSURANCE CLAIMS FRAUD USING MACHINE LEARNING TECHNIQUES
    Roy, Riya
    George, Thomas K.
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [4] Healthcare Fraud Detection using Machine Learning
    Prova, Nuzhat Noor Islam
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 1119 - 1123
  • [5] Machine Learning-Based Fraud Detection System for Insurance Claims in IoT Environment
    Sharan, Bediga
    Hassan, Mohammad
    Vani, V. Divya
    Raj, Vijilius Helena
    Nijhawan, Ginni
    Pawar, Priyanka Prabhakar
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [6] Prediction of Insurance Fraud Detection using Machine Learning Algorithms
    Rukhsar, Laiqa
    Bangyal, Waqas Haider
    Nisar, Kashif
    Nisar, Sana
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2022, 41 (01) : 33 - 40
  • [7] Fraud risk assessment in car insurance using claims graph features in machine learning
    Vorobyev, Ivan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [8] MACHINE LEARNING ALGORITHMS FOR AUTO INSURANCE FRAUD DETECTION
    Badal Valero, Elena
    Sanjuan Diaz, Andres
    Segura Gisbert, Jorge
    ANALES DEL INSTITUTO DE ACTUARIOS ESPANOLES, 2020, (26): : 23 - 46
  • [9] An interactive machine-learning-based electronic fraud and abuse detection system in healthcare insurance
    Kose, Ilker
    Gokturk, Mehmet
    Kilic, Kemal
    APPLIED SOFT COMPUTING, 2015, 36 : 283 - 299
  • [10] Healthcare insurance fraud detection using data mining
    Hamid, Zain
    Khalique, Fatima
    Mahmood, Saba
    Daud, Ali
    Bukhari, Amal
    Alshemaimri, Bader
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)