Healthcare insurance fraud detection using data mining

被引:3
|
作者
Hamid, Zain [1 ]
Khalique, Fatima [1 ]
Mahmood, Saba [1 ]
Daud, Ali [2 ]
Bukhari, Amal [3 ]
Alshemaimri, Bader [4 ]
机构
[1] Bahria Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Rabdan Acad, Fac Resilience, Abu Dhabi, U Arab Emirates
[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Informat Syst & Technol, Jeddah, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Software Engn Dept, Riyadh, Saudi Arabia
关键词
Unsupervised learning; Healthcare insurance; Healthcare insurance frauds; Association rules mining techniques; INTRUSION DETECTION; ANOMALY DETECTION; OCSVM;
D O I
10.1186/s12911-024-02512-4
中图分类号
R-058 [];
学科分类号
摘要
Background Healthcare programs and insurance initiatives play a crucial role in ensuring that people have access to medical care. There are many benefits of healthcare insurance programs but fraud in healthcare continues to be a significant challenge in the insurance industry. Healthcare insurance fraud detection faces challenges from evolving and sophisticated fraud schemes that adapt to detection methods. Analyzing extensive healthcare data is hindered by complexity, data quality issues, and the need for real-time detection, while privacy concerns and false positives pose additional hurdles. The lack of standardization in coding and limited resources further complicate efforts to address fraudulent activities effectively.Methodolgy In this study, a fraud detection methodology is presented that utilizes association rule mining augmented with unsupervised learning techniques to detect healthcare insurance fraud. Dataset from the Centres for Medicare and Medicaid Services (CMS) 2008-2010 DE-SynPUF is used for analysis. The proposed methodology works in two stages. First, association rule mining is used to extract frequent rules from the transactions based on patient, service and service provider features. Second, the extracted rules are passed to unsupervised classifiers, such as IF, CBLOF, ECOD, and OCSVM, to identify fraudulent activity.Results Descriptive analysis shows patterns and trends in the data revealing interesting relationship among diagnosis codes, procedure codes and the physicians. The baseline anomaly detection algorithms generated results in 902.24 seconds. Another experiment retrieved frequent rules using association rule mining with apriori algorithm combined with unsupervised techniques in 868.18 seconds. The silhouette scoring method calculated the efficacy of four different anomaly detection techniques showing CBLOF with highest score of 0.114 followed by isolation forest with the score of 0.103. The ECOD and OCSVM techniques have lower scores of 0.063 and 0.060, respectively.Conclusion The proposed methodology enhances healthcare insurance fraud detection by using association rule mining for pattern discovery and unsupervised classifiers for effective anomaly detection.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Fraud Detection in Health Insurance using Data Mining Techniques
    Rawte, Vipula
    Anuradha, G.
    2015 International Conference on Communication, Information & Computing Technology (ICCICT), 2015,
  • [2] A fraud detection approach with data mining in health insurance
    Kirlidog, Melih
    Asuk, Cuneyt
    WORLD CONFERENCE ON BUSINESS, ECONOMICS AND MANAGEMENT (BEM-2012), 2012, 62 : 989 - 994
  • [3] Automobile insurance fraud detection using data mining: A systematic literature review
    Schrijver, Gilian
    Sarmah, Dipti K.
    El-hajj, Mohammed
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 21
  • [4] Fraud Detection and Frequent Pattern Matching in Insurance claims using Data Mining Techniques
    Verma, Aayushi
    Taneja, Anu
    Arora, Anuja
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 84 - 90
  • [5] Fraud Detection in Healthcare Insurance Claims Using Machine Learning
    Nabrawi, Eman
    Alanazi, Abdullah
    RISKS, 2023, 11 (09)
  • [6] Using a Data Mining Approach to Detect Automobile Insurance Fraud
    Salmi, Mabrouka
    Atif, Dalia
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2021), 2022, 417 : 55 - 66
  • [7] Use of Data Mining Techniques for Data Balancing and Fraud Detection in Automobile Insurance Claims
    Padhi, Slokashree
    Panigrahi, Suvasini
    INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 221 - 230
  • [8] Modeling Insurance Fraud Detection Using Imbalanced Data Classification
    Hassan, Amira Kamil Ibrahim
    Abraham, Ajith
    ADVANCES IN NATURE AND BIOLOGICALLY INSPIRED COMPUTING, 2016, 419 : 117 - 127
  • [9] FRAUD DETECTION USING OUTLIER PREDICTOR IN HEALTH INSURANCE DATA
    Anbarasi, M. S.
    Dhivya, S.
    2017 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2017,
  • [10] Data Mining Techniques in Fraud Detection
    Bhowmik, Rekha
    JOURNAL OF DIGITAL FORENSICS SECURITY AND LAW, 2008, 3 (02) : 35 - 54