Healthcare Provider Summary Data for Fraud Classification

被引:1
|
作者
Johnson, Justin M. [1 ]
Khoshgoftaar, Taghi M. [1 ]
机构
[1] Florida Atlantic Univ, Coll Engn & Comp Sci, Boca Raton, FL 33431 USA
关键词
Healthcare; Medicare; Medical Providers; Fraud Detection; Big Data; Machine Learning; Feature Engineering;
D O I
10.1109/IRI54793.2022.00060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fraud, waste, and abuse are spreading throughout the healthcare industry and costing patients and taxpayers billions of dollars. Fortunately, electronic medical records and publicly available data sources like the Centers for Medicare & Medicaid Services (CMS) have enabled data mining and machine learning techniques that can help automate the detection of healthcare fraud. In this study, we explore the application of healthcare provider summary data for the purpose of fraud detection. We leverage the latest CMS Part B Summary by Provider big data sets to curate two new labeled data sets for supervised learning. The two new data sets are compared to a popular baseline data set from related works using six runs of cross validation with two popular ensemble learners, multiple complementary performance metrics, and statistical tests. Classification results show that the proposed provider summary features are good indicators of healthcare fraud. A two-way analysis of variance test and 95% confidence intervals show that the new features yield significantly better performance on the fraud detection task when used to enrich existing data sets. Finally, feature contributions are measured with Shapley values to illustrate the top 20 features that contribute to fraud estimation.
引用
收藏
页码:236 / 242
页数:7
相关论文
共 50 条
  • [41] PROBLEMS OF FRAUD CLASSIFICATION IN LENDING
    Ermakova, Olga, V
    TOMSK STATE UNIVERSITY JOURNAL, 2016, (406): : 197 - 201
  • [42] Test-specific funnel plots for healthcare provider profiling leveraging individual- and summary-level information
    Wenbo Wu
    Jonathan P. Kuriakose
    Wenjing Weng
    Richard E. Burney
    Kevin He
    Health Services and Outcomes Research Methodology, 2023, 23 : 45 - 58
  • [43] Test-specific funnel plots for healthcare provider profiling leveraging individual- and summary-level information
    Wu, Wenbo
    Kuriakose, Jonathan P.
    Weng, Wenjing
    Burney, Richard E.
    He, Kevin
    HEALTH SERVICES AND OUTCOMES RESEARCH METHODOLOGY, 2023, 23 (01) : 45 - 58
  • [44] Analyzing Healthcare Provider Centric Networks through secondary use of health claims data
    Sauter, Simone Katja
    Neuhofer, Lisa Maria
    Endel, Gottfried
    Klimek, Peter
    Duftschmid, Georg
    2014 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI), 2014, : 522 - 525
  • [45] PRACTITIONER SUMMARY Improving Fraud Performance with a Different Perspective
    Chui, Lawrence C.
    Curtis, Mary B.
    Pike, Byron J.
    CURRENT ISSUES IN AUDITING, 2023, 17 (02): : P11 - P17
  • [46] Fraud Application Detection Using Summary Risk Score
    Jundhare, Shraddha
    Gajare, Padmaja
    Gadekar, Priyanka
    Aher, Archana
    Wankhade, Shalini
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2017), 2017, : 15 - 17
  • [47] Would Internet Search Engine Provider Fraud in AdWords auction?
    Zhang E
    Wang Yingluo
    ICOSCM 2007 - INTERNATIONAL CONFERENCE ON OPERATIONS AND SUPPLY CHAIN MANAGEMENT IN CHINA, 2007, 1
  • [48] IS JUSTICE COLLAR-BLIND - PUNISHING MEDICAID PROVIDER FRAUD
    TILLMAN, R
    PONTELL, HN
    CRIMINOLOGY, 1992, 30 (04) : 547 - 574
  • [49] Workgroup Summary: Healthcare System
    不详
    BREASTFEEDING MEDICINE, 2009, 4 : S79 - S80
  • [50] Design and development of big data-based model for detecting fraud in healthcare insurance industry
    Mary, A. Jenita
    Claret, S. P. Angelin
    SOFT COMPUTING, 2023, 27 (12) : 8357 - 8369