Approaches for identifying U.S. medicare fraud in provider claims data

被引:0
|
作者
Matthew Herland
Richard A. Bauder
Taghi M. Khoshgoftaar
机构
[1] Florida Atlantic University,
来源
关键词
Medicare; Big data; Machine learning; Fraud detection;
D O I
暂无
中图分类号
学科分类号
摘要
Quality and affordable healthcare is an important aspect in people’s lives, particularly as they age. The rising elderly population in the United States (U.S.), with increasing number of chronic diseases, implies continuing healthcare later in life and the need for programs, such as U.S. Medicare, to help with associated medical expenses. Unfortunately, due to healthcare fraud, these programs are being adversely affected draining resources and reducing quality and accessibility of necessary healthcare services. The detection of fraud is critical in being able to identify and, subsequently, stop these perpetrators. The application of machine learning methods and data mining strategies can be leveraged to improve current fraud detection processes and reduce the resources needed to find and investigate possible fraudulent activities. In this paper, we employ an approach to predict a physician’s expected specialty based on the type and number of procedures performed. From this approach, we generate a baseline model, comparing Logistic Regression and Multinomial Naive Bayes, in order to test and assess several new approaches to improve the detection of U.S. Medicare Part B provider fraud. Our results indicate that our proposed improvement strategies (specialty grouping, class removal, and class isolation), applied to different medical specialties, have mixed results over the selected Logistic Regression baseline model’s fraud detection performance. Through our work, we demonstrate that improvements to current detection methods can be effective in identifying potential fraud.
引用
收藏
页码:2 / 19
页数:17
相关论文
共 50 条
  • [31] Curtailing Identity Fraud: U.S. Passports Get a Digital Facelift
    Nelson, Lee J.
    2002, Cygnus Business Media Inc (17)
  • [32] Identifying Chronic Conditions in Medicare Claims Data: Evaluating the Chronic Condition Data Warehouse Algorithm
    Gorina, Yelena
    Kramarow, Ellen A.
    HEALTH SERVICES RESEARCH, 2011, 46 (05) : 1610 - 1627
  • [33] Inflation and U.S. Public Libraries: Three Approaches for Measuring Inflation in Historical Data
    Baxa, Amanda
    Widdersheim, Michael M.
    JOURNAL OF LIBRARY ADMINISTRATION, 2023, 63 (03) : 339 - 357
  • [34] Information theoretic approaches to income density estimation with an application to the U.S. income data
    Sung Y. Park
    Anil K. Bera
    The Journal of Economic Inequality, 2018, 16 : 461 - 486
  • [35] Identifying Cancer-Directed Surgeries in medicare Claims: A Validation Study Using SEER-Medicare Data
    Lavery, Jessica A.
    Lipitz-Snyderman, Allison
    Li, Diane G.
    Bach, Peter B.
    Panageas, Katherine S.
    JCO CLINICAL CANCER INFORMATICS, 2019, 3 : 1 - 24
  • [36] Geographic variation in pharmacotherapy decisions for U.S. medicare enrollees with diabetes
    Sargen, Michael R.
    Hoffstad, Ole J.
    Wiebe, Douglas J.
    Margolis, David J.
    JOURNAL OF DIABETES AND ITS COMPLICATIONS, 2012, 26 (04) : 301 - 307
  • [37] Prescription Drug Coverage and Medicare Spending among U.S. Elderly
    Baoping Shang
    Dana Goldman
    The Geneva Papers on Risk and Insurance - Issues and Practice, 2010, 35 : 539 - 567
  • [38] Prescription Drug Coverage and Medicare Spending among U.S. Elderly
    Shang, Baoping
    Goldman, Dana
    GENEVA PAPERS ON RISK AND INSURANCE-ISSUES AND PRACTICE, 2010, 35 (04): : 539 - 567
  • [39] Transforming homeland security: U.S. and European approaches
    Alcaro, Riccardo
    INTERNATIONAL SPECTATOR, 2007, 42 (02): : 304 - 305
  • [40] Identifying types of nursing facility stays using medicare claims data: An algorithm and validation
    Yun H.
    Kilgore M.L.
    Curtis J.R.
    Delzell E.
    Gary L.C.
    Saag K.G.
    Morrisey M.A.
    Becker D.
    Matthews R.
    Smith W.
    Locher J.L.
    Health Services and Outcomes Research Methodology, 2010, 10 (1-2) : 100 - 110