Predicting Days in Hospital Using Health Insurance Claims

被引:21
|
作者
Xie, Yang [1 ]
Schreier, Guenter [2 ]
Chang, David C. W. [1 ]
Neubauer, Sandra [2 ]
Liu, Ying [1 ]
Redmond, Stephen J. [1 ]
Lovell, Nigel H. [1 ]
机构
[1] Univ New S Wales, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[2] AIT Austrian Inst Tech GmbH, A-8020 Graz, Austria
基金
澳大利亚研究理事会;
关键词
Australia; big data; health care; health insurance claims; hospitalizations; predictive modeling; CHARLSON COMORBIDITY INDEX; CARE COSTS; DISEASE MANAGEMENT; RISK ADJUSTMENT;
D O I
10.1109/JBHI.2015.2402692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health-care administrators worldwide are striving to lower the cost of care while improving the quality of care given. Hospitalization is the largest component of health expenditure. Therefore, earlier identification of those at higher risk of being hospitalized would help health-care administrators and health insurers to develop better plans and strategies. In this paper, a method was developed, using large-scale health insurance claims data, to predict the number of hospitalization days in a population. We utilized a regression decision tree algorithm, along with insurance claim data from 242 075 individuals over three years, to provide predictions of number of days in hospital in the third year, based on hospital admissions and procedure claims data. The proposed method performs well in the general population as well as in subpopulations. Results indicate that the proposed model significantly improves predictions over two established baseline methods (predicting a constant number of days for each customer and using the number of days in hospital of the previous year as the forecast for the following year). A reasonable predictive accuracy (AUC = 0.843) was achieved for the whole population. Analysis of two subpopulations-namely elderly persons aged 63 years or older in 2011 and patients hospitalized for at least one day in the previous year-revealed that the medical information (e.g., diagnosis codes) contributed more to predictions for these two subpopulations, in comparison to the population as a whole.
引用
收藏
页码:1224 / 1233
页数:10
相关论文
共 50 条
  • [21] Record linkage of health care insurance claims
    Victor, TW
    Mera, RM
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, 8 (03) : 281 - 288
  • [23] TIPS ON SUBMITTING HEALTH-INSURANCE CLAIMS
    不详
    NORTH CAROLINA MEDICAL JOURNAL, 1981, 42 (05) : 347 - 348
  • [24] Pattern recognition in health insurance claims databases
    Walker, AM
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2001, 10 (05) : 393 - 397
  • [25] Embedding for Anomaly Detection on Health Insurance Claims
    Lu, Jiaqi
    Fung, Benjamin C. M.
    Cheung, William K.
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 459 - 468
  • [26] Institutionalisation of Health Insurance Digital Claims Platform
    Renner-Micah, Anthony
    Effah, John
    Boateng, Richard
    AMCIS 2020 PROCEEDINGS, 2020,
  • [27] PSYCHIATRIST AND HEALTH-INSURANCE CLAIMS REVIEW
    FOWLER, DR
    JOURNAL OF CLINICAL PSYCHIATRY, 1978, 39 (06) : 519 - 522
  • [28] Predicting Hospital Readmission in Medicaid Patients With Diabetes Using Administrative and Claims Data
    Yun, Jaehyeon
    Filardo, Giovanni
    Ahuja, Vishal
    Bowen, Michael E.
    Heitjan, Daniel F.
    AMERICAN JOURNAL OF MANAGED CARE, 2023, 29 (08): : E229 - +
  • [29] Framework for the Identification of Fraudulent Health Insurance Claims using Association Rule Mining
    Kareem, Saba
    Ahmad, Rohiza Binti
    Sarlan, Aliza Binit
    2017 IEEE CONFERENCE ON BIG DATA AND ANALYTICS (ICBDA), 2017, : 99 - 104
  • [30] Provider profiling and labeling of fraudulent health insurance claims using Weighted MultiTree
    Settipalli, Lavanya
    Gangadharan, G. R.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (4) : 3487 - 3508