Predicting Days in Hospital Using Health Insurance Claims

被引:21
|
作者
Xie, Yang [1 ]
Schreier, Guenter [2 ]
Chang, David C. W. [1 ]
Neubauer, Sandra [2 ]
Liu, Ying [1 ]
Redmond, Stephen J. [1 ]
Lovell, Nigel H. [1 ]
机构
[1] Univ New S Wales, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[2] AIT Austrian Inst Tech GmbH, A-8020 Graz, Austria
基金
澳大利亚研究理事会;
关键词
Australia; big data; health care; health insurance claims; hospitalizations; predictive modeling; CHARLSON COMORBIDITY INDEX; CARE COSTS; DISEASE MANAGEMENT; RISK ADJUSTMENT;
D O I
10.1109/JBHI.2015.2402692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health-care administrators worldwide are striving to lower the cost of care while improving the quality of care given. Hospitalization is the largest component of health expenditure. Therefore, earlier identification of those at higher risk of being hospitalized would help health-care administrators and health insurers to develop better plans and strategies. In this paper, a method was developed, using large-scale health insurance claims data, to predict the number of hospitalization days in a population. We utilized a regression decision tree algorithm, along with insurance claim data from 242 075 individuals over three years, to provide predictions of number of days in hospital in the third year, based on hospital admissions and procedure claims data. The proposed method performs well in the general population as well as in subpopulations. Results indicate that the proposed model significantly improves predictions over two established baseline methods (predicting a constant number of days for each customer and using the number of days in hospital of the previous year as the forecast for the following year). A reasonable predictive accuracy (AUC = 0.843) was achieved for the whole population. Analysis of two subpopulations-namely elderly persons aged 63 years or older in 2011 and patients hospitalized for at least one day in the previous year-revealed that the medical information (e.g., diagnosis codes) contributed more to predictions for these two subpopulations, in comparison to the population as a whole.
引用
收藏
页码:1224 / 1233
页数:10
相关论文
共 50 条
  • [31] Provider profiling and labeling of fraudulent health insurance claims using Weighted MultiTree
    Lavanya Settipalli
    G. R. Gangadharan
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 3487 - 3508
  • [32] Using health insurance claims data to analyze substance abuse charges and utilization
    Garnick, DW
    Horgan, CM
    Hendricks, AM
    Comstock, C
    MEDICAL CARE RESEARCH AND REVIEW, 1996, 53 (03) : 350 - 368
  • [33] Algorithms for ascertaining keratinocyte carcinomas using health insurance claims and prescription records
    Zhang, T.
    Lee, T. K.
    Lui, H.
    Kunimoto, B.
    Han, C.
    Zhou, Y.
    Kalia, S.
    JOURNAL OF THE EUROPEAN ACADEMY OF DERMATOLOGY AND VENEREOLOGY, 2019, 33 (08) : E275 - E276
  • [34] Identification of Patients Receiving Peritoneal Dialysis Using Health Insurance Claims Data
    Berger, Ariel
    Edelsberg, John
    Inglese, Gary
    Bhattacharyya, Samir
    Oster, Gerry
    CLINICAL THERAPEUTICS, 2009, 31 (06) : 1321 - 1334
  • [35] Using insurance claims and demographic data for surveillance of children's oral health
    Heller, KE
    Eklund, SA
    Burt, BA
    Briskie, DM
    Lawrence, LM
    JOURNAL OF PUBLIC HEALTH DENTISTRY, 2004, 64 (01) : 5 - 13
  • [36] Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    Richter, Aaron
    Herland, Matthew
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 784 - 790
  • [37] Predicting demand for long-term care using Japanese healthcare insurance claims data
    Sato, Jumpei
    Mitsutake, Naohiro
    Kitsuregawa, Masaru
    Ishikawa, Tomoki
    Goda, Kazuo
    ENVIRONMENTAL HEALTH AND PREVENTIVE MEDICINE, 2022, 27
  • [38] Predicting Motor Insurance Claims Using Telematics Data-XGBoost versus Logistic Regression
    Pesantez-Narvaez, Jessica
    Guillen, Montserrat
    Alcaniz, Manuela
    RISKS, 2019, 7 (02)
  • [39] Cholesterol and breast cancer risk: a cohort study using health insurance claims and health checkup databases
    Narii, Nobuhiro
    Zha, Ling
    Komatsu, Masayo
    Kitamura, Tetsuhisa
    Sobue, Tomotaka
    Ogawa, Toshio
    BREAST CANCER RESEARCH AND TREATMENT, 2023, 199 (02) : 315 - 322
  • [40] Health insurance and the demand for medical care: Instrumental variable estimates using health insurer claims data
    Dunn, Abe
    JOURNAL OF HEALTH ECONOMICS, 2016, 48 : 74 - 88