Heavy-tailed longitudinal data modeling using copulas

Cited by: 57
Authors
Sun, Jiafeng [1]
Frees, Edward W. [1]
Rosenberg, Marjorie A. [1]
Affiliation
[1] Univ Wisconsin, Sch Business, Dept Actuarial Sci Risk Management & Insurance, Madison, WI 53706 USA
Source
INSURANCE MATHEMATICS & ECONOMICS | 2008 / Vol. 42 / Issue 2
Funding
National Science Foundation (USA); Agency for Healthcare Research and Quality (USA);
Keywords
healthcare costs; predictive modeling;
DOI
10.1016/j.insmatheco.2007.09.009
Chinese Library Classification number
F [Economics];
Discipline classification code
02;
Abstract
In this paper, we consider "heavy-tailed" data, that is, data where extreme values are likely to occur. Heavy-tailed data have been analyzed using flexible distributions such as the generalized beta of the second kind, the generalized gamma and the Burr. These distributions allow us to handle data with either positive or negative skewness, as well as heavy tails. Moreover, it has been shown that they can also accommodate cross-sectional regression models by allowing functions of explanatory variables to serve as distribution parameters. The objective of this paper is to extend this literature to accommodate longitudinal data, where one observes repeated observations of cross-sectional data. Specifically, we use copulas to model the dependencies over time, and heavy-tailed regression models to represent the marginal distributions. We also introduce model exploration techniques to help us with the initial choice of the copula and a goodness-of-fit test of elliptical copulas for model validation. In a longitudinal data context, we argue that elliptical copulas will be typically preferred to the Archimedean copulas. To illustrate our methods, Wisconsin nursing homes utilization data from 1995 to 2001 are analyzed. These data exhibit long tails and negative skewness and so help us to motivate the need for our new techniques. We find that time and the nursing home facility size as measured through the number of beds and square footage are important predictors of future utilization. Moreover, using our parametric model, we provide not only point predictions but also an entire predictive distribution. (C) 2007 Elsevier B.V. All rights reserved.
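The core construction in the abstract (a copula for dependence over time, heavy-tailed distributions for the margins) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the Gaussian copula with an AR(1) correlation matrix, the Burr XII margins, and every parameter value below are assumptions chosen for illustration, and the function name `simulate_copula_panel` is hypothetical. The seven periods only mirror the 1995 to 2001 span mentioned in the abstract.

```python
import numpy as np
from scipy import stats


def simulate_copula_panel(n_subjects, n_periods, rho, c, d, scale, seed=0):
    """Simulate an (n_subjects, n_periods) panel with Burr XII margins
    linked over time by a Gaussian copula (illustrative assumption)."""
    rng = np.random.default_rng(seed)
    # Assumed AR(1) dependence: corr(t, s) = rho ** |t - s|
    periods = np.arange(n_periods)
    lags = np.abs(np.subtract.outer(periods, periods))
    corr = rho ** lags
    # Correlated latent normals, mapped to uniforms by the normal CDF
    z = rng.multivariate_normal(np.zeros(n_periods), corr, size=n_subjects)
    u = stats.norm.cdf(z)
    # Uniforms mapped to heavy-tailed margins by the Burr XII quantile function
    return stats.burr12.ppf(u, c, d, scale=scale)


# Hypothetical parameter values, not estimates from the paper
panel = simulate_copula_panel(
    n_subjects=2000, n_periods=7, rho=0.8, c=2.0, d=1.0, scale=100.0
)
```

In a regression version of this sketch, `scale` could be replaced by a per-subject array computed from explanatory variables, which is the sense in which the abstract lets functions of covariates serve as distribution parameters.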
Pages: 817-830
Page count: 14
Related papers
50 in total
  • [31] Support Vector Machine with Heavy-tailed Distribution Data
    Kim, Chansoo
    Choi, ByoungSecon
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SIGNAL PROCESSING AND ARTIFICIAL INTELLIGENCE, ASPAI' 2020, 2020, : 197 - 198
  • [32] On Empirical Risk Minimization with Dependent and Heavy-Tailed Data
    Roy, Abhishek
    Balasubramanian, Krishnakumar
    Erdogdu, Murat A.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [33] INFERENCE FOR EXTREMAL REGRESSION WITH DEPENDENT HEAVY-TAILED DATA
    Daouia, Abdelaati
    Stupfler, Gilles
    Usseglio-Carleve, Antoine
    ANNALS OF STATISTICS, 2023, 51 (05): : 2040 - 2066
  • [34] Private least absolute deviations with heavy-tailed data
    Wang, Di
    Xu, Jinhui
    THEORETICAL COMPUTER SCIENCE, 2025, 1030
  • [35] Conditional mixture modelling for heavy-tailed and skewed data
    Dong, Aqi
    Melnykov, Volodymyr
    Wang, Yang
    Zhu, Xuwen
    STAT, 2023, 12 (01):
  • [36] Inference for heavy-tailed data: Applications in insurance and finance
    Peng, Liang
    Qi, Yongcheng
    Inference for Heavy-Tailed Data: Applications in Insurance and Finance, 2017, : 1 - 170
  • [37] Matrix Mittag–Leffler distributions and modeling heavy-tailed risks
    Hansjörg Albrecher
    Martin Bladt
    Mogens Bladt
    Extremes, 2020, 23 : 425 - 450
  • [38] Heavy-Tailed Density Estimation
    Tokdar, Surya T.
    Jiang, Sheng
    Cunningham, Erika L.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (545) : 163 - 175
  • [39] On aggregation for heavy-tailed classes
    Shahar Mendelson
    Probability Theory and Related Fields, 2017, 168 : 641 - 674
  • [40] Heavy-tailed distributions and their applications
    Su, C
    Tang, QH
    PROBABILITY, FINANCE AND INSURANCE, 2004, : 218 - 236