Nonparametric estimation of distributions with categorical and continuous data

被引:103
|
作者
Li, Q [1 ]
Racine, J
机构
[1] Texas A&M Univ, Dept Econ, College Stn, TX 77843 USA
[2] Syracuse Univ, Dept Econ, Syracuse, NY 13244 USA
关键词
discrete and continuous variables; density estimation; nonparametric smoothing; cross-validation; asymptotic normality;
D O I
10.1016/S0047-259X(02)00025-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we consider the problem of estimating an unknown joint distribution which is defined over mixed discrete and continuous variables. A nonparametric kernel approach is proposed with smoothing parameters obtained from the cross-validated minimization of the estimator's integrated squared error. We derive the rate of convergence of the cross-validated smoothing parameters to their 'benchmark' optimal values, and we also establish the asymptotic normality of the resulting nonparametric kernel density estimator. Monte Carlo simulations illustrate that the proposed estimator performs substantially better than the conventional nonparametric frequency estimator in a range of settings. The simulations also demonstrate that the proposed approach does not suffer from known limitations of the likelihood cross-validation method which breaks down with commonly used kernels when the continuous variables are drawn from fat-tailed distributions. An empirical application demonstrates that the proposed method can yield superior predictions relative to commonly used parametric models. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:266 / 292
页数:27
相关论文
共 50 条
  • [21] Successive Nonparametric Estimation of Conditional Distributions
    J. Antonio Vargas-Guzmán
    Roussos Dimitrakopoulos
    Mathematical Geology, 2003, 35 : 39 - 52
  • [22] Nonparametric estimation of unrestricted distributions and their jumps
    Mynbaev, Kairat
    Martins-Filho, Carlos
    Henderson, Daniel J.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (02): : 638 - 662
  • [23] Successive nonparametric estimation of conditional distributions
    Vargas-Guzmán, JA
    Dimitrakopoulos, R
    MATHEMATICAL GEOLOGY, 2003, 35 (01): : 39 - 52
  • [24] Nonparametric estimation of dark matter distributions
    Wang, Xiao
    Woodroofe, Michael
    Pal, Jayanta
    Walker, Matthew
    Mateo, Mario
    STATISTICAL CHALLENGES IN MODERN ASTRONOMY IV, 2007, 371 : 268 - +
  • [25] Nonparametric covariance estimation in multivariate distributions
    Detlef Plachky
    Andrew L. Rukhin
    Metrika, 1999, 50 : 131 - 136
  • [26] Nonparametric covariance estimation in multivariate distributions
    Plachky, D
    Rukhin, AL
    METRIKA, 1999, 50 (02) : 131 - 136
  • [27] The crs Package: Nonparametric Regression Splines for Continuous and Categorical Predictors
    Nie, Zhenghua
    Racine, Jeffrey S.
    R JOURNAL, 2012, 4 (02): : 48 - 56
  • [28] Nonparametric Benchmark Dose Estimation with Continuous Dose-Response Data
    Lin, Lizhen
    Piegorsch, Walter W.
    Bhattacharya, Rabi
    SCANDINAVIAN JOURNAL OF STATISTICS, 2015, 42 (03) : 713 - 731
  • [29] Neural learning for distributions on categorical data
    Albizuri, FX
    Gonzalez, AI
    Graña, M
    d'Anjou, A
    NEUROCOMPUTING, 2000, 34 : 1 - 9
  • [30] Nonparametric estimation of marginal failure distributions from dually censored automotive data
    Krivtsov, V
    Frankstein, M
    ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2004 PROCEEDINGS, 2004, : 86 - 89