Bayesian Estimation of Discrete Multivariate Latent Structure Models With Structural Zeros

被引:33
|
作者
Manrique-Vallier, Daniel [1 ]
Reiter, Jerome P. [2 ]
机构
[1] Indiana Univ, Dept Stat, Bloomington, IN 47408 USA
[2] Duke Univ, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
Dirichlet process; Latent class; Multinomial; Disclosure risk; Confidentiality; Contingency table; LEVEL MIXTURE-MODELS; MULTIPLE IMPUTATION; POPULATION-SIZE; CONTINGENCY-TABLES; CATEGORICAL-DATA; PRIORS; DEFINITION; MULTILEVEL; MICRODATA; RISK;
D O I
10.1080/10618600.2013.844700
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In multivariate categorical data, models based on conditional independence assumptions, such as latent class models, offer efficient estimation of complex dependencies. However, Bayesian versions of latent structure models for categorical data typically do not appropriately handle impossible combinations of variables, also known as structural zeros. Allowing nonzero probability for impossible combinations results in inaccurate estimates of joint and conditional probabilities, even for feasible combinations. We present an approach for estimating posterior distributions in Bayesian latent structure models with potentially many structural zeros. The basic idea is to treat the observed data as a truncated sample from an augmented dataset, thereby allowing us to exploit the conditional independence assumptions for computational expediency. As part of the approach, we develop an algorithm for collapsing a large set of structural zero combinations into a much smaller set of disjoint marginal conditions, which speeds up computation. We apply the approach to sample from a semiparametric version of the latent class model with structural zeros in the context of a key issue faced by national statistical agencies seeking to disseminate confidential data to the public: estimating the number of records in a sample that are unique in the population on a set of publicly available categorical variables. The latent class model offers remarkably accurate estimates of population uniqueness, even in the presence of a large number of structural zeros.
引用
收藏
页码:1061 / 1079
页数:19
相关论文
共 50 条
  • [1] Bayesian Estimation of Multivariate Latent Regression Models: Gauss Versus Laplace
    Culpepper, Steven Andrew
    Park, Trevor
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2017, 42 (05) : 591 - 616
  • [2] Bayesian Pyramids: identifiable multilayer discrete latent structure models for discrete data
    Gu, Yuqi
    Dunson, David B.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (02) : 399 - 426
  • [3] Bayesian Regularized Multivariate Generalized Latent Variable Models
    Feng, Xiang-Nan
    Wu, Hao-Tian
    Song, Xin-Yuan
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2017, 24 (03) : 341 - 358
  • [4] Bayesian latent variable models for mixed discrete outcomes
    Dunson, DB
    Herring, AH
    BIOSTATISTICS, 2005, 6 (01) : 11 - 25
  • [5] Bayesian Analysis of Multivariate Latent Curve Models With Nonlinear Longitudinal Latent Effects
    Song, Xin-Yuan
    Lee, Sik-Yum
    Hser, Yih-Ing
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2009, 16 (02) : 245 - 266
  • [6] ESTIMATING MULTIVARIATE LATENT-STRUCTURE MODELS
    Bonhomme, Stephane
    Jochmans, Koen
    Robin, Jean-Marc
    ANNALS OF STATISTICS, 2016, 44 (02): : 540 - 563
  • [7] Bayesian Estimation of Dynamic Discrete Choice Models
    Imai, Susumu
    Jain, Neelam
    Ching, Andrew
    ECONOMETRICA, 2009, 77 (06) : 1865 - 1899
  • [8] DISCRETE APPROXIMATION OF MULTIVARIATE DENSITIES WITH APPLICATION TO BAYESIAN-ESTIMATION
    KATZ, D
    DARGENIO, DZ
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1984, 2 (01) : 27 - 36
  • [9] Efficient Bayesian estimation of multivariate state space models
    Strickland, Chris M.
    Turner, Ian. W.
    Denham, Robert
    Mengersen, Kerrie L.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (12) : 4116 - 4125
  • [10] Bayesian Semiparametric Structural Equation Models with Latent Variables
    Mingan Yang
    David B. Dunson
    Psychometrika, 2010, 75 : 675 - 693