Bayesian Mixture Modeling for Multivariate Conditional Distributions

被引:0
|
作者
DeYoreo, Maria [1 ]
Reiter, Jerome P. [2 ]
机构
[1] RAND Corp, 1776 Main St, Santa Monica, CA 90401 USA
[2] Duke Univ, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
Dirichlet process; Fusion; Imputation; Latent; Missing; Mutual information; MUTUAL INFORMATION; DATA FUSION; DIRICHLET; INFERENCE;
D O I
10.1007/s42519-020-00109-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We present a Bayesian mixture model for estimating the joint distribution of mixed ordinal, nominal, and continuous data conditional on a set of fixed variables. The modeling strategy is motivated by applied contexts in marketing and the social sciences, in particular data fusion and the analysis of stratified or quota samples. The model uses multivariate normal and categorical mixture kernels for the random variables. It induces dependence between the random and fixed variables through the means of the multivariate normal mixture kernels and via a truncated local Dirichlet process. The latter encourages observations with similar values of the fixed variables to share mixture components. We illustrate use of the model for missing data imputation, in particular data fusion of two surveys, and for the analysis of stratified or quota samples. The data fusion example suggests that the model can estimate underlying relationships in the data and the distributions of the missing values more accurately than several other approaches, including a mixture model applied to the random and fixed variables jointly. We also use the model to analyze consumers' reading behaviors from a quota sample, i.e., a sample where the empirical distribution of some variables is fixed by design and so should not be modeled as random, conducted by the book publisher HarperCollins.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Bayesian Mixture Modeling for Multivariate Conditional Distributions
    Maria DeYoreo
    Jerome P. Reiter
    Journal of Statistical Theory and Practice, 2020, 14
  • [2] Bayesian modeling of joint and conditional distributions
    Norets, Andriy
    Pelenis, Justinas
    JOURNAL OF ECONOMETRICS, 2012, 168 (02) : 332 - 346
  • [3] A multivariate mixture of Weibull distributions in reliability modeling
    Patra, K
    Dey, DK
    STATISTICS & PROBABILITY LETTERS, 1999, 45 (03) : 225 - 235
  • [4] Modeling conditional distributions of continuous variables in Bayesian networks
    Cobb, BR
    Rumí, R
    Salmerón, A
    ADVANCES IN INTELLIGENT DATA ANALYSIS VI, PROCEEDINGS, 2005, 3646 : 36 - 45
  • [5] Bayesian Inference of Genetic Parameters Based on Conditional Decompositions of Multivariate Normal Distributions
    Hallander, Jon
    Waldmann, Patrik
    Wang, Chunkao
    Sillanpaa, Mikko J.
    GENETICS, 2010, 185 (02) : 645 - 654
  • [6] Robust mixture modeling using multivariate skew t distributions
    Tsung-I Lin
    Statistics and Computing, 2010, 20 : 343 - 356
  • [7] Robust mixture modeling using multivariate skew t distributions
    Lin, Tsung-I
    STATISTICS AND COMPUTING, 2010, 20 (03) : 343 - 356
  • [8] Bayesian Spectral Modeling for Multivariate Spatial Distributions of Elemental Concentrations in Soil
    Terres, Maria A.
    Fuentes, Montserrat
    Hesterberg, Dean
    Polizzotto, Matthew
    BAYESIAN ANALYSIS, 2018, 13 (01): : 1 - 28
  • [9] Multivariate mixture modeling using skew-normal independent distributions
    Barbosa Cabral, Celso Romulo
    Lachos, Victor Hugo
    Prates, Marcos O.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 126 - 142
  • [10] MULTIVARIATE DISTRIBUTIONS WHICH HAVE BETA CONDITIONAL DISTRIBUTIONS
    JAMES, IR
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (351) : 681 - 684