Bayesian Mixture Modeling for Multivariate Conditional Distributions

Authors
DeYoreo, Maria [1]
Reiter, Jerome P. [2]
Affiliations
[1] RAND Corp, 1776 Main St, Santa Monica, CA 90401 USA
[2] Duke Univ, Durham, NC 27708 USA
Funding
National Science Foundation (USA)
Keywords
Dirichlet process; Fusion; Imputation; Latent; Missing; Mutual information; Data fusion; Dirichlet; Inference
DOI
10.1007/s42519-020-00109-4
Chinese Library Classification (CLC)
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
We present a Bayesian mixture model for estimating the joint distribution of mixed ordinal, nominal, and continuous data conditional on a set of fixed variables. The modeling strategy is motivated by applied contexts in marketing and the social sciences, in particular data fusion and the analysis of stratified or quota samples. The model uses multivariate normal and categorical mixture kernels for the random variables. It induces dependence between the random and fixed variables through the means of the multivariate normal mixture kernels and via a truncated local Dirichlet process. The latter encourages observations with similar values of the fixed variables to share mixture components. We illustrate the use of the model for missing data imputation, in particular data fusion of two surveys, and for the analysis of stratified or quota samples. The data fusion example suggests that the model can estimate underlying relationships in the data and the distributions of the missing values more accurately than several other approaches, including a mixture model applied to the random and fixed variables jointly. We also use the model to analyze consumers' reading behaviors in a quota sample conducted by the book publisher HarperCollins. In a quota sample, the empirical distribution of some variables is fixed by design, so those variables should not be modeled as random.
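As a rough illustration of what "truncated" means for a Dirichlet process mixture, the sketch below draws mixture weights from a standard stick-breaking prior cut off at a fixed number of components. It is not the local Dirichlet process described in the abstract: it omits the dependence on the fixed variables entirely, and the function name, the concentration parameter `alpha`, and the truncation level `K` are assumptions chosen for illustration only.

```python
import random


def truncated_stick_breaking(alpha, K, rng):
    """Draw K mixture weights from a stick-breaking prior truncated at K.

    Illustrative only: a plain truncated Dirichlet process prior, without
    the covariate-dependent ("local") structure of the paper's model.
    """
    weights = []
    remaining = 1.0  # length of the stick not yet assigned to a component
    for _ in range(K - 1):
        v = rng.betavariate(1.0, alpha)  # fraction of the remaining stick
        weights.append(v * remaining)
        remaining *= 1.0 - v
    weights.append(remaining)  # last component takes whatever is left
    return weights


rng = random.Random(0)
weights = truncated_stick_breaking(alpha=1.0, K=10, rng=rng)
```

Smaller values of `alpha` tend to put most of the mass on the first few components, while larger values spread it more evenly. Assigning the leftover stick to the final component is one common way to make the truncated weights sum to one.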
Pages: 27