A structured Dirichlet mixture model for compositional data: inferential and applicative issues

被引:10
|
作者
Migliorati, Sonia [1 ,2 ]
Ongaro, Andrea [1 ,2 ]
Monti, Gianna S. [1 ,2 ]
机构
[1] Univ Milano Bicocca, Dept Econ Management & Stat, Milan, Italy
[2] NeuroMi Milan Ctr Neurosci, Milan, Italy
关键词
Simplex distribution; Dirichlet mixture; Identifiability; Multimodality; EM type algorithms; GENERALIZED LIOUVILLE DISTRIBUTIONS; MAXIMUM-LIKELIHOOD; EM ALGORITHM;
D O I
10.1007/s11222-016-9665-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The flexible Dirichlet (FD) distribution (Ongaro and Migliorati in J. Multvar. Anal. 114: 412-426, 2013) makes it possible to preserve many theoretical properties of the Dirichlet one, without inheriting its lack of flexibility in modeling the various independence concepts appropriate for compositional data, i.e. data representing vectors of proportions. In this paper we tackle the potential of the FD from an inferential and applicative viewpoint. In this regard, the key feature appears to be the special structure defining its Dirichlet mixture representation. This structure determines a simple and clearly interpretable differentiation among mixture components which can capture the main features of a large variety of data sets. Furthermore, it allows a substantially greater flexibility than the Dirichlet, including both unimodality and a varying number of modes. Very importantly, this increased flexibility is obtained without sharing many of the inferential difficulties typical of general mixtures. Indeed, the FD displays the identifiability and likelihood behavior proper to common (non-mixture) models. Moreover, thanks to a novel non random initialization based on the special FD mixture structure, an efficient and sound estimation procedure can be devised which suitably combines EM-types algorithms. Reliable complete-data likelihood-based estimators for standard errors can be provided as well.
引用
收藏
页码:963 / 983
页数:21
相关论文
共 50 条
  • [1] A structured Dirichlet mixture model for compositional data: inferential and applicative issues
    Sonia Migliorati
    Andrea Ongaro
    Gianna S. Monti
    Statistics and Computing, 2017, 27 : 963 - 983
  • [2] Clustering compositional data using Dirichlet mixture model
    Pal, Samyajoy
    Heumann, Christian
    PLOS ONE, 2022, 17 (05):
  • [3] Posterior convergence rate of a class of Dirichlet process mixture model for compositional data
    Barrientos, Andres F.
    Jara, Alejandro
    Wehrhahn, Claudia
    STATISTICS & PROBABILITY LETTERS, 2017, 120 : 45 - 51
  • [4] A Dirichlet Regression Model for Compositional Data with Zeros
    Tsagris M.
    Stewart C.
    Lobachevskii Journal of Mathematics, 2018, 39 (3) : 398 - 412
  • [5] Compositional Adjustment of Dirichlet Mixture Priors
    Ye, Xugang
    Yu, Yi-Kuo
    Altschul, Stephen F.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2010, 17 (12) : 1607 - 1620
  • [6] A Dirichlet Process Mixture Model for Spherical Data
    Straub, Julian
    Chang, Jason
    Freifeld, Oren
    Fisher, John W., III
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 930 - 938
  • [7] THE APPLICATIVE DATA MODEL
    HELD, JP
    CARLIS, JV
    INFORMATION SCIENCES, 1989, 49 (1-3) : 249 - 283
  • [8] REDUCING TYPES IN APPLICATIVE LANGUAGES WITH STRUCTURED DATA
    ASTESIANO, E
    COSTA, G
    LECTURE NOTES IN COMPUTER SCIENCE, 1981, 107 : 210 - 217
  • [9] A least squares algorithm for a mixture model for compositional data
    Mooijaart, A
    van der Heijden, PG
    van der Ark, LA
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1999, 30 (04) : 359 - 379
  • [10] Least squares algorithm for a mixture model for compositional data
    Mooijaart, Ab
    van der Heijden, Peter G.M.
    der Ark, L.Andries van
    Computational Statistics and Data Analysis, 1999, 30 (04): : 359 - 379