Mixture of experts models for multilevel data: Modeling framework and approximation theory

Cited by: 0
Authors
Fung, Tsz Chai [1]
Tseung, Spark C. [2]
Affiliations
[1] Georgia State Univ, Maurice R Greenberg Sch Risk Sci, 35 Broad St NW, Atlanta, GA 30303 USA
[2] Univ Toronto, Dept Stat Sci, Ontario Power Bldg, 700 Univ Ave, 9th Floor, Toronto, ON M5G 1Z5, Canada
Keywords
Artificial neural network; Crossed and nested random effects; Denseness; Mixed effects models; Universal approximation theorem; MIXED-EFFECTS MODEL; OF-EXPERTS; HIERARCHICAL MIXTURES; REGRESSION-MODELS;
DOI
10.1016/j.neucom.2025.129357
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multilevel data are prevalent in many real-world applications. However, it remains an open research problem to identify and justify a class of models that flexibly capture a wide range of multilevel data. Motivated by the versatility of mixture of experts (MoE) models in fitting regression data, in this article we extend the MoE and study a class of mixed MoE (MMoE) models for multilevel data. Under some regularity conditions, we prove that the MMoE is dense in the space of any continuous mixed effects models in the sense of weak convergence. As a result, the MMoE has the potential to accurately resemble almost all characteristics inherent in multilevel data, including the marginal distributions, dependence structures, regression links, random intercepts and random slopes. In the particular case where the multilevel data are hierarchical, we further show that a nested version of the MMoE universally approximates a broad range of dependence structures of the random effects among different factor levels.
Pages: 14
Related papers
50 in total
  • [1] A Multilevel Mixture-of-Experts Framework for Pedestrian Classification
    Enzweiler, Markus
    Gavrila, Dariu M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (10) : 2967 - 2979
  • [2] A Universal Approximation Theorem for Mixture-of-Experts Models
    Nguyen, Hien D.
    Lloyd-Jones, Luke R.
    McLachlan, Geoffrey J.
    NEURAL COMPUTATION, 2016, 28 (12) : 2585 - 2593
  • [3] Steered Mixture-of-Experts Approximation of Spherical Image Data
    Verhack, Ruben
    Madhu, Nilesh
    Van Wallendael, Glenn
    Lambert, Peter
    Sikora, Thomas
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 256 - 260
  • [4] Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework
    Rees Chang
    Yu-Xiong Wang
    Elif Ertekin
    npj Computational Materials, 8
  • [5] Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework
    Chang, Rees
    Wang, Yu-Xiong
    Ertekin, Elif
    NPJ COMPUTATIONAL MATERIALS, 2022, 8 (01)
  • [6] MIXTURE MODELING METHODS FOR CAUSAL INFERENCE WITH MULTILEVEL DATA
    Kim, Jee-Seon
    Steiner, Peter M.
    Lim, Wen-Chiang
    ADVANCES IN MULTILEVEL MODELING FOR EDUCATIONAL RESEARCH: ADDRESSING PRACTICAL ISSUES FOUND IN REAL-WORLD APPLICATIONS, 2016, : 335 - 359
  • [7] A Multilevel Mixture IRT Framework for Modeling Response Times as Predictors or Indicators of Response Engagement in IRT Models
    Nagy, Gabriel
    Ulitzsch, Esther
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2022, 82 (05) : 845 - 879
  • [8] PROGRESSIVE MODELING OF STEERED MIXTURE-OF-EXPERTS FOR LIGHT FIELD VIDEO APPROXIMATION
    Verhack, Ruben
    Van Wallendael, Glenn
    Courteaux, Martijn
    Lambert, Peter
    Sikora, Thomas
    2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 268 - 272
  • [9] Surrogate modeling approximation using a mixture of experts based on EM joint estimation
    Bettebghor, Dimitri
    Bartoli, Nathalie
    Grihon, Stephane
    Morlier, Joseph
    Samuelides, Manuel
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2011, 43 (02) : 243 - 259
  • [10] Surrogate modeling approximation using a mixture of experts based on EM joint estimation
    Dimitri Bettebghor
    Nathalie Bartoli
    Stéphane Grihon
    Joseph Morlier
    Manuel Samuelides
    Structural and Multidisciplinary Optimization, 2011, 43 : 243 - 259