Hierarchical Routing Mixture of Experts

被引:2
|
作者
Zhao, Wenbo [1 ]
Gao, Yang [1 ]
Memon, Shahan Ali [1 ]
Raj, Bhiksha [1 ]
Singh, Rita [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
SUPPORT VECTOR MACHINES; APPROXIMATION; PREDICTION;
D O I
10.1109/ICPR48806.2021.9412813
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In regression tasks, the data distribution is often too complex to be fitted by a single model. In contrast, partition-based models are developed where data is divided and fitted by local models. These models partition the input space and do not leverage the input-output dependency of multimodal-distributed data, and strong local models are needed to make good predictions. Addressing these problems, we propose a binary tree-structured hierarchical routing mixture of experts (HRME) model that has classifiers as non-leaf node experts and simple regression models as leaf node experts. The classifier nodes jointly soft-partition the input-output space based on the natural separateness of multimodal data. This enables simple leaf experts to be effective for prediction. Further, we develop a probabilistic framework for the HRME model and propose a recursive Expectation-Maximization (EM) based algorithm to learn both the tree structure and the expert models. Experiments on a collection of regression tasks validate our method's effectiveness compared to various other regression models.
引用
收藏
页码:7900 / 7906
页数:7
相关论文
共 50 条
  • [31] Hierarchical mixture of experts and diagnostic modeling approach to reduce hydrologic model structural uncertainty
    Moges, Edom
    Demissie, Yonas
    Li, Hong-Yi
    WATER RESOURCES RESEARCH, 2016, 52 (04) : 2551 - 2570
  • [32] Mixture of vector experts
    Henderson, M
    Shawe-Taylor, J
    Zerovnik, J
    ALGORITHMIC LEARNING THEORY, 2005, 3734 : 386 - 398
  • [33] A hierarchical community of experts
    Hinton, GE
    Sallans, B
    Ghahramani, Z
    LEARNING IN GRAPHICAL MODELS, 1998, 89 : 479 - 494
  • [34] Algorithms of hierarchical mixture of opinions of experts in problems of synthesis of information management systems city development
    Pocebneva, Irina
    Belousov, Vadim
    Fateeva, Irina
    Lukinov, Vitaly
    Folomeeva, Tatyana
    INTERNATIONAL SCIENCE CONFERENCE SPBWOSCE-2017 BUSINESS TECHNOLOGIES FOR SUSTAINABLE URBAN DEVELOPMENT, 2018, 170
  • [35] Hierarchical strategy of model partitioning for VLSI-design using an improved mixture of experts approach
    Hering, K
    Haupt, R
    Villmann, T
    TENTH WORKSHOP ON PARALLEL AND DISTRIBUTED SIMULATION - PADS 96, PROCEEDINGS, 1996, : 106 - 113
  • [36] ON HIERARCHICAL ROUTING
    SOUKUP, J
    ROYLE, JC
    JOURNAL OF DIGITAL SYSTEMS, 1981, 5 (03): : 265 - 289
  • [37] Latent Mixture of Discriminative Experts
    Ozkan, Derya
    Morency, Louis-Philippe
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (02) : 326 - 338
  • [38] Mixture of experts: a literature survey
    Masoudnia, Saeed
    Ebrahimpour, Reza
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 42 (02) : 275 - 293
  • [39] Mixture of Experts with Genetic Algorithms
    Cleofas, Laura
    Maria Valdovinos, Rosa
    Juarez, C.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2009, 61 : 331 - 338
  • [40] Statistical mechanics of the mixture of experts
    Kang, KJ
    Oh, JH
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 183 - 189