Classification and Categorical Inputs with Treed Gaussian Process Models

被引:4
|
作者
Broderick, Tamara [1 ]
Gramacy, Robert B. [2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Cambridge, Cambridge CB2 1TN, England
基金
英国工程与自然科学研究理事会;
关键词
Treed models; Gaussian process; Bayesian model averaging; Latent variable;
D O I
10.1007/s00357-011-9083-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Recognizing the successes of treed Gaussian process (TGP) models as an interpretable and thrifty model for nonparametric regression, we seek to extend the model to classification. Both treed models and Gaussian processes (GPs) have, separately, enjoyed great success in application to classification problems. An example of the former is Bayesian CART. In the latter, real-valued GP output may be utilized for classification via latent variables, which provide classification rules by means of a softmax function. We formulate a Bayesian model averaging scheme to combine these two models and describe a Monte Carlo method for sampling from the full posterior distribution with joint proposals for the tree topology and the GP parameters corresponding to latent variables at the leaves. We concentrate on efficient sampling of the latent variables, which is important to obtain good mixing in the expanded parameter space. The tree structure is particularly helpful for this task and also for developing an efficient scheme for handling categorical predictors, which commonly arise in classification problems. Our proposed classification TGP (CTGP) methodology is illustrated on a collection of synthetic and real data sets. We assess performance relative to existing methods and thereby show how CTGP is highly flexible, offers tractable inference, produces rules that are easy to interpret, and performs well out of sample.
引用
收藏
页码:244 / 270
页数:27
相关论文
共 50 条
  • [21] Analysis of computationally demanding models with continuous and categorical inputs
    Storlie, Curtis B.
    Reich, Brian J.
    Helton, Jon C.
    Swiler, Laura P.
    Sallaberry, Cedric J.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2013, 113 : 30 - 41
  • [22] Classification of Streetsigns Using Gaussian Process Latent Variable Models
    Woeber, Wilfried
    Aburaia, Mohamed
    Olaverri-Monreal, Cristina
    2019 8TH IEEE INTERNATIONAL CONFERENCE ON CONNECTED VEHICLES AND EXPO (IIEEE CCVE), 2019,
  • [23] The error bar estimation for the soft classification with Gaussian process models
    Gao, Junbin
    Zhang, Lei
    APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 675 - 684
  • [24] Categorical models for process planning
    Breiner, Spencer
    Jones, Albert
    Subrahmanian, Eswaran
    COMPUTERS IN INDUSTRY, 2019, 112
  • [25] CATEGORICAL MODELS OF PROCESS COOPERATION
    LABELLA, A
    PETTOROSSI, A
    LECTURE NOTES IN COMPUTER SCIENCE, 1986, 240 : 282 - 298
  • [26] Fuzzy treed Gaussian process for nonstationary regression with large-scale datasets
    Yao, Wen
    Wang, Ning
    Li, Xingchen
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [27] Global optimization for mixed categorical-continuous variables based on Gaussian process models with a randomized categorical space exploration step
    Zuniga, Miguel Munoz
    Sinoquet, Delphine
    INFOR, 2020, 58 (02) : 310 - 341
  • [28] Bayesian Treed Models
    Hugh A. Chipman
    Edward I. George
    Robert E. McCulloch
    Machine Learning, 2002, 48 : 299 - 320
  • [29] Bayesian treed models
    Chipman, HA
    George, EI
    McCulloch, RE
    MACHINE LEARNING, 2002, 48 (1-3) : 299 - 320
  • [30] A Gaussian Process Regression Model for Distribution Inputs
    Bachoc, Francois
    Gamboa, Fabrice
    Loubes, Jean-Michel
    Venet, Nil
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (10) : 6620 - 6637