Classification and Categorical Inputs with Treed Gaussian Process Models

被引:4
|
作者
Broderick, Tamara [1 ]
Gramacy, Robert B. [2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Cambridge, Cambridge CB2 1TN, England
基金
英国工程与自然科学研究理事会;
关键词
Treed models; Gaussian process; Bayesian model averaging; Latent variable;
D O I
10.1007/s00357-011-9083-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Recognizing the successes of treed Gaussian process (TGP) models as an interpretable and thrifty model for nonparametric regression, we seek to extend the model to classification. Both treed models and Gaussian processes (GPs) have, separately, enjoyed great success in application to classification problems. An example of the former is Bayesian CART. In the latter, real-valued GP output may be utilized for classification via latent variables, which provide classification rules by means of a softmax function. We formulate a Bayesian model averaging scheme to combine these two models and describe a Monte Carlo method for sampling from the full posterior distribution with joint proposals for the tree topology and the GP parameters corresponding to latent variables at the leaves. We concentrate on efficient sampling of the latent variables, which is important to obtain good mixing in the expanded parameter space. The tree structure is particularly helpful for this task and also for developing an efficient scheme for handling categorical predictors, which commonly arise in classification problems. Our proposed classification TGP (CTGP) methodology is illustrated on a collection of synthetic and real data sets. We assess performance relative to existing methods and thereby show how CTGP is highly flexible, offers tractable inference, produces rules that are easy to interpret, and performs well out of sample.
引用
收藏
页码:244 / 270
页数:27
相关论文
共 50 条
  • [1] Classification and Categorical Inputs with Treed Gaussian Process Models
    Tamara Broderick
    Robert B. Gramacy
    Journal of Classification, 2011, 28 : 244 - 270
  • [2] Treed Gaussian Process Models for Classification
    Broderick, Tamara
    Gramacy, Robert B.
    CLASSIFICATION AS A TOOL FOR RESEARCH, 2010, : 101 - 108
  • [3] Categorical Inputs, Sensitivity Analysis, Optimization and Importance Tempering with tgp Version 2, an R Package for Treed Gaussian Process Models
    Gramacy, Robert B.
    Taddy, Matthew
    JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (06): : 1 - 48
  • [4] Bayesian nonstationary Gaussian process models via treed process convolutions
    Waley W. J. Liang
    Herbert K. H. Lee
    Advances in Data Analysis and Classification, 2019, 13 : 797 - 818
  • [5] Group Kernels for Gaussian Process Metamodels with Categorical Inputs
    Roustant, Olivier
    Padonou, Esperan
    Deville, Yves
    Clement, Alois
    Perrin, Guillaume
    Giorla, Jean
    Wynn, Henry
    SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2020, 8 (02): : 775 - 806
  • [6] Bayesian nonstationary Gaussian process models via treed process convolutions
    Liang, Waley W. J.
    Lee, Herbert K. H.
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 797 - 818
  • [7] Bayesian Treed Gaussian Process Models With an Application to Computer Modeling
    Gramacy, Robert B.
    Lee, Herbert K. H.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (483) : 1119 - 1130
  • [8] Gaussian Process Regression with Categorical Inputs for Predicting the Blood Glucose Level
    Tomczak, Jakub M.
    ADVANCES IN SYSTEMS SCIENCE, ICSS 2016, 2017, 539 : 98 - 108
  • [9] Bayesian treed Gaussian process method for process monitoring
    Wang, Hangzhou
    de Melo, Vinicius Veloso
    26TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING (ESCAPE), PT B, 2016, 38B : 1773 - 1778
  • [10] On NARX models using treed Gaussian processes
    Zhang, T.
    Barthorpe, R. J.
    Worden, K.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING (ISMA2018) / INTERNATIONAL CONFERENCE ON UNCERTAINTY IN STRUCTURAL DYNAMICS (USD2018), 2018, : 2775 - 2782