Classification and Categorical Inputs with Treed Gaussian Process Models

Cited by: 4
Authors
Broderick, Tamara [1]
Gramacy, Robert B. [2]
Affiliations
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Cambridge, Cambridge CB2 1TN, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Treed models; Gaussian process; Bayesian model averaging; Latent variable
DOI
10.1007/s00357-011-9083-y
Chinese Library Classification number
O1 [Mathematics]
Subject classification codes
0701; 070101
Abstract
Recognizing the successes of treed Gaussian process (TGP) models as an interpretable and thrifty model for nonparametric regression, we seek to extend the model to classification. Both treed models and Gaussian processes (GPs) have, separately, enjoyed great success in application to classification problems. An example of the former is Bayesian CART. In the latter, real-valued GP output may be utilized for classification via latent variables, which provide classification rules by means of a softmax function. We formulate a Bayesian model averaging scheme to combine these two models and describe a Monte Carlo method for sampling from the full posterior distribution with joint proposals for the tree topology and the GP parameters corresponding to latent variables at the leaves. We concentrate on efficient sampling of the latent variables, which is important to obtain good mixing in the expanded parameter space. The tree structure is particularly helpful for this task and also for developing an efficient scheme for handling categorical predictors, which commonly arise in classification problems. Our proposed classification TGP (CTGP) methodology is illustrated on a collection of synthetic and real data sets. We assess performance relative to existing methods and thereby show how CTGP is highly flexible, offers tractable inference, produces rules that are easy to interpret, and performs well out of sample.
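The softmax step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names and the latent values below are hypothetical, and the paper's exact parameterization of the latent variables may differ. The sketch only shows how one real-valued latent output per class can be turned into class probabilities and a classification rule.

```python
import math


def softmax(latent):
    """Map one real-valued latent value per class to class probabilities."""
    shift = max(latent)  # subtract the maximum for numerical stability
    weights = [math.exp(z - shift) for z in latent]
    total = sum(weights)
    return [w / total for w in weights]


def classify(latent):
    """Return (index of the most probable class, class probabilities)."""
    probs = softmax(latent)
    label = max(range(len(probs)), key=probs.__getitem__)
    return label, probs


# Hypothetical latent values for three classes at a single input location.
label, probs = classify([0.2, 1.5, -0.7])
```

Here the class with the largest latent value receives the largest probability, so the predicted label is simply the index of the largest latent value; the probabilities quantify how decisive that choice is.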
Pages: 244-270
Number of pages: 27